About Openflow Connector for SQL Server¶
Note
This connector is subject to the Snowflake Connector Terms.
This topic describes the basic concepts of Openflow Connector for SQL Server, its workflow and limitations.
The Openflow Connector for SQL Server connects a SQL Server database instance to Snowflake and replicates data from selected tables in near real-time or on a specified schedule.
Use this connector if you’re looking to do the following:
CDC replication of Microsoft SQL Server data with Snowflake for comprehensive, centralized reporting
Workflow¶
A SQL Server database administrator performs the following tasks:
Configures SQL Server replication settings and enables change tracking on the database and tables.
Creates credentials for the connector.
(Optional) Provides the SSL certificate.
A Snowflake account administrator performs the following tasks:
Creates a service user for the connector, a warehouse for the connector, and a destination database to replicate into.
Installs the connector.
Specifies the required parameters for the connector flow definition.
Runs the flow. The connector does the following when run in Openflow:
Creates a schema for journal tables.
Creates the schemas and destination tables matching the source tables configured for replication.
Starts replication following the table replication lifecycle as described in How tables are replicated.
How tables are replicated¶
The tables are replicated in the following stages:
Schema introspection: The connector discovers the columns in the source table, including the column names and types, then validates them against Snowflake’s and the connector’s Limitations. Validation failures cause this stage to fail, and the cycle completes. After successful completion of this stage, the connector creates an empty destination table.
Snapshot load: The connector copies all data available in the source table into the destination table. If this stage fails, then no more data is replicated. After successful completion, the data from the source table is available in the destination table.
Incremental load: The connector tracks changes in the source table and applies those changes to the destination table. This process continues until the table is removed from replication. Failure at this stage permanently stops replication of the source table, until the issue is resolved.
Note
Interim failures (such as connection errors) do not prevent the table from being replicated. However, permanent failures (such as unsupported data types) do prevent the table from being replicated. If a permanent failure prevents a table from being replicated, remove the table from the list of tables to be replicated. After you address the problem that caused the failure, you can add the table back to the list of tables to be replicated.
Supported SQL Server versions¶
The following table lists the tested and officially supported SQL Server versions:
Platform |
Service/Version |
Edition/Tier |
Supported |
---|---|---|---|
On-premises |
Developer, Enterprise, Standard |
âś” Supported |
|
Microsoft SQL Server 2019 |
Developer, Enterprise, Standard |
âś” Supported |
|
Microsoft SQL Server 2017 |
Developer, Enterprise, Standard |
âś” Supported |
|
Microsoft SQL Server 2016 |
Developer, Enterprise, Standard |
âś” Supported |
|
Microsoft SQL Server 2014 |
All |
Not tested |
|
Microsoft SQL Server 2012 |
All |
Not tested |
|
Azure |
All instance types |
âś” Supported |
|
All instance types |
âś” Supported |
||
SQL Server on Azure VM |
All |
Not tested |
|
AWS |
All instance types |
âś” Supported |
|
SQL Server for Amazon EC2 |
All |
Not tested |
|
Google Cloud |
Google Cloud SQL for SQL Server |
All |
Not tested |
Openflow requirements¶
The runtime size must be at least Medium. Use a bigger runtime when replicating large data volumes, especially when row sizes are large.
The connector does not support multi-node Openflow runtimes. Configure the runtime for this connector with Min nodes and Max nodes set to
1
.
Limitations¶
The connector supports only username and password authentication with SQL Server.
The connector only replicates tables with data types that are supported by Snowflake. For a list of these data types, see Summary of data types.
The connector requires every replicated table to have a primary key.
The connector does not update existing records in the Snowflake database when a new NOT NULL column with a default value is added to the source database.
The connector does not update existing records in the Snowflake database when a new column is added to the included list in the Column Filter JSON.
After deleting the column in the source database and adding it back with the same name, additional deletes will cause errors.
After including a column in Column Filter JSON and excluding it, additional include attempts will cause errors.
The connector supports source table schema changes with the exception of changing primary key definitions, changing the precision, or the scale of a numeric column.
The connector does not support the truncate table operation.
Note
Limitations affecting certain table columns can be bypassed by excluding these specific columns from replication.