Data migration and data validation¶
Data migration copies rows from a source system into Snowflake tables (historical loads, partitions, and optional incremental runs). Data validation compares what landed in Snowflake with the source (schema, metrics, and optional row-level checks). SnowConvert AI runs both through the same Orchestrator and Workers model, driven by the Snowflake AIM Migration Agent in Cortex Code or the scai data … CLI from a migration project.
Note
Cloud data migration and validation always use Snowflake as the target. PostgreSQL (and other dialects) are supported as sources you migrate from, not as destinations you load into through these workflows.
Data migration and validation in the Snowflake AIM Migration Agent¶
The agent’s guided workflow covers deployment, data movement, and checks after SQL conversion:
| Capability | What it means |
|---|---|
| Migrate data | Copy rows from source tables into Snowflake with automatic row-count validation as part of the flow |
| Validate data | Compare schema, metrics, and optionally rows between source and Snowflake after migration |
migrate-objects | Deploys objects wave by wave; for tables, deploys to Snowflake then migrates data from the source |
Procedure-level assurance uses baseline-capture and the migrate-objects test loop (two-sided validation of function and procedure output against captured baselines). That complements table-level migration and validation above.
Example prompt (from the Snowflake AIM Migration Agent topic):
Supported source systems (cloud)¶
Not every source supports every command. Use this table for cloud workflows (scai data migrate … and scai data validate …). Legacy commands (migrate-legacy, validate-legacy) have a narrower dialect list; see the technical pages below.
| Source | Cloud data migration | Cloud data validation | Worker connectivity highlights |
|---|---|---|---|
| SQL Server | Yes | Yes | ODBC (or BCP when enabled) |
| Amazon Redshift | Yes | Yes | ODBC; optional UNLOAD to S3 |
| Oracle | Yes | Yes | Oracle Instant Client + ODBC |
| PostgreSQL | Yes | Yes | Npgsql (no ODBC); set ssl_mode in Worker TOML |
| Teradata | Yes | Yes | teradatasql or ODBC; regular, write_nos, or tpt extraction for migration |
For the full capability matrix (code extraction, deploy, testing, and more), see Supported source systems in the Snowflake AIM Migration Agent.
Quick start by source¶
After scai init, scai code extract, scai code convert, and scai code deploy, use the connection and workflow commands for your dialect.
| Source | Register source | Cloud migration | Cloud validation |
|---|---|---|---|
| SQL Server | scai connection add-sql-server | scai data migrate start | scai data validate start |
| Amazon Redshift | scai connection add-redshift | scai data migrate start | scai data validate start |
| Teradata | scai connection add-teradata | scai data migrate start | scai data validate start |
| Oracle | scai connection add-oracle | scai data migrate start | scai data validate start |
| PostgreSQL | scai connection add-postgresql | scai data migrate start | scai data validate start |
Workflow configuration is YAML (for example .scai/config/data-migration-config.yaml and data-validation-config.yaml). scai data migrate generate-config and scai data validate generate-config set source_platform from your project dialect.
Per-source platform details (drivers, extraction strategies, TOML examples, and validation YAML samples) live in the tabs on the technical pages linked in the next section.
Technical documentation¶
Use these pages for architecture, prerequisites, scai data worker / scai data orchestrator setup, workflow reference, monitoring, and per-source Worker TOML.
| Topic | Page |
|---|---|
| Data migration [Preview] | SnowConvert AI: Data migration |
| Data validation [Preview] | SnowConvert AI: Data validation |
| Migration skill (agent) | Snowflake AIM Migration Agent |
| CLI reference | SCAI command reference |
On the data migration page, open Source-platform specifics and Source connection configuration examples for SQL Server, Amazon Redshift, Teradata, Oracle, and PostgreSQL tabs. The data validation page has matching tabs for validation behavior and YAML examples.
Related content¶
- Hybrid tables (common migration target type)
- Snowpark Container Services overview (optional Orchestrator/Worker hosting)
- Programmatic Access Tokens (recommended for Snowflake connections)