Openflow Connector for MongoDB: Iceberg table destinations¶
Note
This connector is subject to the Snowflake Connector Terms.
The Openflow Connector for MongoDB supports writing to Snowflake-managed Apache Apache Iceberg™ tables as an opt-in
destination format. Iceberg v2 and v3 are both supported. Setting Table Storage Format = ICEBERG
in the connector’s parameter context is the only change required. All Iceberg storage settings
(external volume, catalog, version, and serialization policy) are inherited from the Snowflake
destination database defaults.
Existing connectors using standard tables aren’t affected.
Prerequisites¶
- Openflow runtime: An existing runtime to host the connector (minimum size Medium).
- MongoDB source configured for replication: A replica set or sharded cluster with an oplog
of sufficient size, and a user with
readAnyDatabaseprivileges. Standalone instances aren’t supported. For details, see Set up the Openflow Connector for MongoDB. - Snowflake external volume: An external volume configured for Iceberg storage. See CREATE EXTERNAL VOLUME.
- Snowflake destination database: An existing database configured with Iceberg parameters (next section).
Step 1: Configure the Snowflake destination database¶
Set the Iceberg defaults on the destination database. The connector reads these defaults at runtime, so no per-connector Iceberg configuration is needed beyond Table Storage Format:
If you need to create a new database for Iceberg destinations, you can set the Iceberg properties at creation time:
To configure an existing database, use ALTER DATABASE:
| Parameter | Required | Notes |
|---|---|---|
| EXTERNAL_VOLUME | Yes | The external volume for Iceberg file storage. |
| ICEBERG_VERSION_DEFAULT | Yes |
|
| STORAGE_SERIALIZATION_POLICY | Yes |
|
Note
CATALOG = 'SNOWFLAKE' is set automatically by the connector on each CREATE ICEBERG TABLE
statement. Don’t set it at the database level.
The base location for each table is auto-derived using the
flat layout:
STORAGE_BASE_URL/database/schema/table_name.randomId/[data | metadata]/.
No user configuration is needed.
Step 2: Set Table Storage Format in the connector’s parameter context¶
Set the Table Storage Format parameter to ICEBERG in the connector’s destination parameter context.
The default is STANDARD.
For the full connector creation and configuration workflow, see Set up the Openflow Connector for MongoDB.
Step 3: Start and verify¶
Start the connector as usual. After the initial snapshot completes, verify the destination tables are Iceberg:
Known limitations¶
- No per-field type mapping: MongoDB stores entire documents as a single
datacolumn. Individual BSON fields aren’t mapped to separate Iceberg columns. All BSON types are serialized into thevariant(v3) orstring(v2)datacolumn.
Table structure¶
The connector maps each MongoDB collection to a Snowflake Iceberg table with the following columns:
| Column | Iceberg v3 type | Iceberg v2 type | Description |
|---|---|---|---|
| id | string | string | The MongoDB document _id field. |
| data | variant | string | The full document payload serialized as JSON. |
Switching table storage format or Iceberg version¶
Switching between Standard and Iceberg, or between Iceberg v2 and v3, requires creating a new connector with a new destination database configured for the intended format and version. To reuse the same destination database, you need a full connector reset: drop all schemas in the destination database and then adjust the connector configuration.
- Existing destination tables aren’t migrated. The new connector performs a fresh snapshot.
- You can’t change Table Storage Format on a running connector. The table storage format is baked into every destination and journal table created and can only be changed after the connector has been stopped and after destination tables have been dropped.