Snowpark Migration Accelerator: Configuração de conversão

Ao iniciar o Snowpark Migration Accelerator pela primeira vez (SMA), você precisa criar um novo projeto ou abrir um já existente. Cada projeto pode armazenar várias execuções do SMA para as fases de Avaliação e Conversão. Depois de concluir a fase de Avaliação, você precisará configurar o projeto para a fase de Conversão.

Página de configuração da conversão

Durante o processo de conversão, você tem várias opções de configuração disponíveis, embora a maioria das configurações padrão deva funcionar bem na maioria dos casos.

On the Conversion settings page, choose whether to run the conversion using Default Settings or to select Customize settings to configure advanced options.

Conversion settings page

If you select Customize settings, SMA opens a Conversion settings dialog where you can review and update settings and then click Save settings.

Conversion settings dialog

Conversion Settings

With the following settings from the user interface, you can more finely control how the SMA performs conversion.

  • Pandas

    Convert Pandas API to Snowpark API - Specifies to automatically convert Pandas code to the Snowpark equivalent Pandas API (Snowpark Pandas). When enabled, the tool transforms any Pandas operations it finds in your code into their Snowpark counterparts.

  • DBX

    Convert DBX notebooks to Snowflake notebooks - Specifies to convert the .dbc into Jupyter files in a new folder with the .dbc name.

    Nota

    When exporting notebooks, consider exporting them as Databricks, rather than Jupyter. When Jupyter files contain different sources than Python, SMA behavior may be unexpected.

  • Checkpoints

    • Identify and collect checkpoints - Activates the feature.

    • Collect checkpoints as active - Specifies to execute the collected checkpoint in VS Code when running the workload.

    • Collect user-defined functions returning data frame type - Specifies to validate that dataframes should be collected if the user has their own functions that return DataFrames.

    • Mode - Specifies the mode type to validate (Schema or DataFrame).

    • Sample - Specifies the sampling percentage of each DataFrame to validate.

    • Relevant PySpark functions to collect - Specifies the PySpark packages to collect (by default, all of them are checked). You can also add more packages by adding the package’s full name.

Configuração concluída

Once your setup is complete, click the Continue button. This action will initiate the SMA Conversion processes. A progress screen will display the current status of your conversion.

Conversão concluída

After the conversion finishes, SMA automatically displays the Conversion Results screen.