Snowpark Connect for Spark release notes for 2025

Snowflake uses semantic versioning for Snowpark Connect for Spark updates.

For documentation, see Run Apache Spark™ workloads on Snowflake with Snowpark Connect for Spark and Run Spark batch workloads from Snowpark Submit.
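Because releases follow semantic versioning, the version strings on this page can be compared component by component rather than as text. The following is a minimal sketch in plain Python. The helper name parse_version is hypothetical and not part of any Snowflake package, and the sketch assumes plain MAJOR.MINOR.PATCH strings with no pre-release suffix.

```python
def parse_version(text: str) -> tuple:
    """Split a MAJOR.MINOR.PATCH string into a tuple of integers (hypothetical helper)."""
    major, minor, patch = (int(part) for part in text.split("."))
    return major, minor, patch


# Version numbers listed on this page, in no particular order.
releases = ["0.33.0", "1.0.1", "0.31.0", "1.2.0", "1.0.0", "0.32.0"]

# Integer tuples compare component by component, which plain string
# comparison does not do.
ordered = sorted(releases, key=parse_version)
latest = ordered[-1]
```

Sorting on the parsed tuple keeps the ordering numeric even when a component reaches two or more digits.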

Version 1.2.0 (November 17, 2025)

Snowpark Connect for Spark

New features

  • Relax version requirements for grpcio and aiobotocore.

Improvements

  • Specify dependencies version in meta.yaml.

  • Build compiled and architecture-specific conda package.

  • Ensure that no CloudPickleSerializer.loads calls are made in TCM.

  • Include OSS SQL tests that start with the WITH clause.

  • Do not upload Spark jars when running the server for pyt.

  • Update internal queries count.

Bug fixes

  • Fix tests for TCM.

  • Fix CSV column name discrepancy from Spark.

  • Use type cache for empty frames.

  • Resolve Windows OSS runner general issues.

Snowpark Submit

Improvements

  • Generate unique workload names.

Bug fixes

  • Fix staged file reading.

Version 1.0.1 (November 3, 2025)

Note

With the release of this version, version 0.24 and previous versions are deprecated.

Snowpark Connect for Spark

New features

  • Add parameter for view creation strategies.

  • Support string <-> year month interval.

  • Support multiple pivot columns and aliases for pivot values in Spark SQL.

  • Integrate OpenTelemetry span and traces.

Improvements

None.

Bug fixes

  • Add a trailing slash for remove command.

  • Fix invalid GROUP BY issue with aggregation functions and nullary functions.

  • Fix notebooks exceeding the gRPC maximum message size.

  • Fix temporary view creation with colliding names.

  • Fix array_size with a null argument.

  • Fix for $.0 JSON array access in get_json_object function.

  • Fix self ANTI and SEMI LEFT joins.

  • Handle different types in SQL function range.

  • Fix DESCRIBE for temporary views.

Version 1.0.0 (October 28, 2025)

Snowpark Connect for Spark

New features

  • Add rowToInferSchema for CSV reading.

  • Support INSERT INTO with CTE SQL command.

  • I/O changes to add _SUCCESS file generation and metadata file filtering.

  • Support installing Snowpark Connect for Spark in the Snowpark Submit client container.
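The I/O change in the list above follows the Hadoop-style convention in which a completed write leaves an empty _SUCCESS marker in the output directory, and readers skip marker and metadata files. The following is a minimal sketch of that convention in plain Python, not the Snowpark Connect for Spark implementation. The function names are hypothetical, and treating any file name that starts with _ or . as metadata is an assumption borrowed from common Spark and Hadoop behavior.

```python
import tempfile
from pathlib import Path


def write_success_marker(output_dir: Path) -> Path:
    """Create an empty _SUCCESS file to signal that a write finished (hypothetical helper)."""
    marker = output_dir / "_SUCCESS"
    marker.touch()
    return marker


def list_data_files(output_dir: Path) -> list:
    """Return data file names, skipping names that start with '_' or '.' (assumed rule)."""
    return sorted(
        p.name
        for p in output_dir.iterdir()
        if p.is_file() and not p.name.startswith(("_", "."))
    )


with tempfile.TemporaryDirectory() as tmp:
    out = Path(tmp)
    # Simulate the output of a finished write: two data files and one checksum file.
    (out / "part-00000.parquet").write_bytes(b"")
    (out / "part-00001.parquet").write_bytes(b"")
    (out / ".part-00000.parquet.crc").write_bytes(b"")
    write_success_marker(out)

    marker_exists = (out / "_SUCCESS").exists()
    data_files = list_data_files(out)
```

With this filtering, a reader that scans the directory sees only the two part files and ignores both the marker and the checksum file.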

Improvements

None.

Bug fixes

  • Fix _SUCCESS path update.

  • Throw an error when a remove operation fails.

  • Support integral-type inputs in the sequence function.

  • Fix types in empty CreateTempViewUsing.

  • Fix Parquet file repartitioning on write.

  • Resolve aliases in ORDER BY clause correctly.

  • Remove scope temp session parameter.

  • Fix multiple self joins with a join condition.

  • Fix column name resolution in pivot.

  • Make the SQL parser aware of the session time zone.

  • Fix interval type coercion with other types.

  • Fix HAVING with nested CTEs.

  • Improve qualified name resolution in Spark.

Version 0.33.0 (October 10, 2025)

Snowpark Connect for Spark

New features

  • Add script to run on the output from Git action for merging SQLs.

  • Add --rebuild-whl parameter to notebook test runner.

  • Add support for both qualifiers after join.

Improvements

None.

Bug fixes

  • Support escape parameter in SQL LIKE commands.

  • Fix overwrite bug in partitions.

  • Validate column count on INSERT.

  • Fix incompatibility for pow with NaN.

  • Fix cross JOIN with a condition.

  • Fix column attribution logic in nested queries.

  • Update error message for interval test.

  • String type coercion in set operation UNION and EXCEPT, coerce NUMERIC, DATE, DATETIME to STRING.

  • Correctly resolve Snowpark columns after a full outer self JOIN.

  • Expression in aggregate function might be zero improvement.

  • Update: Revert "[SCOS GA BUG] string type coercion in set opera"

  • DataFrame union of decimal type columns now widens as necessary.

  • String type coercion in set operation UNION and EXCEPT, coerce NUMERIC, DATE, DATETIME to STRING (part1).

  • Fix object-does-not-exist issue in TCM.

  • Fix to_binary(x, 'hex') where x has an odd number of letters and digits.

  • Fix joins with empty tables.

  • Fix HAVING clause to prioritize grouping columns over aggregate aliases with same name.
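The to_binary(x, 'hex') fix in the list above concerns hex strings with an odd number of characters. Apache Spark's unhex is understood to treat such a string as if it had one leading zero, so 'ABC' decodes to the two bytes 0x0A 0xBC. The sketch below mirrors that rule in plain Python for illustration only. The helper unhex_like_spark is hypothetical, and invalid characters simply raise ValueError here, which is a simplification rather than a statement about Spark or Snowflake error handling.

```python
def unhex_like_spark(text: str) -> bytes:
    """Decode a hex string, left-padding odd-length input with one '0' (hypothetical helper)."""
    if len(text) % 2 == 1:
        # Assumed rule: the first character becomes the low nibble of the first byte.
        text = "0" + text
    return bytes.fromhex(text)


odd_length = unhex_like_spark("ABC")
even_length = unhex_like_spark("0ABC")
```

Python's own bytes.fromhex rejects odd-length input, which is why the explicit padding step is needed in this sketch.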

Version 0.32.0 (October 17, 2025)

Snowpark Connect for Spark

New features

  • Support for RepairTable.

  • Make jdk4py an optional dependency of Snowpark Connect for Spark to simplify configuring Java home for end users.

  • Support more interval type cases.

Improvements

None.

Bug fixes

  • Fix JOIN issues by refactoring qualifiers.

  • Fix percentile_cont to allow filter and sort order expressions.

  • Fix histogram_numeric UDAF.

  • Fix the COUNT function when called with multiple arguments.
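For context on the COUNT fix listed above: in Spark SQL, count(expr1, expr2, ...) returns the number of rows for which all supplied expressions are non-null. The following is a minimal plain-Python sketch of that semantics, with rows modeled as tuples and None standing in for NULL. The name count_all_non_null is hypothetical and used only for illustration.

```python
def count_all_non_null(rows, *column_indexes):
    """Count rows in which every selected column is non-null (hypothetical helper)."""
    return sum(
        1
        for row in rows
        if all(row[i] is not None for i in column_indexes)
    )


rows = [(1, "a"), (2, None), (None, "c"), (4, "d")]

# Only the first and last rows have both columns populated.
both_columns = count_all_non_null(rows, 0, 1)

# Three rows have a non-null first column.
first_column = count_all_non_null(rows, 0)
```

The multi-argument form is therefore not a sum of per-column counts: a single NULL in any argument excludes the whole row.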

Version 0.31.0 (October 9, 2025)

Snowpark Connect for Spark

New features

  • Add support for expressions in the GROUP BY clause when the clause is explicitly selected.

  • Add error codes to error messages for better troubleshooting.

Improvements

None.

Bug fixes

  • Fix the unsupported cast issue in window functions.