DataFrame¶
Classes
  | 
Represents a lazily-evaluated relational dataset that contains a collection of   | 
  | 
Provides functions for handling missing values in a   | 
  | 
Provides computed statistical functions for DataFrames.  | 
  | 
Provides data analytics functions for DataFrames.  | 
  | 
Provides AI-powered functions for a   | 
Methods
  | 
Aggregate the data in the DataFrame.  | 
  | 
For a specified numeric column and a list of desired quantiles, returns an approximate value for the column at each of the desired quantiles.  | 
  | 
For a specified numeric column and a list of desired quantiles, returns an approximate value for the column at each of the desired quantiles.  | 
  | 
Caches the content of this DataFrame to create a new cached Table DataFrame.  | 
  | 
Returns a reference to a column in the DataFrame.  | 
  | 
Returns a new DataFrame with only the columns whose names match the specified pattern using case-insensitive ILIKE matching (similar to SELECT * ILIKE 'pattern' in SQL).  | 
Executes the query representing this DataFrame and returns the result as a list of   | 
|
  | 
Executes the query representing this DataFrame asynchronously and returns: class:AsyncJob.  | 
  | 
Executes a COPY INTO <table> command to load data from files in a stage location into a specified table.  | 
  | 
Calculates the correlation coefficient for non-null pairs in two numeric columns.  | 
Executes the query representing this DataFrame and returns the number of rows in the result (similar to the COUNT function in SQL).  | 
|
  | 
Calculates the sample covariance for non-null pairs in two numeric columns.  | 
Creates a dynamic table that captures the computation expressed by this DataFrame.  | 
|
  | 
Creates or replace a temporary view that returns the same results as this DataFrame.  | 
  | 
Creates a view that captures the computation expressed by this DataFrame.  | 
  | 
Creates a temporary view that returns the same results as this DataFrame.  | 
Creates or replace a temporary view that returns the same results as this DataFrame.  | 
|
  | 
Creates a view that captures the computation expressed by this DataFrame.  | 
  | 
Creates a temporary view that returns the same results as this DataFrame.  | 
  | 
Performs a cross join, which returns the Cartesian product of the current   | 
  | 
Performs a cross join, which returns the Cartesian product of the current   | 
  | 
Computes a pair-wise frequency table (a   | 
  | 
Performs a SQL GROUP BY CUBE.  | 
  | 
Computes basic statistics for numeric columns, which includes   | 
Returns a new DataFrame that contains only the rows with distinct values from the current DataFrame.  | 
|
  | 
Returns a new DataFrame that excludes the columns with the specified names from the output.  | 
  | 
Creates a new DataFrame by removing duplicated rows on given subset of columns.  | 
  | 
Creates a new DataFrame by removing duplicated rows on given subset of columns.  | 
  | 
Returns a new DataFrame that excludes all rows containing fewer than a specified number of non-null and non-NaN values in the specified columns.  | 
  | 
Returns a new DataFrame that contains all the rows from the current DataFrame except for the rows that also appear in the   | 
Prints the list of queries that will be executed to evaluate this DataFrame.  | 
|
  | 
Returns a new DataFrame that replaces all null and NaN values in the specified columns with the values provided.  | 
  | 
Filters rows based on the specified conditional expression (similar to WHERE in SQL).  | 
Executes the query representing this DataFrame and returns the first   | 
|
  | 
Flattens (explodes) compound values into multiple rows.  | 
  | 
Groups rows by the columns specified by expressions (similar to GROUP BY in SQL).  | 
  | 
Groups rows by the columns specified by expressions (similar to GROUP BY in SQL).  | 
  | 
Performs a SQL GROUP BY GROUPING SETS.  | 
  | 
Returns a new DataFrame that contains the intersection of rows from the current DataFrame and another DataFrame (  | 
  | 
Performs a join of the specified type (  | 
  | 
Lateral joins the current DataFrame with the output of the specified table function.  | 
  | 
Returns a new DataFrame that contains at most   | 
  | 
Returns a new DataFrame that contains all the rows from the current DataFrame except for the rows that also appear in the   | 
  | 
Performs a natural join of the specified type (  | 
  | 
Sorts a DataFrame by the specified expressions (similar to ORDER BY in SQL).  | 
  | 
Sorts a DataFrame by the specified expressions (similar to ORDER BY in SQL).  | 
  | 
Rotates this DataFrame by turning the unique values from one column in the input expression into multiple columns and aggregating results where required on any remaining column values.  | 
  | 
Prints the schema of a dataframe in tree format.  | 
  | 
Prints the schema of a dataframe in tree format.  | 
  | 
Randomly splits the current DataFrame into separate DataFrames, using the specified weights.  | 
  | 
Randomly splits the current DataFrame into separate DataFrames, using the specified weights.  | 
  | 
Returns a DataFrame with the specified column   | 
  | 
Returns a new DataFrame that replaces values in the specified columns.  | 
  | 
Performs a SQL GROUP BY ROLLUP.  | 
  | 
Samples rows based on either the number of rows to be returned or a percentage of rows to be returned.  | 
  | 
Returns a DataFrame containing a stratified sample without replacement, based on a   | 
  | 
Returns a DataFrame containing a stratified sample without replacement, based on a   | 
  | 
Returns a new DataFrame with the specified Column expressions as output (similar to SELECT in SQL).  | 
  | 
Projects a set of SQL expressions and returns a new   | 
  | 
Projects a set of SQL expressions and returns a new   | 
  | 
Evaluates this DataFrame and prints out the first   | 
  | 
Sorts a DataFrame by the specified expressions (similar to ORDER BY in SQL).  | 
  | 
Returns a new DataFrame that contains all the rows from the current DataFrame except for the rows that also appear in the   | 
  | 
Executes the query representing this DataFrame and returns the first   | 
  | 
Creates a new DataFrame containing columns with the specified names.  | 
  | 
Executes the query representing this DataFrame and returns an iterator of   | 
  | 
Executes the query representing this DataFrame and returns the result as a pandas DataFrame.  | 
  | 
Creates a new DataFrame containing columns with the specified names.  | 
Executes the query representing this DataFrame and returns an iterator of   | 
|
Executes the query representing this DataFrame and returns the result as a pandas DataFrame.  | 
|
Executes the query representing this DataFrame and returns an iterator of pandas dataframes (containing a subset of rows) that you can use to retrieve the results.  | 
|
  | 
Convert the Snowpark DataFrame to Snowpark pandas DataFrame.  | 
  | 
Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame (  | 
  | 
Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame (  | 
  | 
Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame (  | 
  | 
Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame (  | 
  | 
Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame (  | 
  | 
Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame (  | 
  | 
Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame (  | 
  | 
Rotates a table by transforming columns into rows.  | 
  | 
Filters rows based on the specified conditional expression (similar to WHERE in SQL).  | 
  | 
Returns a DataFrame with an additional column with the specified name   | 
  | 
Returns a DataFrame with the specified column   | 
  | 
Returns a DataFrame with an additional column with the specified name   | 
  | 
Returns a DataFrame with the specified column   | 
  | 
Returns a DataFrame with additional columns with the specified names   | 
  | 
Returns a new DataFrame that excludes all rows containing fewer than a specified number of non-null and non-NaN values in the specified columns.  | 
  | 
Returns a new DataFrame that replaces all null and NaN values in the specified columns with the values provided.  | 
  | 
Returns a new DataFrame that replaces values in the specified columns.  | 
  | 
For a specified numeric column and a list of desired quantiles, returns an approximate value for the column at each of the desired quantiles.  | 
  | 
For a specified numeric column and a list of desired quantiles, returns an approximate value for the column at each of the desired quantiles.  | 
  | 
Calculates the correlation coefficient for non-null pairs in two numeric columns.  | 
  | 
Calculates the sample covariance for non-null pairs in two numeric columns.  | 
  | 
Computes a pair-wise frequency table (a   | 
  | 
Returns a DataFrame containing a stratified sample without replacement, based on a   | 
  | 
Returns a DataFrame containing a stratified sample without replacement, based on a   | 
  | 
Applies moving aggregations to the specified columns of the DataFrame using defined window sizes, and grouping and ordering criteria.  | 
Applies cummulative aggregations to the specified columns of the DataFrame using defined window direction, and grouping and ordering criteria.  | 
|
Creates lag columns to the specified columns of the DataFrame by grouping and ordering criteria.  | 
|
Creates lead columns to the specified columns of the DataFrame by grouping and ordering criteria.  | 
|
Applies aggregations to the specified columns of the DataFrame over specified time windows, and grouping criteria.  | 
|
  | 
Aggregate a column of text data using a natural language task description.  | 
  | 
Classify text or images into specified categories using AI.  | 
  | 
Generate a response (completion) on each row using the specified language model.  | 
  | 
Count the number of tokens in text for a specified language model.  | 
  | 
Generate embedding vectors from text or images.  | 
  | 
Extract structured information from text or files using a response schema.  | 
  | 
Filter rows using AI-powered boolean classification.  | 
  | 
Extract content from a document (OCR or layout parsing) as JSON text.  | 
  | 
Extract sentiment analysis from text content.  | 
  | 
Compute similarity scores between two columns using AI-powered embeddings.  | 
Split Markdown-formatted text into structured chunks based on header levels.  | 
|
Split text into chunks using recursive character-based splitting.  | 
|
  | 
Summarize a column of text data using AI.  | 
  | 
Transcribe text from an audio file with optional timestamps and speaker labels.  | 
  | 
Returns a new DataFrame with the result of applying func to each of the rows of the specified DataFrame.  | 
  | 
Returns a new DataFrame with the result of applying func to each batch of data in the dataframe.  | 
Attributes
Returns a   | 
|
Returns all column names as a list.  | 
|
Returns a   | 
|
Returns a   | 
|
The definition of the columns in this DataFrame (the "relational schema" for the DataFrame).  | 
|
Returns a new   | 
|
Whether the dataframe is cached.  | 
|
Returns a   |