DataFrame
Classes
| 
 | Represents a lazily-evaluated relational dataset that contains a collection of  | 
| 
 | Provides functions for handling missing values in a  | 
| 
 | Provides computed statistical functions for DataFrames. | 
| 
 | Provides data analytics functions for DataFrames. | 
| 
 | Provides AI-powered functions for a  | 
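The "lazily-evaluated" behavior mentioned in the class summary above can be illustrated with a small pure-Python toy. This is only a sketch of the general idea, not the library's implementation: `LazyFrame`, `load`, and the `plan` tuple are hypothetical names introduced for illustration. Transformations only record a step; the source is read when an action such as `collect` runs.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Row = Dict[str, object]


@dataclass(frozen=True)
class LazyFrame:
    """Hypothetical toy stand-in for a lazily evaluated DataFrame."""

    source: Callable[[], List[Row]]            # produces the base rows on demand
    plan: Tuple[Tuple[str, object], ...] = ()  # recorded steps, not yet executed

    def filter(self, predicate: Callable[[Row], bool]) -> "LazyFrame":
        # Record the step and return a new object; nothing is evaluated here.
        return LazyFrame(self.source, self.plan + (("filter", predicate),))

    def select(self, *names: str) -> "LazyFrame":
        # Record a projection onto the named columns.
        return LazyFrame(self.source, self.plan + (("select", names),))

    def collect(self) -> List[Row]:
        # An action: only now is the source read and the recorded plan applied.
        rows = self.source()
        for op, arg in self.plan:
            if op == "filter":
                rows = [r for r in rows if arg(r)]
            elif op == "select":
                rows = [{n: r[n] for n in arg} for r in rows]
        return rows


reads = []  # grows by one each time the source is actually read


def load() -> List[Row]:
    reads.append(1)
    return [{"id": 1, "price": 5.0}, {"id": 2, "price": 20.0}]


df = LazyFrame(load).filter(lambda r: r["price"] > 10).select("id")
reads_before_collect = len(reads)  # building the plan has not read the source
result = df.collect()              # the single read happens here
```

In the toy, each transformation returns a new object rather than mutating the old one, which mirrors the "Returns a new DataFrame" wording used throughout the method summaries.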
Methods
| 
 | Aggregate the data in the DataFrame. | 
| 
 | For a specified numeric column and a list of desired quantiles, returns an approximate value for the column at each of the desired quantiles. | 
| 
 | For a specified numeric column and a list of desired quantiles, returns an approximate value for the column at each of the desired quantiles. | 
| 
 | Caches the content of this DataFrame to create a new cached Table DataFrame. | 
| 
 | Returns a reference to a column in the DataFrame. | 
| 
 | Returns a new DataFrame with only the columns whose names match the specified pattern using case-insensitive ILIKE matching (similar to SELECT * ILIKE 'pattern' in SQL). | 
| Executes the query representing this DataFrame and returns the result as a list of  | |
| 
 | Executes the query representing this DataFrame asynchronously and returns an AsyncJob. | 
| 
 | Executes a COPY INTO <table> command to load data from files in a stage location into a specified table. | 
| 
 | Calculates the correlation coefficient for non-null pairs in two numeric columns. | 
| Executes the query representing this DataFrame and returns the number of rows in the result (similar to the COUNT function in SQL). | |
| 
 | Calculates the sample covariance for non-null pairs in two numeric columns. | 
| Creates a dynamic table that captures the computation expressed by this DataFrame. | |
| 
 | Creates or replaces a temporary view that returns the same results as this DataFrame. | 
| 
 | Creates a view that captures the computation expressed by this DataFrame. | 
| 
 | Creates a temporary view that returns the same results as this DataFrame. | 
| Creates or replaces a temporary view that returns the same results as this DataFrame. | |
| 
 | Creates a view that captures the computation expressed by this DataFrame. | 
| 
 | Creates a temporary view that returns the same results as this DataFrame. | 
| 
 | Performs a cross join, which returns the Cartesian product of the current  | 
| 
 | Performs a cross join, which returns the Cartesian product of the current  | 
| 
 | Computes a pair-wise frequency table (a  | 
| 
 | Performs a SQL GROUP BY CUBE. | 
| 
 | Computes basic statistics for numeric columns, which includes  | 
| Returns a new DataFrame that contains only the rows with distinct values from the current DataFrame. | |
| 
 | Returns a new DataFrame that excludes the columns with the specified names from the output. | 
| 
 | Creates a new DataFrame by removing duplicate rows based on a given subset of columns. | 
| 
 | Creates a new DataFrame by removing duplicate rows based on a given subset of columns. | 
| 
 | Returns a new DataFrame that excludes all rows containing fewer than a specified number of non-null and non-NaN values in the specified columns. | 
| 
 | Returns a new DataFrame that contains all the rows from the current DataFrame except for the rows that also appear in the  | 
| Prints the list of queries that will be executed to evaluate this DataFrame. | |
| 
 | Returns a new DataFrame that replaces all null and NaN values in the specified columns with the values provided. | 
| 
 | Filters rows based on the specified conditional expression (similar to WHERE in SQL). | 
| Executes the query representing this DataFrame and returns the first  | |
| 
 | Flattens (explodes) compound values into multiple rows. | 
| 
 | Groups rows by the columns specified by expressions (similar to GROUP BY in SQL). | 
| 
 | Groups rows by the columns specified by expressions (similar to GROUP BY in SQL). | 
| 
 | Performs a SQL GROUP BY GROUPING SETS. | 
| 
 | Returns a new DataFrame that contains the intersection of rows from the current DataFrame and another DataFrame ( | 
| 
 | Performs a join of the specified type ( | 
| 
 | Lateral joins the current DataFrame with the output of the specified table function. | 
| 
 | Returns a new DataFrame that contains at most  | 
| 
 | Returns a new DataFrame that contains all the rows from the current DataFrame except for the rows that also appear in the  | 
| 
 | Performs a natural join of the specified type ( | 
| 
 | Sorts a DataFrame by the specified expressions (similar to ORDER BY in SQL). | 
| 
 | Sorts a DataFrame by the specified expressions (similar to ORDER BY in SQL). | 
| 
 | Rotates this DataFrame by turning the unique values from one column in the input expression into multiple columns and aggregating results where required on any remaining column values. | 
| 
 | Prints the schema of a dataframe in tree format. | 
| 
 | Prints the schema of a dataframe in tree format. | 
| 
 | Randomly splits the current DataFrame into separate DataFrames, using the specified weights. | 
| 
 | Randomly splits the current DataFrame into separate DataFrames, using the specified weights. | 
| 
 | Returns a DataFrame with the specified column  | 
| 
 | Returns a new DataFrame that replaces values in the specified columns. | 
| 
 | Performs a SQL GROUP BY ROLLUP. | 
| 
 | Samples rows based on either the number of rows to be returned or a percentage of rows to be returned. | 
| 
 | Returns a DataFrame containing a stratified sample without replacement, based on a  | 
| 
 | Returns a DataFrame containing a stratified sample without replacement, based on a  | 
| 
 | Returns a new DataFrame with the specified Column expressions as output (similar to SELECT in SQL). | 
| 
 | Projects a set of SQL expressions and returns a new  | 
| 
 | Projects a set of SQL expressions and returns a new  | 
| 
 | Evaluates this DataFrame and prints out the first  | 
| 
 | Sorts a DataFrame by the specified expressions (similar to ORDER BY in SQL). | 
| 
 | Returns a new DataFrame that contains all the rows from the current DataFrame except for the rows that also appear in the  | 
| 
 | Executes the query representing this DataFrame and returns the first  | 
| 
 | Creates a new DataFrame containing columns with the specified names. | 
| 
 | Executes the query representing this DataFrame and returns an iterator of  | 
| 
 | Executes the query representing this DataFrame and returns the result as a pandas DataFrame. | 
| 
 | Creates a new DataFrame containing columns with the specified names. | 
| Executes the query representing this DataFrame and returns an iterator of  | |
| Executes the query representing this DataFrame and returns the result as a pandas DataFrame. | |
| Executes the query representing this DataFrame and returns an iterator of pandas dataframes (containing a subset of rows) that you can use to retrieve the results. | |
| 
 | Converts the Snowpark DataFrame to a Snowpark pandas DataFrame. | 
| 
 | Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame ( | 
| 
 | Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame ( | 
| 
 | Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame ( | 
| 
 | Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame ( | 
| 
 | Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame ( | 
| 
 | Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame ( | 
| 
 | Returns a new DataFrame that contains all the rows in the current DataFrame and another DataFrame ( | 
| 
 | Rotates a table by transforming columns into rows. | 
| 
 | Filters rows based on the specified conditional expression (similar to WHERE in SQL). | 
| 
 | Returns a DataFrame with an additional column with the specified name  | 
| 
 | Returns a DataFrame with the specified column  | 
| 
 | Returns a DataFrame with an additional column with the specified name  | 
| 
 | Returns a DataFrame with the specified column  | 
| 
 | Returns a DataFrame with additional columns with the specified names  | 
| 
 | Returns a new DataFrame that excludes all rows containing fewer than a specified number of non-null and non-NaN values in the specified columns. | 
| 
 | Returns a new DataFrame that replaces all null and NaN values in the specified columns with the values provided. | 
| 
 | Returns a new DataFrame that replaces values in the specified columns. | 
| 
 | For a specified numeric column and a list of desired quantiles, returns an approximate value for the column at each of the desired quantiles. | 
| 
 | For a specified numeric column and a list of desired quantiles, returns an approximate value for the column at each of the desired quantiles. | 
| 
 | Calculates the correlation coefficient for non-null pairs in two numeric columns. | 
| 
 | Calculates the sample covariance for non-null pairs in two numeric columns. | 
| 
 | Computes a pair-wise frequency table (a  | 
| 
 | Returns a DataFrame containing a stratified sample without replacement, based on a  | 
| 
 | Returns a DataFrame containing a stratified sample without replacement, based on a  | 
| 
 | Applies moving aggregations to the specified columns of the DataFrame using defined window sizes, and grouping and ordering criteria. | 
| Applies cumulative aggregations to the specified columns of the DataFrame using a defined window direction, and grouping and ordering criteria. | |
| Creates lag columns for the specified columns of the DataFrame using grouping and ordering criteria. | |
| Creates lead columns for the specified columns of the DataFrame using grouping and ordering criteria. | |
| Applies aggregations to the specified columns of the DataFrame over specified time windows, and grouping criteria. | |
| 
 | Aggregate a column of text data using a natural language task description. | 
| 
 | Classify text or images into specified categories using AI. | 
| 
 | Generate a response (completion) on each row using the specified language model. | 
| 
 | Count the number of tokens in text for a specified language model. | 
| 
 | Generate embedding vectors from text or images. | 
| 
 | Extract structured information from text or files using a response schema. | 
| 
 | Filter rows using AI-powered boolean classification. | 
| 
 | Extract content from a document (OCR or layout parsing) as JSON text. | 
| 
 | Extract sentiment analysis from text content. | 
| 
 | Compute similarity scores between two columns using AI-powered embeddings. | 
| Split Markdown-formatted text into structured chunks based on header levels. | |
| Split text into chunks using recursive character-based splitting. | |
| 
 | Summarize a column of text data using AI. | 
| 
 | Transcribe text from an audio file with optional timestamps and speaker labels. | 
| 
 | Returns a new DataFrame with the result of applying func to each of the rows of the specified DataFrame. | 
| 
 | Returns a new DataFrame with the result of applying func to each batch of data in the dataframe. | 
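Two of the summaries above describe row-level semantics that are easy to misread: dropping rows with "fewer than a specified number of non-null and non-NaN values", and the cross join as a Cartesian product. The following pure-Python sketch models both on lists of dictionaries. It does not call the library; the function names and the `thresh` and `subset` parameter names are assumptions chosen for illustration.

```python
import math
from itertools import product


def is_missing(value):
    """Treat None and float NaN as missing, as the summaries describe."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def drop_sparse_rows(rows, thresh, subset):
    """Keep rows that have at least `thresh` non-missing values in `subset`.

    Equivalently: exclude rows with fewer than `thresh` such values.
    """
    return [
        r for r in rows
        if sum(not is_missing(r[c]) for c in subset) >= thresh
    ]


def cross_join(left, right):
    """Pair every left row with every right row (the Cartesian product)."""
    return [{**l, **r} for l, r in product(left, right)]


rows = [
    {"a": 1.0, "b": None},           # one usable value
    {"a": float("nan"), "b": None},  # no usable values
    {"a": 2.0, "b": 3.0},            # two usable values
]
kept = drop_sparse_rows(rows, thresh=1, subset=["a", "b"])

pairs = cross_join([{"x": 1}, {"x": 2}], [{"y": "p"}, {"y": "q"}])
```

With `thresh=1`, only the row whose values are all missing is excluded. The cross join of two 2-row inputs yields 4 rows, which is why the result grows quickly as the inputs get larger.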
Attributes
| Returns a  | |
| Returns all column names as a list. | |
| Returns a  | |
| Returns a  | |
| The definition of the columns in this DataFrame (the "relational schema" for the DataFrame). | |
| Returns a new  | |
| Whether the dataframe is cached. | |
| Returns a  |