DataFrame¶
Constructor
|
Snowpark pandas representation of |
Attributes
Get the index for this DataFrame. |
|
Get the columns for this Snowpark pandas |
|
Return the dtypes in the |
|
|
Print a concise summary of the |
|
Return a subset of the |
Return a NumPy representation of the dataset. |
|
Return a list representing the axes of the DataFrame. |
|
Return the number of dimensions of the underlying data, by definition 2. |
|
Return an int representing the number of elements in this object. |
|
Return a tuple representing the dimensionality of the |
|
Indicator whether the DataFrame is empty. |
Snowflake Specific
|
Convert Snowpark pandas DataFrame to pandas.DataFrame |
|
Save the Snowpark pandas DataFrame as a Snowflake table. |
|
Convert the Snowpark pandas DataFrame to a Snowpark DataFrame. |
|
Persists the current Snowpark pandas DataFrame to a temporary table to improve the latency of subsequent operations. |
Conversion
|
Cast a pandas object to a specified dtype |
|
Convert columns to best possible dtypes using dtypes supporting |
|
Make a copy of this object's indices and data. |
Indexing, iteration
|
Return the first n rows. |
Access a group of rows and columns by label(s) or a boolean array. |
|
Purely integer-location based indexing for selection by position. |
|
|
Insert column into DataFrame at specified location. |
Iterate over info axis. |
|
Get columns of the |
|
Iterate over |
|
|
Iterate over DataFrame rows as namedtuples. |
|
Return the last n rows. |
|
Whether each element in the DataFrame is contained in values. |
|
Replace values where the condition is False. |
|
Replace values where the condition is True. |
Binary operator functions
|
Get addition of |
|
Get subtraction of |
|
Get multiplication of |
|
Get floating division of |
|
Get floating division of |
|
Get integer division of |
|
Get modulo of |
|
Get exponential power of |
|
Get addition of |
|
Get subtraction of |
|
Get multiplication of |
|
Get floating division of |
|
Get floating division of |
|
Get integer division of |
|
Get modulo of |
|
Get exponential power of |
|
Get less than comparison of |
|
Get greater than comparison of |
|
Get less than or equal comparison of |
|
Get greater than or equal comparison of |
|
Get not equal comparison of |
|
Perform equality comparison of |
Function application, GroupBy & window
|
Apply a function along an axis of the DataFrame. |
|
Apply a function to a Dataframe elementwise. |
|
Aggregate using one or more operations over the specified axis. |
|
Aggregate using one or more operations over the specified axis. |
|
Call |
|
Group DataFrame using a mapper or by a Series of columns. |
|
Provide rolling window calculations. |
Computations / descriptive stats
Return a DataFrame with absolute numeric value of each element. |
|
|
Return whether all elements are True, potentially over an axis. |
|
Return whether any element are True, potentially over an axis. |
|
Count non-NA cells for each column or row. |
|
Return cumulative maximum over a BasePandasDataset axis. |
|
Return cumulative minimum over a BasePandasDataset axis. |
|
Return cumulative sum over a BasePandasDataset axis. |
|
Generate descriptive statistics for columns in the dataset. |
|
First discrete difference of element. |
|
Return the maximum of the values over the requested axis. |
|
Return the mean of the values over the requested axis. |
|
Return the median of the values over the requested axis. |
|
Return the minimum of the values over the requested axis. |
|
Return values at the given quantile over requested axis. |
|
Compute numerical data ranks (1 through n) along axis. |
|
Round a DataFrame to a variable number of decimal places. |
|
Return unbiased skew, normalized over n-1 |
|
Return the sum of the values over the requested axis. |
|
Return sample standard deviation over requested axis. |
|
Return unbiased variance over requested axis. |
|
Count number of distinct elements in specified axis. |
|
Return a Series containing the frequency of each distinct row in the Dataframe. |
Reindexing / selection / label manipulation
|
Prefix labels with string prefix. |
|
Suffix labels with string suffix. |
|
Drop specified labels from rows or columns. |
|
Return |
|
Return boolean Series denoting duplicate rows. |
|
Select initial periods of time series data based on a date offset. |
|
Get item from object for given key (ex: DataFrame column). |
|
Return the first n rows. |
|
Return index of first occurrence of maximum over requested axis. |
|
Return index of first occurrence of minimum over requested axis. |
|
Select final periods of time series data based on a date offset. |
|
Rename columns or index labels. |
|
Set the name of the axis for the index or columns. |
|
Reset the index, or a level of it. |
|
Return a random sample of items from an axis of object. |
|
Assign desired index to given axis. |
|
Set the DataFrame index using existing columns. |
|
Return the last n rows. |
|
Return the elements in the given positional indices along an axis. |
Missing data handling
|
Remove missing values. |
|
Synonym for |
|
Fill NA/NaN values using the specified method. |
Detect missing values. |
|
DataFrame.isnull is an alias for DataFrame.isna. |
|
Detect non-missing values for an array-like object. |
|
Detect non-missing values for an array-like object. |
|
|
Synonym for |
|
Replace values given in to_replace with value. |
Reshaping, sorting, transposing
|
Create a spreadsheet-style pivot table as a |
|
Sort by the values along either axis. |
|
Sort object by labels (along an axis). |
|
Unpivot a |
|
Squeeze 1 dimensional axis objects into scalars. |
Transpose index and columns. |
|
|
Transpose index and columns. |
Combining / comparing / joining / merging
|
Join columns of another DataFrame. |
|
Merge DataFrame or named Series objects with a database-style join. |
|
Modify in place using non-NA values from another |
Time Series-related
|
Shift data by desired number of periods along axis and replace columns with fill_value (default: None). |
Return index for first non-NA value or None, if no non-NA value is found. |
|
Return index for last non-NA value or None, if no non-NA value is found. |
|
|
Resample time-series data. |