modin.pandas.Resampler.sum¶
- Resampler.sum(numeric_only: bool = False, min_count: int = 0, *args: Any, **kwargs: Any) Union[DataFrame, Series] [source]¶
Compute sum of resample bins.
- Parameters:
numeric_only (bool, default False) – Include only float, int, boolean columns.
min_count (int, default 0) – The required number of valid values to perform the operation. If fewer than
min_count
and non-NA values are present, the result will be NA.engine (str, default None) – This parameter is ignored in Snowpark pandas. The execution engine will always be Snowflake.
engine_kwargs (dict, default None) – This parameter is ignored in Snowpark pandas. The execution engine will always be Snowflake.
- Returns:
Computed sum of values within each resample bin.
- Return type:
Series
orDataFrame
Examples
For Series:
>>> lst1 = pd.date_range('2020-01-01', periods=4, freq='1D') >>> ser1 = pd.Series([1, 2, 3, 4], index=lst1) >>> ser1 2020-01-01 1 2020-01-02 2 2020-01-03 3 2020-01-04 4 Freq: None, dtype: int64
>>> ser1.resample('2D').sum() 2020-01-01 3 2020-01-03 7 Freq: None, dtype: int64
>>> lst2 = pd.date_range('2020-01-01', periods=4, freq='S') >>> ser2 = pd.Series([1, 2, np.nan, 4], index=lst2) >>> ser2 2020-01-01 00:00:00 1.0 2020-01-01 00:00:01 2.0 2020-01-01 00:00:02 NaN 2020-01-01 00:00:03 4.0 Freq: None, dtype: float64
>>> ser2.resample('2S').sum() 2020-01-01 00:00:00 3.0 2020-01-01 00:00:02 4.0 Freq: None, dtype: float64
For DataFrame:
>>> data = [[1, 8, 2], [1, 2, 5], [2, 5, 8], [2, 6, 9]] >>> df = pd.DataFrame(data, ... columns=["a", "b", "c"], ... index=pd.date_range('2020-01-01', periods=4, freq='1D')) >>> df a b c 2020-01-01 1 8 2 2020-01-02 1 2 5 2020-01-03 2 5 8 2020-01-04 2 6 9
>>> df.resample('2D').sum() a b c 2020-01-01 2 10 7 2020-01-03 4 11 17