探索一下Pandas的累加函數cumsum,我們可以先建立一個空的dataframe,用於存放接下來的值。
import pandas as pd columns = ['id_','name','money'] data_frame = pd.DataFrame(columns = columns) #創建一個3列的空dataframe
1. 賦值:給每一列增加數據
#建立數據 id_ = [1, 3, 2, 3, 2] name = ['A','B','C','D','E'] money = [100, 400, 280, 170, 500] data_frame['id_'] = id_ data_frame['name'] = name data_frame['money'] = money #給dataframe賦值
print(' 1. did not cumsum is: \n'+str(data_frame))
2. 直接對 'money' 列進行cumsum:
data_frame['cumsum_money'] = data_frame['money'].cumsum() print(' 2. the cumsum money is: \n'+str(data_frame))
3. 按照 'id_' 列的分組 group by,再進行分別cumsum,如圖所示:
# 先按照id分組,再對money列進行cumsum data_2 = data_frame.groupby(['id_']) print(data_2) data_frame['cumsum_money_groupby'] = data_2['money'].cumsum() print(' 3. the cumsum money after groupby is: \n'+str(data_frame))
##