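To analyze the range and the trend of the distribution of data along a single dimension, plot a histogram together with a normal distribution curve.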
Example: given the daily income of every employee, analyze how income is distributed (51.7,50.6,57.9,56.9,56.7,56.7,55.3,56.1,53.7,54.5,56.9,51.9,52.1,55.1,54.9,54.7,55.3,55.3,54.5,54.9,54.5,55.3,54.9,54.3,53.7,53.5,53.7,53.1,54.5,53.1,53.9,53.5,53.3,53.9,53.5,53.5,52.5,53.3,53.5,53.3,53.7,53.1,54.5,53.9,56.7,54.5,54.3,55.1,54.1,54.5,53.9,53.1,53.3,55.3,55.7,56.1,54.7,53.1,53.3,52.7,53.1,52.9,53.1,54.3,53.1,52.7,53.1,53.3,53.1,53.3,53.1,53.3,55.1,54.7,54.9,54.3,53.9,53.7,53.9,53.5,54.5,54.3,55.5,55.7,55.5,54.9,55.3,55.5,53.7,54.1,53.9,55.7,55.9,53.7,53.5,53.1,52.3,52.7,52.9,53.3,53.9,52.7,53.5,53.1,52.7,51.9,52.5,53.9,54.5,55.7,55.3,54.9,53.1,52.9,54.1,53.3,54.7,53.9,54.3,54.1,53.7,53.3,52.7,52.9,52.5,53.9,53.5,54.1,54.1,54.7,54.9,54.9,54.1,53.3,52.9,53.7,53.9,54.3,54.1,54.5,54.7,54.9,52.1,52.9,53.5,52.7,53.1,53.1,53.5,52.9,52.9,53.1,53.3,52.7,53.5,53.9,54.9,55.1,54.3,55.1,54.3,54.3,53.9,54.5,54.5,54.3,55.3,54.5,54.9,53.5,52.1,55.3,55.7,55.7,55.5,54.5,57.7,54.7,53.7,53.1,53.7,55.9,56.1,53.9,53.7,53.3,53.9,53.9,54.5,54.7,56.1,55.7,53.1,53.7,53.5,53.9,53.9,53.5,53.3,53.1,52.5,55.9,55.7,54.1,54.3,54.1,54.1,54.5,54.5,55.1,53.1,53.3,54.1,54.3,53.9,54.1,54.7,54.7,53.7,53.1,53.3,52.7,53.5,52.9,53.7,56.5,56.1,55.7,55.5,56.9,57.7,56.5,55.7,54.1,54.7,55.7,55.5,53.1,52.7,53.1,53.3,53.5,54.3,54.1,54.5,54.7,55.7,55.5,54.1,54.3,54.7,53.1,53.3,53.1,52.7,53.1,53.7,53.1,54.7,54.5,55.1,54.7,54.5,56.1,55.7,53.3,52.5,53.7,54.1,53.3,52.1,52.3,53.1,53.3,53.5,53.3,53.1,52.7,53.1,55.7,55.1,54.3,53.7,53.1,52.9,53.1,52.7,52.5,53.1,53.5,53.1,53.3,54.1,55.1,54.9,56.1,55.7,56.5,54.7,53.7)
I. Creating the histogram
- Enter the data into a single column in Excel (column A here);
- Compute the maximum, minimum, range, number of classes, and class width;
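Maximum: =MAX(A:A) (= 57.9)
Minimum: =MIN(A:A) (= 50.6)
Range: maximum - minimum (= 7.3)
Number of classes: =ROUNDUP(SQRT(COUNT(A:A)),0) (= 18). COUNT(A:A) counts the cells in column A that contain numbers, SQRT takes the square root, and ROUNDUP always rounds up (away from zero) to the given number of digits; it does not round to nearest.
Class width: range / number of classes (= 7.3 / 18, about 0.4)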
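- Group the data: pick a suitable value slightly below the minimum as the first class boundary, then keep adding the class width until the last boundary is greater than the maximum.
Here the first boundary is 50.5 and each following boundary adds the class width (about 0.4); the last boundary is 58.2, giving 20 boundary values in total (50.5 + 19 × 7.3/18 ≈ 58.2).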
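- Count frequencies: count how many data values fall into each class.
Method: use the FREQUENCY function, which returns the frequency distribution of a data set as a vertical array.
1. Enter =FREQUENCY(range of the raw data, range of the class boundaries);
2. Select the range of cells that will hold the count for each class (with the formula cell as its first cell);
3. Press F2 to enter edit mode;
4. Hold Ctrl and Shift, press Enter, then release all three keys.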
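- Create the histogram: select the frequency counts and insert a column chart.
- Tidy the column chart: in Format Data Series, set the gap between columns to zero.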
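For readers who want to cross-check the worksheet outside Excel, the steps in this section can be sketched in Python. This is only an illustration under stated assumptions: the function names `class_edges` and `frequency` are made up here, `sample` is just the first twelve values of the data above (not the full data set), and `frequency` imitates FREQUENCY by counting values greater than the previous boundary and up to and including the current one, with one extra count at the end for values above the last boundary.

```python
import math
from bisect import bisect_left


def class_edges(values, start):
    """Build class boundaries from `start`, stepping by range / class count,
    until one boundary exceeds the maximum (hypothetical helper)."""
    v_max, v_min = max(values), min(values)
    # Same idea as ROUNDUP(SQRT(COUNT(A:A)),0) for positive inputs.
    n_classes = math.ceil(math.sqrt(len(values)))
    width = (v_max - v_min) / n_classes
    edges = [start]
    i = 1
    while edges[-1] <= v_max:
        edges.append(start + i * width)
        i += 1
    return n_classes, width, edges


def frequency(values, edges):
    """Counts per class in the style of FREQUENCY: len(edges) + 1 results."""
    counts = [0] * (len(edges) + 1)
    for v in values:
        # bisect_left finds the first boundary that is >= v.
        counts[bisect_left(edges, v)] += 1
    return counts


# First twelve values of the data above, used only as a small sample.
sample = [51.7, 50.6, 57.9, 56.9, 56.7, 56.7, 55.3, 56.1, 53.7, 54.5, 56.9, 51.9]

n_classes, width, edges = class_edges(sample, start=50.5)
counts = frequency(sample, edges)

print(n_classes, round(width, 3))
print([round(e, 3) for e in edges])
print(counts)
```

With the full column of data in place of `sample`, the same two functions should reproduce the class count and the frequency column computed in the worksheet, provided the same starting boundary is used.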
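II. Creating the normal distribution curve
- Obtain the normal probability density: NORMDIST (returns the normal distribution for the specified mean and standard deviation)
Syntax:
NORMDIST(x,mean,standard_dev,cumulative)
x: the value at which the distribution is evaluated (use each class boundary as x and fill the formula down).
mean: the arithmetic mean of the distribution; mean = AVERAGE(A:A), which is 54.09 here.
standard_dev: the standard deviation of the distribution; standard_dev = STDEV.S(A:A) (the sample standard deviation), which is 1.15 here.
cumulative: a logical value that selects the form of the function. TRUE makes NORMDIST return the cumulative distribution function; FALSE returns the probability density function. Use FALSE here.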
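- Add the normal distribution curve to the histogram: add the density values as a series, change that series to a line chart, and plot it on the secondary axis.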
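As a rough cross-check of the density column, the quantity that NORMDIST returns with cumulative set to FALSE can be written out directly. The sketch below is an assumption-laden illustration, not part of the Excel procedure: `normal_density` is a hypothetical helper name, the mean and standard deviation are the values quoted above, and `xs` is a handful of illustrative positions rather than the full list of class boundaries. The result is compared against `statistics.NormalDist` from the Python standard library.

```python
import math
from statistics import NormalDist


def normal_density(x, mean, standard_dev):
    """Normal probability density at x (hypothetical helper),
    the same quantity as NORMDIST(x, mean, standard_dev, FALSE)."""
    z = (x - mean) / standard_dev
    return math.exp(-0.5 * z * z) / (standard_dev * math.sqrt(2 * math.pi))


mean, standard_dev = 54.09, 1.15      # values quoted in the text above
xs = [50.5, 52.0, 54.09, 56.0, 58.2]  # illustrative positions only

densities = [normal_density(x, mean, standard_dev) for x in xs]
reference = NormalDist(mean, standard_dev)

# Print x, the hand-written density, and the library value side by side.
for x, d in zip(xs, densities):
    print(f"{x:6.2f}  {d:.4f}  {reference.pdf(x):.4f}")
```

The density peaks at the mean at roughly 0.35, far smaller than the class counts, which is why the curve is plotted on the secondary axis.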
