集群測試
上傳小文件到集群,隨便選擇一個小文件上傳到hdfs的根目錄
[shaozhiqi@hadoop102 hadoop-3.1.2]$ bin/hdfs dfs -put wcinput/wc.input / [shaozhiqi@hadoop102 hadoop-3.1.2]$
驗證是否上傳成功,驗證ok,Replication的3代表三個副本數
image.png
上傳一個大文件如我們的hadoop安裝包
[shaozhiqi@hadoop102 hadoop-3.1.2]$ bin/hdfs dfs -put /opt/software/hadoop-3.1.2.tar.gz /
image.png
查看每個文件在hdfs的副本情況和存儲塊block,默認每個block大小為128M
關於塊block的說明:當hdfs上收到的文件大於128M,則該文件會被切分存儲為多個塊,多個塊以塊為單位存儲在不用的datanode上。但是值的注意的是如果某個文件大小沒有達到128M,則按文件大小存儲,不會占用整個128M的空間。
點擊:Wc.Input
image.png
點擊Hadoop-3.1.2.tar.gz
image.png
每個block最大存儲128M,多出就會再多一個block去存儲
文件在hdfs的實際路勁
[shaozhiqi@hadoop102 subdir0]$ ll total 327200 -rw-rw-r--. 1 shaozhiqi shaozhiqi 50 Jun 27 21:18 blk_1073741825 -rw-rw-r--. 1 shaozhiqi shaozhiqi 11 Jun 27 21:18 blk_1073741825_1001.meta -rw-rw-r--. 1 shaozhiqi shaozhiqi 134217728 Jun 27 21:25 blk_1073741826 -rw-rw-r--. 1 shaozhiqi shaozhiqi 1048583 Jun 27 21:25 blk_1073741826_1002.meta -rw-rw-r--. 1 shaozhiqi shaozhiqi 134217728 Jun 27 21:26 blk_1073741827 -rw-rw-r--. 1 shaozhiqi shaozhiqi 1048583 Jun 27 21:26 blk_1073741827_1003.meta -rw-rw-r--. 1 shaozhiqi shaozhiqi 63998133 Jun 27 21:26 blk_1073741828 -rw-rw-r--. 1 shaozhiqi shaozhiqi 499995 Jun 27 21:26 blk_1073741828_1004.meta [shaozhiqi@hadoop102 subdir0]$ pwd /opt/module/hadoop-3.1.2/data/tmp/dfs/data/current/BP-1884196904-192.168.1.102-1561640654781/current/finalized/subdir0/subdir0 [shaozhiqi@hadoop102 subdir0]$ cat blk_1073741825 #可以確定整個就是我們的wc.input hadoop yarn hadoop mapreduce shaozhiqi shaozhiqi [shaozhiqi@hadoop102 subdir0]$
刪除我們剛剛上傳到hdfs的文件
[shaozhiqi@hadoop102 hadoop-3.1.2]$ bin/hdfs dfs -rm -r /wc.input Deleted /wc.input [shaozhiqi@hadoop102 hadoop-3.1.2]$ bin/hdfs dfs -rm -r /hadoop-3.1.2.tar.gz Deleted /hadoop-3.1.2.tar.gz [shaozhiqi@hadoop102 hadoop-3.1.2]$
驗證是否刪除,驗證ok
image.png
