Handling large files on Linux

This article is a repost. 2014-03-15 23:39, tagged: linux

1. Look at only part of the file with head, tail, or more
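
As a rough illustration of the three viewers, the sketch below assumes a POSIX shell with the usual head, tail, and seq utilities. The file name big.log and its contents are hypothetical, generated only so the commands have something to read.

```shell
# Create a hypothetical sample file with 100000 numbered lines.
workdir=$(mktemp -d)
seq 1 100000 > "$workdir/big.log"

# First 3 lines: head stops reading once it has printed them.
first=$(head -n 3 "$workdir/big.log")
echo "$first"

# Last 2 lines of the file.
last=$(tail -n 2 "$workdir/big.log")
echo "$last"

# Lines 50000-50002: take the first 50002 lines, then keep the last 3.
middle=$(head -n 50002 "$workdir/big.log" | tail -n 3)
echo "$middle"

# more pages interactively, so it is shown here only as a comment:
#   more "$workdir/big.log"
```

None of these commands loads the whole file into memory, which is why they stay usable on files too large for an editor.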

2. Split the large file into smaller pieces first with split

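split options:
-a, --suffix-length=N   length of the output file name suffix (default 2)
-b, --bytes=SIZE        size of each output file in bytes
-C, --line-bytes=SIZE   put at most SIZE bytes of whole lines into each output file
-d, --numeric-suffixes  use numeric suffixes instead of alphabetic ones
-l, --lines=NUMBER      number of lines in each output file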
   
Examples:
[root@10.10.90.97 sh]# split -b 1024 -a 3 push.sh
[root@10.10.90.97 sh]# ls 
push.sh xaaa  xaab  xaac  xaad  xaae  xaaf
The -a option sets the suffix length to 3 characters.

[root@10.10.90.97 sh]# split -b 1024 push.sh push_  
[root@10.10.90.97 sh]# ls
push_aa  push_ab  push_ac  push_ad  push_ae  push_af    push.sh
The -b option sets the size of each output file to 1024 bytes, and push_ replaces the default prefix x.

[root@10.10.90.97 sh]# split -b 1024 -d push.sh 
[root@10.10.90.97 sh]# ls
push.sh x00  x01  x02  x03  x04  x05
The -d option makes the suffixes numeric; the default is alphabetic.

[root@10.10.90.97 sh]# split -l 5 push.sh 
[root@10.10.90.97 sh]# ls
push.sh  xaa  xac  xae  xag  xai  xak  xam  xao  xaq  xas  xau  xaw  xay  xba
xab  xad  xaf  xah  xaj  xal  xan  xap  xar  xat  xav  xax  xaz
The -l option sets the number of lines in each output file to 5.

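Appendix: command to merge the pieces back together (the shell expands the wildcard in sorted order, so the pieces are concatenated in suffix order):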
    cat small_files* > large_file
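
To check that splitting and merging really round-trips, something like the following sketch can be used. It assumes a POSIX shell with split, cat, and cmp available. The names large_file, part_, and rebuilt_file are placeholders chosen for this example.

```shell
workdir=$(mktemp -d)

# Hypothetical input: 10000 numbered lines.
seq 1 10000 > "$workdir/large_file"

# Split into 3000-line pieces named part_aa, part_ab, ...
split -l 3000 "$workdir/large_file" "$workdir/part_"

# Count the pieces (tr strips the padding some wc versions add).
pieces=$(ls "$workdir"/part_* | wc -l | tr -d ' ')
echo "pieces: $pieces"

# Reassemble; the glob expands in sorted order, so the pieces line up.
cat "$workdir"/part_* > "$workdir/rebuilt_file"

# cmp -s exits 0 only if the two files are byte-for-byte identical.
if cmp -s "$workdir/large_file" "$workdir/rebuilt_file"; then
    status=identical
else
    status=different
fi
echo "$status"
```

Writing the merged output under a name that the wildcard does not match (here rebuilt_file rather than part_something) avoids cat reading its own output.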

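3. Process the file line by line with sed
    Total number of lines: wc -l filename

    sed -n '1,1024p' filename  # print lines 1 through 1024 in order; replace 1024 with the file's total line count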
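
When only a small range from a large file is needed, sed can be told to quit after the last wanted line instead of scanning to the end. The sketch below assumes a POSIX sed; data.txt is a hypothetical file generated for the example.

```shell
workdir=$(mktemp -d)
seq 1 100000 > "$workdir/data.txt"

# Total number of lines; redirecting the input keeps the file name
# out of wc's output, and tr strips any padding.
total=$(wc -l < "$workdir/data.txt" | tr -d ' ')
echo "total lines: $total"

# Print lines 10-12, then quit at line 12 so the rest is never read.
chunk=$(sed -n '10,12p;12q' "$workdir/data.txt")
echo "$chunk"
```

Without the trailing 12q the output is the same, but sed keeps reading every remaining line, which is noticeable on multi-gigabyte files.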

4. awk reads the file one line at a time
    awk '{print;}' employee.txt  # print each line of the file in turn
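
Because awk holds only the current line in memory, it can also filter or aggregate while streaming. The following is a minimal sketch: the two-column layout of employee.txt (name, salary) is an assumption made for this example, not something the original file is known to contain.

```shell
workdir=$(mktemp -d)

# Hypothetical employee.txt with a name column and a salary column.
printf '%s\n' 'alice 100' 'bob 250' 'carol 50' > "$workdir/employee.txt"

# NR is the number of lines read so far; $2 is the second field.
summary=$(awk '{ sum += $2 } END { print NR, sum }' "$workdir/employee.txt")
echo "$summary"

# Print the first field of every line whose second field exceeds 80.
high=$(awk '$2 > 80 { print $1 }' "$workdir/employee.txt")
echo "$high"
```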
