Downloading data from Kaggle with wget


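Step 1: Export your cookies to cookie.txt. Log in to kaggle.com (I used IE11), click Favorites, click the triangle to the right of "Add to favorites", choose "Import and export", choose "Export to a file", select Cookies, and click Export.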
Step 2: wget -cb 'https://www.kaggle.com/account/login?ReturnUrl=%2Fc%2Fnoaa-fisheries-steller-sea-lion-population-count%2Fdownload%2FKaggleNOAASeaLions.7z' --post-data 'username=YOUR_EMAIL&password=YOUR_PASSWORD'
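However, this method downloaded very slowly, and when the download finally finished, extracting the archive failed with an error.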
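
The cookie file exported in step 1 can also be handed to wget directly instead of posting credentials on the command line. The following is only a minimal sketch: the function name fetch_with_cookies and the URL variable are illustrative and not part of the original post, and it assumes the exported file is in the Netscape cookie format that wget's --load-cookies option reads. Whether Kaggle accepts the session this way is not confirmed here.

```shell
#!/bin/sh
# Sketch only: fetch_with_cookies and URL are illustrative names (assumptions).
URL='https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/download/KaggleNOAASeaLions.7z'

fetch_with_cookies() {
    cookie_file=$1
    url=$2
    # Stop early if the cookie file is missing.
    if [ ! -f "$cookie_file" ]; then
        echo "cookie file not found: $cookie_file" >&2
        return 1
    fi
    # Stop early if the file holds no kaggle.com entry at all.
    if ! grep -q 'kaggle\.com' "$cookie_file"; then
        echo "no kaggle.com cookies in $cookie_file" >&2
        return 2
    fi
    # -c resumes a partial download; --load-cookies reads the exported cookies.
    wget -c --load-cookies "$cookie_file" "$url"
}

# Usage: fetch_with_cookies cookie.txt "$URL"
```

Checking the cookie file first avoids starting a long download that would only fetch the login page.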

So I looked for a way to extract the file and found this:

https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/30930

Answer 1: If it helps anyone: the default ubuntu Archive Manager failed to unpack the file for me. Was only showing the list of folders and files in them. I could unpack the archive by installing p7zip-full (sudo apt-get install p7zip-full) and doing 7z x Kaggle-NOAA-SeaLions -pPassword.

I tried this method, but it did not solve the problem: extraction still failed with a CRC check error.
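
A CRC error usually points to a damaged download rather than to the extraction tool, so it can help to test the archive before trying to unpack it. The sketch below uses the 7z t command from p7zip-full, which checks the archive without writing any files. The function name check_archive and the return codes 10 and 11 are illustrative choices, not from the original post, and the password is whatever the competition page provides.

```shell
#!/bin/sh
# Sketch only: check_archive and its return codes 10/11 are illustrative.
check_archive() {
    archive=$1
    password=$2
    # An empty or missing file cannot be a valid archive.
    if [ ! -s "$archive" ]; then
        echo "archive missing or empty: $archive" >&2
        return 10
    fi
    # 7z comes from the p7zip-full package on Ubuntu.
    if ! command -v 7z >/dev/null 2>&1; then
        echo "7z not found; install p7zip-full first" >&2
        return 11
    fi
    # "7z t" tests integrity without extracting; it exits non-zero on errors.
    7z t "$archive" "-p$password"
}

# Usage: check_archive KaggleNOAASeaLions.7z Password && echo "archive looks intact"
```

If the test fails, downloading the file again is more likely to help than switching extraction tools.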

So I decided to download the data again, and found a fast download method that uses aria2c:

https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/32702

sudo apt install aria2
aria2c -c -x 16 -s 16 --load-cookies cookies.txt -p https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/download/KaggleNOAASeaLions.7z
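
The speed-up in the command above comes from its options: -c resumes a partial download, -x sets the maximum number of connections per server (aria2c allows 1 to 16), -s sets how many pieces the file is split into, and --load-cookies supplies the logged-in session. A small wrapper, sketched here with the illustrative name aria2_fetch (not from the original post), makes those choices explicit and rejects values aria2c would refuse.

```shell
#!/bin/sh
# Sketch only: aria2_fetch is an illustrative wrapper name (an assumption).
aria2_fetch() {
    cookies=$1
    url=$2
    conns=${3:-16}
    # The cookie file carries the logged-in Kaggle session.
    if [ ! -f "$cookies" ]; then
        echo "cookie file not found: $cookies" >&2
        return 1
    fi
    # The connection count must be a plain integer between 1 and 16.
    case $conns in
        ''|*[!0-9]*) echo "connections must be a number" >&2; return 2 ;;
    esac
    if [ "$conns" -lt 1 ] || [ "$conns" -gt 16 ]; then
        echo "connections must be between 1 and 16" >&2
        return 2
    fi
    # -c resume, -x connections per server, -s number of splits.
    aria2c -c -x "$conns" -s "$conns" --load-cookies "$cookies" "$url"
}

# Usage: aria2_fetch cookies.txt "https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/download/KaggleNOAASeaLions.7z"
```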

Reference: http://blog.csdn.net/laozhaokun/article/details/49587463

