wget从kaggle下载数据


step 1:导出cookie到cookie.txt。登录kaggle.com(我用的ie11),点击收藏夹,点击添加到收藏夹右边的三角形,选择导入导出,选择导出到文件,选择cokies,选择导出。
step 2: wget -cb https://www.kaggle.com/account/login?ReturnUrl=%2Fc%2Fnoaa-fisheries-steller-sea-lion-population-count%2Fdownload%2FKaggleNOAASeaLions.7z --post-data 'username=1030997649@qq.com&password=6393374'
但是这种方法下载的很慢,最后下载下来后,解压时还出错了。

于是,我找到了这种方法去解压文件:

https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/30930

答案1:If it helps anyone: the default ubuntu Archive Manager failed to unpack the file for me. Was only showing the list of folders and files in them. I could unpack the archive by installing p7zip-full (sudo apt-get install p7zip-full) and doing 7z x Kaggle-NOAA-SeaLions -pPassword.

我用这种方法,还是没有解决问题。有CRC校验错误。

于是,我决定重新下载数据,找到了一种用aria2c的快速下载方法:

https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/32702

sudo apt install aria2
aria2c -c -x 16 -s 16 --load-cookies cookies.txt -p https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/download/KaggleNOAASeaLions.7z

参考;http://blog.csdn.net/laozhaokun/article/details/49587463


免责声明!

本站转载的文章为个人学习借鉴使用,本站对版权不负任何法律责任。如果侵犯了您的隐私权益,请联系本站邮箱yoyou2525@163.com删除。



 
粤ICP备18138465号  © 2018-2025 CODEPRJ.COM