ImageNet數據集是Vision領域最重要的數據集之一,十分經典也十分常用,但是該數據集體量較大,而且由於在外網下載速度較慢,其中最為重要的一點是該數據集只提供給高校科研工作者,這也就意味着你必須有一個edu的郵箱才可以下載該數據集。
需要注意的是該數據不可用於商用。
數據集的官網地址:
https://image-net.org/index.php
數據集的下載頁面:
https://image-net.org/download-images.php
這里需要注意的是下載前要求必須注冊,而且必須是教育網郵箱注冊成功后才可以有權限下載。
注冊成功后獲得的郵件通知,需要驗證后才可以進行下載數據。
注冊成功后打開下載頁面:
在上面可以找到對應版本的數據集。
在歷年的數據集中我們一般使用經典的ILSVRC2012,因此這里也是對該數據集進行下載。
==============================================
Development Kit
The development kit includes
- Meta data for the competition categories.
- Matlab routines for evaluating submissions.
Please be sure to consult the readme file included in the development kit.
Development kit (Task 1 & 2). 2.5MB.
Development kit (Task 3). 22MB.
Images
Training images (Task 1 & 2). 138GB. MD5: 1d675b47d978889d74fa0da5fadfb00e
Training images (Task 3). 728MB. MD5: ccaf1013018ac1037801578038d370da
Validation images (all tasks). 6.3GB. MD5: 29b22e2961454d5413ddabcf34fc5622
Test images (all tasks). 13GB. MD5: e1b8681fff3d63731c599df9b4b6fc02
If you downloaded ILSVRC 2012 test images on or before 10/10/2019, please apply this patch to replace a subset of images (a total of 2 images are replaced). Note that training and validation images are not affected.
Terms of use: by downloading the image data from the above URLs, you agree to the following terms:
- You will use the data only for non-commercial research and educational purposes.
- You will NOT distribute the above URL(s).p
- Stanford University and Princeton University make no representations or warranties regarding the data, including but not limited to warranties of non-infringement or fitness for a particular purpose.
- You accept full responsibility for your use of the data and shall defend and indemnify Stanford University and Princeton University, including their employees, officers and agents, against any and all claims arising from your use of the data, including but not limited to your use of any copies of copyrighted images that you may create from the data.
Bounding Boxes
Training bounding box annotations (Task 1 & 2 only) . 20MB. MD5: 9271167e2176350e65cfe4e546f14b17
Training bounding box annotations (Task 3 only) . 1MB. MD5: 61ebd3cc0e4793899a841b6b27f3d6d8
Validation bounding box annotations (all tasks) . 2.2MB. MD5: f4cd18b5ea29fe6bbea62ec9c20d80f0
Test bounding box annotations (Task 3 only). 33MB. MD5: 2dfdb2677fd9661585d17d5a5d027624
=====================================================
2012年版本的所有數據集文件全部下載:
使用教育網下載速度還是可以的,基本一天時間就差不多搞定。
==================================================