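I had a rather unreasonable requirement: use MatConvNet on Windows, with GPU support.
It took some effort, so I am recording the process here (the vl_imreadjpeg function is not supported for now).
Download MatConvNet first. The machine is set up with VS2010, Matlab 2014a, and CUDA 6.5.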
- In Matlab, change to {matconvnet_root}:
- mex -c -largeArrayDims -lmwblas "matlab/src/bits/im2col.cpp"
- mex -c -largeArrayDims -lmwblas "matlab/src/bits/pooling.cpp"
- mex -c -largeArrayDims -lmwblas "matlab/src/bits/normalize.cpp"
- mex -c -largeArrayDims -lmwblas "matlab/src/bits/subsample.cpp"
- Open the VS command prompt and change to {matconvnet_root}:
- nvcc -c -gencode=arch=compute_20,code=sm_21 -gencode=arch=compute_30,code=sm_30 --compiler-options=-fPIC "matlab/src/bits/im2col_gpu.cu"
- nvcc -c -gencode=arch=compute_20,code=sm_21 -gencode=arch=compute_30,code=sm_30 --compiler-options=-fPIC "matlab/src/bits/pooling_gpu.cu"
- nvcc -c -gencode=arch=compute_20,code=sm_21 -gencode=arch=compute_30,code=sm_30 --compiler-options=-fPIC "matlab/src/bits/normalize_gpu.cu"
- nvcc -c -gencode=arch=compute_20,code=sm_21 -gencode=arch=compute_30,code=sm_30 --compiler-options=-fPIC "matlab/src/bits/subsample_gpu.cu"
- Switch back to Matlab:
- setenv('MW_NVCC_PATH','C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v6.5\bin')
- mex "matlab/src/vl_nnconv.cu" "normalize.obj" "normalize_gpu.obj" "pooling.obj" "pooling_gpu.obj" "subsample_gpu.obj" "subsample.obj" "im2col_gpu.obj" -DENABLE_GPU -f mex_CUDA_win64.xml -largeArrayDims -lmwblas -L"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v6.5\lib\x64" -lcublas -lcudart /NODEFAULTLIB:LIBCMT.lib
- mex "matlab/src/vl_nnnormalize.cu" "normalize.obj" "normalize_gpu.obj" "pooling.obj" "pooling_gpu.obj" "subsample_gpu.obj" "subsample.obj" "im2col_gpu.obj" -DENABLE_GPU -f mex_CUDA_win64.xml -largeArrayDims -lmwblas -L"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v6.5\lib\x64" -lcublas -lcudart /NODEFAULTLIB:LIBCMT.lib
- mex "matlab/src/vl_nnpool.cu" "normalize.obj" "normalize_gpu.obj" "pooling.obj" "pooling_gpu.obj" "subsample_gpu.obj" "subsample.obj" "im2col_gpu.obj" -DENABLE_GPU -f mex_CUDA_win64.xml -largeArrayDims -lmwblas -L"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v6.5\lib\x64" -lcublas -lcudart /NODEFAULTLIB:LIBCMT.lib
Compilation is done, and running 'matlab/xtest/vl_test_nnlayers(1)' passes. That is roughly the whole process.
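The two Matlab steps are repetitive, so they could be batched. The following is only a sketch under assumptions: the variable names (dryRun, bits, layers, objs and so on) are illustrative and not part of MatConvNet, the flags and the object list are copied from the commands listed earlier, and the nvcc step is still expected to be run by hand in the VS command prompt. By default it only prints the mex commands it would run.

```matlab
% Sketch only: batches the mex steps from this post. Run from {matconvnet_root}.
% All variable names here are illustrative, not part of MatConvNet.
dryRun   = true;   % set to false to actually invoke mex
cudaRoot = 'C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v6.5';
bits     = {'im2col', 'pooling', 'normalize', 'subsample'};
layers   = {'vl_nnconv', 'vl_nnnormalize', 'vl_nnpool'};

% Argument lists for compiling the CPU sources to object files.
compileArgs = cell(1, numel(bits));
for i = 1:numel(bits)
    compileArgs{i} = {'-c', '-largeArrayDims', '-lmwblas', ...
                      ['matlab/src/bits/' bits{i} '.cpp']};
end

% Argument lists for building each layer. The object list is copied
% verbatim from the mex commands in this post.
objs = {'normalize.obj', 'normalize_gpu.obj', 'pooling.obj', 'pooling_gpu.obj', ...
        'subsample_gpu.obj', 'subsample.obj', 'im2col_gpu.obj'};
linkArgs = cell(1, numel(layers));
for i = 1:numel(layers)
    linkArgs{i} = [{['matlab/src/' layers{i} '.cu']}, objs, ...
                   {'-DENABLE_GPU', '-f', 'mex_CUDA_win64.xml', ...
                    '-largeArrayDims', '-lmwblas', ...
                    ['-L' cudaRoot '\lib\x64'], ...
                    '-lcublas', '-lcudart', '/NODEFAULTLIB:LIBCMT.lib'}];
end

% Compile the CPU objects (or just print the commands in dry-run mode).
for i = 1:numel(compileArgs)
    if dryRun
        fprintf('mex %s\n', strjoin(compileArgs{i}, ' '));
    else
        mex(compileArgs{i}{:});
    end
end

% Before linking, make sure the nvcc step has produced its object files.
if ~dryRun
    setenv('MW_NVCC_PATH', [cudaRoot '\bin']);
    missing = objs(cellfun(@(f) exist(f, 'file') ~= 2, objs));
    assert(isempty(missing), 'Missing object files: %s', strjoin(missing, ', '));
end

% Build the three layers.
for i = 1:numel(linkArgs)
    if dryRun
        fprintf('mex %s\n', strjoin(linkArgs{i}, ' '));
    else
        mex(linkArgs{i}{:});
    end
end
```

With dryRun set to false, the script stops at the assert if the nvcc objects do not exist yet. Run the nvcc commands and then rerun it; recompiling the CPU objects is harmless.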
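According to Xiao J, there is one odd thing: convolution raises an error on GTX980 and GTX970 cards. This was checked carefully: it is neither a CUDA SDK problem nor a graphics driver problem, and cards such as the GTX660 work without issue. My initial suspicion is that the Maxwell instruction set is incompatible with the Kepler instruction set, but that is not something I need to deal with.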
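One possible explanation, untested here, follows from the nvcc flags used in this post. They embed machine code only for sm_21 and sm_30, and there is no code=compute_XX entry, so no PTX is included for the driver to JIT-compile. The GTX980 and GTX970 are compute capability 5.2 devices and the GTX660 is 3.0, so the Maxwell cards may simply find no usable kernel image. A minimal sketch of that check, where hasNative and builtTargets are illustrative names and capabilities are encoded as major*10+minor:

```matlab
% Targets copied from the nvcc flags in this post: code=sm_21 and code=sm_30.
builtTargets = [21 30];

% A binary built for sm_XY can run on a device with the same major version
% and a minor version of at least Y.
hasNative = @(cc, targets) any(floor(targets / 10) == floor(cc / 10) & targets <= cc);

fprintf('GTX660 (3.0) has a native binary: %d\n', hasNative(30, builtTargets));
fprintf('GTX980 (5.2) has a native binary: %d\n', hasNative(52, builtTargets));

% Optional: query the installed card (assumes the Parallel Computing Toolbox).
if exist('gpuDevice', 'file') && gpuDeviceCount > 0
    dev = gpuDevice();
    cc  = round(str2double(dev.ComputeCapability) * 10);
    fprintf('%s (%s) has a native binary: %d\n', ...
            dev.Name, dev.ComputeCapability, hasNative(cc, builtTargets));
end
```

If this is the cause, adding a Maxwell target such as -gencode=arch=compute_50,code=sm_50 to the nvcc commands might help. That is an assumption and has not been verified against this build.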
P.S. I made small changes to the source files, mainly replacing some Linux-only functions.
