本系列文章由 @yhl_leo 出品,轉載請注明出處。
文章鏈接: http://blog.csdn.net/yhl_leo/article/details/49737357
本文僅對 Opencv圖像讀取與保存進行闡述,重在探討圖像讀取與保存過程中應注意的細節問題。
1 圖像讀取
首先看一下,imread函數的聲明:
// C++: Mat based
Mat imread(const string& filename, int flags=1 );
// C: IplImage based
IplImage* cvLoadImage(const char* filename, int iscolor=CV_LOAD_IMAGE_COLOR );
// C: CvMat based
CvMat* cvLoadImageM(const char* filename, int iscolor=CV_LOAD_IMAGE_COLOR );
此處,就不列出python
的函數聲明。隨着2.x和3.x版本不斷更新, Opencv的C++
版本數據結構和C
版本有較大差異,前者減少了指針的大量使用,用法更加便捷,因此建議多使用前者。以C++
版本函數進行分析,形參列表包括:
filename
: 待加載圖像(包括:文件路徑和文件名,圖像在工程默認路徑下的可省略文件路徑);flags
: 標志符,指定圖像加載顏色類型,默認值為1:- IMREAD_UNCHANGED / CV_LOAD_IMAGE_UNCHANGED :不加改變的加載原圖。
- IMREAD_GRAYSCALE / CV_LOAD_IMAGE_GRAYSCALE :圖像轉為灰度圖(GRAY,1通道)。
- IMREAD_COLOR / CV_LOAD_IMAGE_COLOR :圖像轉為彩色圖(BGR,3通道)。
- IMREAD_ANYDEPTH / CV_LOAD_IMAGE_ANYDEPTH :任何位深度,如果載入的圖像不是16-bit位圖或者32-bit位圖,則轉化為8-bit位圖。
- IMREAD_ANYCOLOR / CV_LOAD_IMAGE_ANYCOLOR :任何彩色,單獨使用的時候等價於 IMREAD_UNCHANGED / CV_LOAD_IMAGE_UNCHANGED 。
- > 0 :返回3通道的彩色圖,但是如果是4通道(RGBA),其中Alpha需要保留的話,不建議這么使用,因為一旦這么使用,就會導致Alpha通道被剝離掉,此時建議使用負值。
- = 0 :返回灰度圖像。
- < 0 :返回具有Alpha通道的圖像。
如果你喜歡使用
imread("file.jpg")
缺省參數的形式加載圖像,務必要留意你所加載后的圖像可能已經不是你原本想要的圖像了!
從 Opencv源碼枚舉類型中也可以看到上述標識符含義:
// highgui.hpp
enum
{
// 8bit, color or not
IMREAD_UNCHANGED =-1,
// 8bit, gray
IMREAD_GRAYSCALE =0,
// ?, color
IMREAD_COLOR =1,
// any depth, ?
IMREAD_ANYDEPTH =2,
// ?, any color
IMREAD_ANYCOLOR =4
};
// highui_c.h
enum
{
/* 8bit, color or not */
CV_LOAD_IMAGE_UNCHANGED =-1,
/* 8bit, gray */
CV_LOAD_IMAGE_GRAYSCALE =0,
/* ?, color */
CV_LOAD_IMAGE_COLOR =1,
/* any depth, ? */
CV_LOAD_IMAGE_ANYDEPTH =2,
/* ?, any color */
CV_LOAD_IMAGE_ANYCOLOR =4
};
Opencv已經支持目前很多圖像格式,但是並非全部。主要包括:
- Windows bitmaps
*.bmp
,*.dib
(always supported) - JPEG files
*.jpeg
,*.jpg
,*.jpe
(see the Notes section) - JPEG 2000 files
*.jp2
,*.jpf
,*.jpx
(see the Notes section) - Portable Network Graphics
*.png
(see the Notes section) - WebP
*.webp
(see the Notes section) - Portable image format
*.pbm
,*.pgm
,*.ppm
(always supported) - Sun rasters
*.sr
,*.ras
(always supported) TIFF files
*.tiff
,*.tif
(see the Notes section)Notes
- 1 The function determines the type of an image by the content, not by the file extension.
- 2 On Microsoft Windows* OS and MacOSX*, the codecs shipped with an OpenCV image (libjpeg, libpng, libtiff, and libjasper) are used by default. So, OpenCV can always read JPEGs, PNGs, and TIFFs. On MacOSX, there is also an option to use native MacOSX image readers. But beware that currently these native image loaders give images with different pixel values because of the color management embedded into MacOSX.
- 3 On Linux*, BSD flavors and other Unix-like open-source operating systems, OpenCV looks for codecs supplied with an OS image. Install the relevant packages (do not forget the development files, for example, “libjpeg-dev”, in Debian* and Ubuntu*) to get the codec support or turn on the OPENCV_BUILD_3RDPARTY_LIBS flag in CMake.
- 4 In the case of color images, the decoded images will have the channels stored in B G R order.
對於常見的支持4通道的圖像格式來說, Opencv讀取結果是有差異的:
// 1.tif, 1.jp2 and 1.png are color images with 4 channels: R, G, B, A
cv::Mat imageTif = cv::imread("E:\\1.tif"); // the default flags is 1
cv::Mat imageJp2 = cv::imread("E:\\1.jp2"); // the default flags is 1
cv::Mat imagePng = cv::imread("E:\\1.png"); // the default flags is 1
std::cout << imageTif.channels() << std::endl; // prints 3
std::cout << imageJp2.channels() << std::endl; // prints 3
std::cout << imagePng.channels() << std::endl; // prints 3
cv::Mat imageTif2 = cv::imread("E:\\1.tif", -1); // flags = -1
cv::Mat imageJp22 = cv::imread("E:\\1.jp2", -1);
cv::Mat imagePng2 = cv::imread("E:\\1.png", -1);
std::cout << imageTif2.channels() << std::endl; // prints 3
std::cout << imageJp22.channels() << std::endl; // prints 3
std::cout << imagePng2.channels() << std::endl; // prints 4
由此可見,目前 Opencv能夠直接讀取4通道圖像並保留Alpha通道的貌似只有PNG格式,對於非PNG格式數據,需要保留Alpha通道的應用,如果堅持使用 Opencv庫,建議轉格式吧~
2 圖像存儲
首先來看,imwrite函數的聲明:
// c++: Mat based
bool imwrite(const string& filename, InputArray img, const vector<int>& params=vector<int>() );
// C: CvMat and IplImage based
int cvSaveImage(const char* filename, const CvArr* image, const int* params=0 );
仍舊以C++
版本為例,其形參列表為:
filename
:待保存圖像名(包括:文件路徑和文件名,圖像在工程默認路徑下的可省略文件路徑);img
:待保存的圖像對象;params
:特定圖像存儲編碼參數設置,以類似pairs
類型的方式,(paramId_1, paramValue_1)
,(paramId_2, paramValue_2)
… ,其中paramId_1
就是標志符值,paramValue_1
標識符值對應的后續參數設置:
vector<int> compression_params;
compression_params.push_back(CV_IMWRITE_PNG_COMPRESSION); // paramId_1, png compression
compression_params.push_back(9); // paramValue_2, compression level is 9
在 Opencv中,主要對JPEG,PNG和PXM的編碼方式進行了特別聲明:
// highgui.hpp
enum
{
IMWRITE_JPEG_QUALITY =1, // quality from 0 to 100, default value is 95. (The higher is the better)
IMWRITE_PNG_COMPRESSION =16, // compression level from 0 to 9, default value is 3. (A higher value means a smaller size and longer compression time. Default value is 3.)
IMWRITE_PNG_STRATEGY =17,
IMWRITE_PNG_BILEVEL =18,
IMWRITE_PNG_STRATEGY_DEFAULT =0,
IMWRITE_PNG_STRATEGY_FILTERED =1,
IMWRITE_PNG_STRATEGY_HUFFMAN_ONLY =2,
IMWRITE_PNG_STRATEGY_RLE =3,
IMWRITE_PNG_STRATEGY_FIXED =4,
IMWRITE_PXM_BINARY =32 // binary format flag: 0 or 1, default value is 1.
};
// highui_c.h
enum
{
CV_IMWRITE_JPEG_QUALITY =1,
CV_IMWRITE_PNG_COMPRESSION =16,
CV_IMWRITE_PNG_STRATEGY =17,
CV_IMWRITE_PNG_BILEVEL =18,
CV_IMWRITE_PNG_STRATEGY_DEFAULT =0,
CV_IMWRITE_PNG_STRATEGY_FILTERED =1,
CV_IMWRITE_PNG_STRATEGY_HUFFMAN_ONLY =2,
CV_IMWRITE_PNG_STRATEGY_RLE =3,
CV_IMWRITE_PNG_STRATEGY_FIXED =4,
CV_IMWRITE_PXM_BINARY =32
};
上述的標識符含義,顯而易見,就不累述。值得強調的是,imwrite函數支持存儲的圖像類型是有限的只包括:1,3,4通道的圖像,但是對於不同的圖像格式,也是有差異的:
- 對於單通道8-bit位圖(或者16-bit位圖( CV_16U/CV_16UC1 的PNG,JPEG 2000 和TIFF))或者3通道(通道順序為:B G R )的圖像,imwrite函數是都支持的。對於格式,或者位深或者通道順序與上面不一致的,可以使用函數
Mat::convertTo()
和cvtColor()
函數進行轉換后,再保存。當然,也可以使用通用的方法利用FileStorage
I/O操作,將圖像存為XML或YAML格式。 - 對於PNG圖像,可以保存其Alpha通道,創建一個8-bit或者16-bit 4通道的位圖(通道順序為:B G R A ),如果是全透明的Alpha通道設置為0,反之不透明設置為255/65535。
對於多通道圖像,如果想對其每個通道單獨進行保存,當然也是可行的,一方面自己可以根據圖像的信息和圖層信息寫出相應的存儲函數,另一方面 Opencv也提供了專門的函數split
可以將圖像的每個通道提取出保存到vector
中:

PNG原圖
cv::Mat img = imread( "C:\\Users\\Leo\\Desktop\\Panda.png", CV_LOAD_IMAGE_UNCHANGED );
std::vector<cv::Mat> imageChannels;
cv::split( img, imageChannels );
cv::imwrite("E:\\0.jpg", imageChannels[0]);
cv::imwrite("E:\\1.jpg", imageChannels[1]);
cv::imwrite("E:\\2.jpg", imageChannels[2]);
cv::imwrite("E:\\3.jpg", imageChannels[3]);
B




通道分離保存結果
附上 Opencv文檔源碼:
#include <vector>
#include <stdio.h>
#include <opencv2/opencv.hpp>
using namespace cv;
using namespace std;
void createAlphaMat(Mat &mat)
{
CV_Assert(mat.channels() == 4);
for (int i = 0; i < mat.rows; ++i) {
for (int j = 0; j < mat.cols; ++j) {
Vec4b& bgra = mat.at<Vec4b>(i, j);
bgra[0] = UCHAR_MAX; // Blue
bgra[1] = saturate_cast<uchar>((float (mat.cols - j)) / ((float)mat.cols) * UCHAR_MAX); // Green
bgra[2] = saturate_cast<uchar>((float (mat.rows - i)) / ((float)mat.rows) * UCHAR_MAX); // Red
bgra[3] = saturate_cast<uchar>(0.5 * (bgra[1] + bgra[2])); // Alpha
}
}
}
int main(int argv, char **argc)
{
// Create mat with alpha channel
Mat mat(480, 640, CV_8UC4);
createAlphaMat(mat);
vector<int> compression_params;
compression_params.push_back(CV_IMWRITE_PNG_COMPRESSION);
compression_params.push_back(9);
try {
imwrite("alpha.png", mat, compression_params);
}
catch (runtime_error& ex) {
fprintf(stderr, "Exception converting image to PNG format: %s\n", ex.what());
return 1;
}
fprintf(stdout, "Saved PNG file with alpha data.\n");
return 0;
}
運行結果為:
