1。如何讀取UTF-8編碼的文本文件?
2。如何讀取GB2312(中文)的文本文件?
3。如何讀取其它編碼文件?
首先解決第一個問題,
1。如何讀取UTF-8編碼的文本文件?
NSString *filePath = [[[NSBundle mainBundle] bundlePath] stringByAppendingPathComponent:fileName]; [NSString stringWithContentsOfFile:filePath encoding:NSUTF8StringEncoding error:nil] //當然也可以采用如下方法 //NSData *data = [NSData dataWithContentsOfFile:filePath]; //NSString *textFile = [[NSString alloc] initWithData:data encoding:NSUTF8StringEncoding];
2。如何讀取GB2312(中文)的文本文件?
可能很多人會想到,那還不簡單,apple肯定提供了gb2312的文件編碼常量,讓我們看看有那些常量吧,看看 NSUTF8StringEncoding定義文件,應該會有GB2312的常量定義。
/* Note that in addition to the values explicitly listed below, NSStringEncoding supports encodings provided by CFString.
See CFStringEncodingExt.h for a list of these encodings.
See CFString.h for functions which convert between NSStringEncoding and CFStringEncoding.
*/
enum {
NSASCIIStringEncoding = 1,/* 0..127 only */
NSNEXTSTEPStringEncoding = 2,
NSJapaneseEUCStringEncoding = 3,
NSUTF8StringEncoding = 4,
NSISOLatin1StringEncoding = 5,
NSSymbolStringEncoding = 6,
NSNonLossyASCIIStringEncoding = 7,
NSShiftJISStringEncoding = 8, /* kCFStringEncodingDOSJapanese */
NSISOLatin2StringEncoding = 9,
NSUnicodeStringEncoding = 10,
NSWindowsCP1251StringEncoding = 11, /* Cyrillic; same as AdobeStandardCyrillic */
NSWindowsCP1252StringEncoding = 12, /* WinLatin1 */
NSWindowsCP1253StringEncoding = 13, /* Greek */
NSWindowsCP1254StringEncoding = 14, /* Turkish */
NSWindowsCP1250StringEncoding = 15, /* WinLatin2 */
NSISO2022JPStringEncoding = 21, /* ISO 2022 Japanese encoding for e-mail */
NSMacOSRomanStringEncoding = 30,
NSUTF16StringEncoding = NSUnicodeStringEncoding, /* An alias for NSUnicodeStringEncoding */
NSUTF16BigEndianStringEncoding = 0x90000100, /* NSUTF16StringEncoding encoding with explicit endianness specified */
NSUTF16LittleEndianStringEncoding = 0x94000100, /* NSUTF16StringEncoding encoding with explicit endianness specified */
NSUTF32StringEncoding = 0x8c000100,
NSUTF32BigEndianStringEncoding = 0x98000100, /* NSUTF32StringEncoding encoding with explicit endianness specified */
NSUTF32LittleEndianStringEncoding = 0x9c000100 /* NSUTF32StringEncoding encoding with explicit endianness specified */
};
typedef NSUInteger NSStringEncoding;
很遺憾,我沒找到,不過Apple留了個線索讓我取找,不過仔細看看它的注釋
Note that in addition to the values explicitly listed below, NSStringEncoding supports encodings provided by CFString.
See CFStringEncodingExt.h for a list of these encodings.
See CFString.h for functions which convert between NSStringEncoding and CFStringEncoding.
我的英文比較不好,但大意是看明白了,不在下面支持了編碼格式,在CFStringEncodingExt.h 里頭文件申明 。
我們通過finder的文件查找方法找到CFStringEncodingExt.h 。
仔細看看確實找到了kCFStringEncodingGB_18030_2000 ( 我以為是kCFStringEncodingGB_2312_80 ,實際上不是) 但是這個是CFStringEncoding類型,我們需要NSEncode的類型。
CFString 和NSString具有相同的內存結構,也是NSString的重要補充,通過查找CFString的幫助文檔,找到了這個方法 CFStringConvertEncodingToNSStringEncoding
NSStringEncoding enc = CFStringConvertEncodingToNSStringEncoding(kCFStringEncodingGB_18030_2000); NSString *textFile = [NSString stringWithContentsOfFile:filePath encoding:enc error:nil];
那么第二個問題就解決了
3。如何讀取其它編碼文件?
相信,通過以上的方法,第三個問題也可以很順利的解決了。
文章地址:http://www.cnblogs.com/likwo/archive/2011/06/26/2090914.html