Java unicode中文編碼轉換和反轉

本文轉載自查看原文 2013-09-19 00:17 15603 java/ unicode

參考網址http://www.oschina.net/code/snippet_142385_4297

http://canofy.iteye.com/blog/718659

在java的很多配置文件中，尤其是國際化資源中經常遇到類似\uf432這樣的unicode編碼，搜集了下該編碼相關的資料，大致處理方法有如下：

1、Unicode轉漢字字符串。

這個過程最簡單的方式就是直接獲取。比如

String cnStr = "\ufeff\u4e2d\u56fd\u4eba";

System.out.println(cnStr); 即可獲取對應的漢字字符 “中國人”；

但是呢，每次從輸出讀的話也未免過於不方便了，我們使用方法來做轉換，直接獲取。

參考如下

	public static String unicodeToString(String str) {

        Pattern pattern = Pattern.compile("(\\\\u(\\p{XDigit}{4}))");    
        Matcher matcher = pattern.matcher(str);
        char ch;
        while (matcher.find()) {
            ch = (char) Integer.parseInt(matcher.group(2), 16);
            str = str.replace(matcher.group(1), ch + "");    
        }
        return str;
    }

2、獲取字符串的unicode編碼，這個我們可以通過直接獲取字符串的unicode二進制，然后將其byte轉換成對應的16進制表示即可，函數示例如下

static String getUnicode(String s) {
		try {
			StringBuffer out = new StringBuffer("");
			byte[] bytes = s.getBytes("unicode");
			for (int i = 0; i < bytes.length - 1; i += 2) {
				out.append("\\u");
				String str = Integer.toHexString(bytes[i + 1] & 0xff);
				for (int j = str.length(); j < 2; j++) {
					out.append("0");
				}
				String str1 = Integer.toHexString(bytes[i] & 0xff);
				out.append(str1);
				out.append(str);
				
			}
			return out.toString();
		} catch (UnsupportedEncodingException e) {
			e.printStackTrace();
			return null;
		}
	}

通過上面的方式便可完整的使用unicode編碼了，大家有其他方式的轉換也可以告訴我下，互相學習

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 Qt中文編碼和QString類Unicode編碼轉換 QString 中文編碼轉換 Java中文編碼小結 Java實現中文轉換成Unicode編碼和 Unicode編碼轉換成中文 (轉)網址中的中文編碼轉換 Django - HttpResponse返回JSON數據時中文編碼為Unicode java中文和unicode編碼相互轉換(轉) Unicode與中文的轉換-java python 中文編碼(一) python unicode轉中文及轉換默認編碼