如何在 java 中将 UTF-16 转换为 UTF-32?
How can I convert UTF-16 to UTF-32 in java?
我一直在寻找解决方案,但关于这个主题的内容似乎并不多。我找到了建议的解决方案:
String unicodeString = new String("utf8 here");
byte[] bytes = String.getBytes("UTF8");
String converted = new String(bytes,"UTF16");
用于从 utf8 转换为 utf16,但是,java 不处理 "UTF32",这使得该解决方案不可行。有谁知道如何实现此目标的任何其他方法?
public static char[] bytesToHex(byte[] raw) {
int length = raw.length;
char[] hex = new char[length * 2];
for (int i = 0; i < length; i++) {
int value = (raw[i] + 256) % 256;
int highIndex = value >> 4;
int lowIndex = value & 0x0f;
hex[i * 2 + 0] = kDigits[highIndex];
hex[i * 2 + 1] = kDigits[lowIndex];
}
return hex;
}
byte[] bytearr = converted.getBytes("UTF-32");
System.out.println("With UTF-32 encoding:\t" + String.valueOf(bytesToHex(bytearr)));
System.out.println("With UTF-32 decoding:\t" + new String((bytearr), "UTF-32"));
Java 确实处理 UTF-32,试试这个测试
byte[] a = "1".getBytes("UTF-32");
System.out.println(a.length);
它将显示数组的长度 = 4
搜索后,我开始使用它:
public static String convert16to32(String toConvert){
for (int i = 0; i < toConvert.length(); ) {
int codePoint = Character.codePointAt(toConvert, i);
i += Character.charCount(codePoint);
//System.out.printf("%x%n", codePoint);
String utf32 = String.format("0x%x%n", codePoint);
return utf32;
}
return null;
}
我一直在寻找解决方案,但关于这个主题的内容似乎并不多。我找到了建议的解决方案:
String unicodeString = new String("utf8 here");
byte[] bytes = String.getBytes("UTF8");
String converted = new String(bytes,"UTF16");
用于从 utf8 转换为 utf16,但是,java 不处理 "UTF32",这使得该解决方案不可行。有谁知道如何实现此目标的任何其他方法?
public static char[] bytesToHex(byte[] raw) {
int length = raw.length;
char[] hex = new char[length * 2];
for (int i = 0; i < length; i++) {
int value = (raw[i] + 256) % 256;
int highIndex = value >> 4;
int lowIndex = value & 0x0f;
hex[i * 2 + 0] = kDigits[highIndex];
hex[i * 2 + 1] = kDigits[lowIndex];
}
return hex;
}
byte[] bytearr = converted.getBytes("UTF-32");
System.out.println("With UTF-32 encoding:\t" + String.valueOf(bytesToHex(bytearr)));
System.out.println("With UTF-32 decoding:\t" + new String((bytearr), "UTF-32"));
Java 确实处理 UTF-32,试试这个测试
byte[] a = "1".getBytes("UTF-32");
System.out.println(a.length);
它将显示数组的长度 = 4
搜索后,我开始使用它:
public static String convert16to32(String toConvert){
for (int i = 0; i < toConvert.length(); ) {
int codePoint = Character.codePointAt(toConvert, i);
i += Character.charCount(codePoint);
//System.out.printf("%x%n", codePoint);
String utf32 = String.format("0x%x%n", codePoint);
return utf32;
}
return null;
}