These are the converters and Charsets supported by IBM Java 2 version 1.4. JDK 1.4 introduces a new set of character conversion APIs as part of the nio. The names for the nio Charsets are normally the IANA primary name.
Most of the converters are also supported in earlier releases. The notes indicate whether this is an IBM only converter, and the release in which it was introduced. Some of the converters were introduced in service refreshes. This document reflects the current state of these releases.
The nio Charset name can be used as an alias for legacy converters, and the legacy converter name can be used as an alias for nio Charsets. Other available aliases are shown in the aliases column. The alias names are case independent. Some aliases have been added in releases after the one in which the converter was added.
The images of the codepages show the glyphs and unicode mappings of the converter. However, the unicode values may not exactly match the ones used by Java. Only the first part of complex codepages are shown.
Also see: detailed mapping from IBM CCSIDs to Java converters.
Converter | Charset | Description | Aliases | Notes |
ASCII | US-ASCII | ASCII 7 bit | 646 ANSI_X3.4-1968 ANSI_X3.4-1986 ascii7 Cp367 csASCII default direct ibm-367 iso-646.irv:1983 ISO-646.irv:1991 ISO646-US iso-ir-6 us | |
Cp037 | IBM-037 | EBCDIC United States | ibm037 | |
Cp273 | IBM-273 | EBCDIC Germany | ibm273 | |
Cp277 | IBM-277 | EBCDIC Denmark, Norway | ibm277 | |
Cp278 | IBM-278 | EBCDIC Finland, Sweden | ibm278 | |
Cp280 | IBM-280 | EBCDIC Italy | ibm280 | |
Cp284 | IBM-284 | EBCDIC Spain, Latin America | ibm284 | |
Cp285 | IBM-285 | EBCDIC UK, Ireland | ibm285 | |
Cp290 | IBM-290 | EBCDIC SBCS Japan Katakana | ibm290 | ibm 1.3 |
Cp297 | IBM-297 | EBCDIC France | ibm297 | |
Cp300 | IBM-300 | EBCDIC DBCS-only Japan | ibm300 | ibm 1.3 |
Cp301 | IBM-301 | PC DBCS-only Japan | ibm301 | ibm 1.3 |
Cp420 | IBM-420 | EBCDIC Arabic | ibm420 | |
Cp420S | IBM-420S | EBCDIC Arabic shaped | ibm420S | ibm 1.2 |
Cp424 | IBM-424 | EBCDIC Hebrew | ibm424 | |
Cp437 | IBM-437 | PC United States | cspc8codepage437 ibm437 | |
Cp500 | IBM-500 | EBCDIC Latin 1 | ibm500 | |
Cp737 | IBM-737 | MS-DOS Greek | ibm737 | |
Cp775 | IBM-775 | MS-DOS Baltic Rim | ibm775 | |
Cp808 | IBM-808 | PC Russian (euro) | ibm808 | ibm 1.4 |
Cp833 | IBM-833 | EBCDIC SBCS Korean | ibm833 | ibm 1.3 |
Cp834 | IBM-834 | EBCDIC DBCS-only Korean | ibm834 | ibm 1.3 |
Cp835 | IBM-835 | EBCDIC DBCS-only Taiwan | ibm835 | ibm 1.3 |
Cp836 | IBM-836 | EBCDIC SBCS China | ibm836 | ibm 1.3 |
Cp837 | IBM-837 | EBCDIC DBCS-only China | ibm837 | ibm 1.3 |
Cp838 | IBM-838 | EBCDIC Thai | ibm838 | |
Cp850 | IBM-850 | PC Latin 1 | cspc850multilingual ibm850 | |
Cp852 | IBM-852 | PC Latin 2 | cspcp852 ibm852 | |
Cp855 | IBM-855 | PC Cyrillic | ibm855 | |
Cp856 | IBM-856 | PC Hebrew (old) | ibm856 | |
Cp857 | IBM-857 | PC Latin 5 | csibm857 ibm857 | |
Cp858 | IBM-858 | PC Latin 1 (euro) | ibm00858 ibm858 | |
Cp859 | IBM-859 | PC Latin 9 | ibm859 | ibm 1.2 |
Cp860 | IBM-860 | PC Portugal | ibm860 | |
Cp861 | IBM-861 | PC Iceland | ibm861 | |
Cp862 | IBM-862 | PC Israel | ibm862 | |
Cp863 | IBM-863 | PC Canadian French | ibm863 | |
Cp864 | IBM-864 | PC Arabic | ibm864 | |
Cp864S | IBM-864S | PC Arabic (shaped) | ibm864S | ibm 1.2 |
Cp865 | IBM-865 | PC Nordic | ibm865 | |
Cp866 | IBM-866 | PC Russia | ibm866 | |
Cp867 | IBM-867 | PC Israel | ibm867 | ibm 1.2 |
Cp868 | IBM-868 | PC Urdu | ibm868 | |
Cp869 | IBM-869 | PC Greece | ibm869 | |
Cp870 | IBM-870 | EBCDIC Latin 2 | ibm870 | |
Cp871 | IBM-871 | EBCDIC Iceland | ibm871 | |
Cp874 | IBM-874 | PC Thai | ibm874 | |
Cp875 | IBM-875 | EBCDIC Greek | ibm875 | |
Cp897 | IBM-897 | PC SBCS-only Japan | ibm897 | ibm 1.3 |
Cp918 | IBM-918 | EBCDIC Urdu | ibm918 | |
Cp921 | IBM-921 | PC Baltic | ibm921 | |
Cp921 | ISO-8859-13 | PC Baltic | 8859_13 iso8859_13 iso8859-13 iso-8859-13 | |
Cp922 | IBM-922 | PC Estonian | ibm922 | |
Cp924 | IBM-924 | EBCDIC Latin 9 | ibm00924 ibm924 | ibm 1.2 |
Cp927 | IBM-927 | PC DBCS-only Taiwan | ibm927 | ibm 1.3 |
Cp930 | IBM-930 | EBCDIC Japan DBCS | ibm930 | |
Cp933 | IBM-933 | EBCDIC Korea DBCS | ibm933 | |
Cp935 | IBM-935 | EBCDIC China DBCS | ibm935 | |
Cp937 | IBM-937 | EBCDIC Taiwan DBCS | ibm937 | |
Cp939 | IBM-939 | EBCDIC Japan Extended DBCS | ibm939 | |
Cp942 | IBM-942 | PC Japan SJIS-78 | ibm942 | |
Cp942C | IBM-932 | PC Japan SJIS-78 syntax | Cp932 ibm932 | |
Cp942C | IBM-942C | PC Japan SJIS-78 syntax | ibm942C | |
Cp943 | IBM-943 | PC Japan SJIS-90 | ibm943 | |
Cp943C | IBM-943C | PC Japan SJIS-90 syntax | ibm943C | |
Cp947 | IBM-947 | PC DBCS-only Taiwan | ibm947 | ibm 1.3 |
Cp948 | IBM-948 | PC Taiwan SAA | ibm948 | |
Cp949 | IBM-949 | PC Korea (KS) | ibm949 | |
Cp949C | IBM-949C | PC Korea (KS) | ibm949C | |
Cp950 | IBM-950 | PC Taiwan | ibm950 | |
Cp951 | IBM-951 | PC DBCS-only Korean | ibm951 | ibm 1.3 |
Cp954 | IBM-954 | EUC Japan | ibm954 | ibm 1.4 |
Cp954C | IBM-954C | EUC Japan with syntax | Cp954c ibm-eucjp | ibm 1.4 |
Cp964 | IBM-964 | EUC Taiwan | ibm-euctw | |
Cp970 | EUC-KR | EUC Korea | 5601 euc_kr euckr ibm-970 ibm-euckr ks_c_5601-1987 ksc_5601 ksc5601 ksc5601_1987 ksc5601-1987 | |
Cp971 | IBM-971 | EUC DBCS-only Korean | ibm971 | ibm 1.3 |
Cp1006 | IBM-1006 | ISO Urdu | ibm1006 | |
Cp1025 | IBM-1025 | EBCDIC Cyrillic | ibm1025 | |
Cp1026 | IBM-1026 | EBCDIC Latin 5 (Turkey) | ibm1026 | |
Cp1027 | IBM-1027 | EBCDIC SBCS Japan | ibm1027 | ibm 1.3 |
Cp1041 | IBM-1041 | PC SBCS-only Japan | ibm1041 | ibm 1.3 |
Cp1043 | IBM-1043 | PC SBCS-only Taiwan | ibm1043 | ibm 1.3 |
Cp1046 | IBM-1046 | EBCDIC Arabic | ibm1046 | |
Cp1046S | IBM-1046S | EBCDIC Arabic (shaped) | ibm1046S | ibm 1.2 |
Cp1047 | IBM-1047 | EBCDIC Open Edition | ibm1047 | ibm 1.2 |
Cp1088 | IBM-1088 | PC SBCS-only Korean KS | ibm1088 | ibm 1.3 |
Cp1097 | IBM-1097 | EBCDIC Farsi | ibm1097 | |
Cp1098 | IBM-1098 | ISO Farsi | ibm1098 | |
Cp1112 | IBM-1112 | EBCDIC Baltic | ibm1112 | |
Cp1114 | IBM-1114 | PC SBCS-only Taiwan | ibm1114 | ibm 1.3 |
Cp1115 | IBM-1115 | PC SBCS-only China | ibm1115 | ibm 1.3 |
Cp1122 | IBM-1122 | EBCDIC Estonia | ibm1122 | |
Cp1123 | IBM-1123 | EBCDIC Ukraine | ibm1123 | |
Cp1124 | IBM-1124 | PC Ukraine | ibm1124 | |
Cp1140 | IBM-1140 | ECECP United States | ibm01140 | |
Cp1141 | IBM-1141 | ECECP Germany | ibm01141 | |
Cp1142 | IBM-1142 | ECECP Denmark, Norway | ibm01142 | |
Cp1143 | IBM-1143 | ECECP Finland, Sweden | ibm01143 | |
Cp1144 | IBM-1144 | ECECP Italy | ibm01144 | |
Cp1145 | IBM-1145 | ECECP Spain | ibm01145 | |
Cp1146 | IBM-1146 | ECECP UK, Ireland | ibm01146 | |
Cp1147 | IBM-1147 | ECECP France | ibm01147 | |
Cp1148 | IBM-1148 | ECECP Multilingual | ibm01148 | |
Cp1149 | IBM-1149 | ECECP Iceland | ibm01149 | |
Cp1250 | windows-1250 | Windows Latin 2 | ibm-1250 | |
Cp1251 | windows-1251 | Windows Cyrillic | ibm-1251 | |
Cp1252 | windows-1252 | Windows Latin 1 | ibm-1252 | |
Cp1253 | windows-1253 | Windows Greek | ibm-1253 | |
Cp1254 | windows-1254 | Windows Latin 5 (Turkey) | ibm-1254 | |
Cp1255 | windows-1255 | Windows Hebrew | ibm-1255 | |
Cp1256 | windows-1256 | Windows Arabic | ibm-1256 | |
Cp1256S | windows-1256S | Windows Arabic (shaped) | Cp1256s ibm-1256s | ibm 1.2 |
Cp1257 | windows-1257 | Windows Latin 4 (Balttic) | ibm-1257 | |
Cp1258 | windows-1258 | Windows Vietnamese | ibm-1129 ibm-1258 | |
Cp1351 | IBM-1351 | PC DBCS-only Japan | ibm1351 | ibm 1.3 |
Cp1362 | IBM-1362 | PC DBCS-only Korea KS extended | ibm1362 | ibm 1.3 |
Cp1363 | IBM-1363 | PC Korea KS extended | ibm1363 | ibm 1.2 |
Cp1363C | IBM-1363C | PC Korea KS extended (syntax) | ibm1363C | ibm 1.2 |
Cp1364 | IBM-1364 | EBCDIC Korea KS extended | ibm1364 | ibm 1.2 |
Cp1370 | IBM-1370 | PC Taiwan (euro) | ibm1370 | ibm 1.2 |
Cp1371 | IBM-1371 | EBCDIC Taiwan (euro) | ibm1371 | ibm 1.2 |
Cp1380 | IBM-1380 | PC DBCS-only China GB | ibm1380 | ibm 1.3 |
Cp1381 | IBM-1381 | PC China GB | ibm1381 | |
Cp1382 | IBM-1382 | EUC DBCS-only China GB | ibm1382 | ibm 1.3 |
Cp1383 | EUC-CN | EUC China | euc_cn euccn gb2312 gb2312-1980 gb2312-80 ibm-euccn | |
Cp1383 | IBM-1383 | EUC China | ibm1383 | |
Cp1385 | IBM-1385 | PC DBCS-only China GBK | ibm1385 | ibm 1.3 |
Cp1386 | IBM-1386 | PC China GBK | ibm1386 | ibm 1.2 |
Cp1388 | IBM-1388 | EBCDIC China GBK | ibm1388 | ibm 1.2 |
Cp1390 | IBM-1390 | EBCDIC Japan Katakana (euro) | ibm1390 | ibm 1,2 |
Cp1399 | IBM-1399 | EBCDIC Japan Latin (euro) | ibm1399 | ibm 1.2 |
Cp33722 | IBM-33722 | EUC Japan | ibm33722 | |
Cp33722C | IBM-33722C | EUC Japan syntax | ibm33722C | ibm 1.2 |
EUC_JP | EUC-JP | EUC Japan | euc_jp euc_jp_linux eucjp euc-jp-linux x-euc-jp x-eucjp | |
EUC_TW | EUC-TW | EUC Taiwan | cns11643 euc_tw euctw | |
GB18030 | GB18030 | China GB18030 | gb18030-2000 ibm-1392 windows-54936 | ibm 1.2, sun 1.4 |
GBK | GBK | China GBK | ||
ISO8859_1 | ISO-8859-1 | ISO Latin 1 | 8859-1 Cp819 csisolatin1 ibm819 ibm-819 iso8859_1 iso8859-1 iso-8859-1:1987 iso-ir-100 l1 latin1 | |
ISO8859_2 | ISO-8859-2 | ISO Latin 2 (Eastern Europe) | 8859-2 Cp912 csisolatin2 ibm912 ibm-912 iso8859_2 iso8859-2 iso-8859-2:1987 iso-ir-101 l2 latin2 | |
ISO8859_3 | ISO-8859-3 | ISO Latin 3 (Latin, Esperanto) | 8859-3 Cp913 csisolatin3 ibm-913 iso8859_3 iso8859-3 iso-8859-3:1988 iso-ir-109 l3 latin3 | |
ISO8859_4 | ISO-8859-4 | ISO Latin 4 (Baltic - old) | 8859-4 Cp914 csisolatin4 ibm-914 iso8859_4 iso8859-4 iso-8859-4:1988 iso-ir-110 l4 latin4 | |
ISO8859_5 | ISO-8859-5 | ISO Cyrillic | 8859-5 Cp915 csisolatincyrillic cyrillic ibm915 ibm-915 iso8859_5 iso8859-5 iso-8859-5:1988 iso-ir-144 | |
ISO8859_6 | ISO-8859-6 | ISO Arabic | 8859-6 arabic asmo-708 Cp1089 csisolatinarabic ecma-114 ibm-1089 ibm1089 iso8859_6 iso8859-6 iso-8859-6:1987 iso-ir-127 | |
ISO8859_6S | ISO-8859-6S | ISO Arabic (shaped) | iso8859_6S iso8859-6S | ibm 1.2 |
ISO8859_7 | ISO-8859-7 | ISO Greek | 8859-7 Cp813 csisolatingreek ecma-118 elot-928 greek greek8 ibm813 ibm-813 iso8859_7 iso8859-7 iso-8859-7:1987 iso-ir-126 | |
ISO8859_8 | ISO-8859-8 | ISO Hebrew | 8859-8 Cp916 csisolatinhebrew hebrew ibm916 ibm-916 iso8859_8 iso8859-8 iso-8859-8:1988 iso-ir-138 | |
ISO8859_9 | ISO-8859-9 | ISO Latin 5 | 8859-9 Cp920 csisolatin5 ibm-920 ibm920 iso8859_9 iso8859-9 iso-ir-148 l5 latin5 | |
ISO8859_15 | ISO-8859-15 | ISO Latin 9 (euro) | 8859-9 Cp923 csisolatin9 ibm923 ibm-923 iso8859_15 iso8859_15_fdis iso8859-15 l9 latin0 latin9 | was ISO8859_15_FDIS before 1.4 |
JIS0201 | JIS0201 | Japan JIS 0201 | ||
JIS0208 | JIS0208 | Japan JIS 0208 | ||
JIS0212 | JIS0212 | Japan JIS 0212 | ||
KOI8_R | KOI8-R | Russia Internet | cskoi8r koi8 koi8_r | |
MacArabic | MacArabic | Apple Arabic | ||
MacCentralEurope | MacCentralEurope | Apple Central Europe | ibm-1282 | |
MacCroatian | MacCroatian | Apple Croatian | ibm-1284 | |
MacCyrillic | MacCyrillic | Apple Cyrillic | ibm-1283 | |
MacDingbat | MacDingbat | Apple Dingbat | ||
MacGreek | MacGreek | Apple Greek | ibm-1280 | |
MacHebrew | MacHebrew | Apple Hebrew | ||
MacIceland | MacIceland | Apple Iceland | ibm-1286 | |
MacRoman | MacRoman | Apple Roman | ibm-1275 | |
MacRomania | MacRomania | Apple Romania | ibm-1285 | |
MacSymbol | MacSymbol | Apple PostScript Symbol | Adobe-Symbol-Encoding ibm-1038 | |
MacThai | MacThai | Apple Thai | ||
MacTurkish | MacTurkish | Apple Turkish | ibm-1281 | |
MacUkraine | MacUkraine | Apple Ukraine | ||
MS874 | windows-874 | Windows Thai | ||
MS932 | windows-932 | Windows Japan | csshiftjis cswindows31j ms_kanji shift_jis s-jis sjis windows-31j x-sjis | |
MS936 | windows-936 | Windows China | 936 | |
MS949 | windows-949 | Windows Korea | Cp1361 ibm-1361 ibm1361 johab ksc5601-1992 ms1361 | |
MS950 | windows-950 | Windows Taiwan | big5 | |
Roman8 | hp-roman8 | HP Roman 8 | Cp1051 ibm-1051 r8 roman8 | ibm 1.4 |
TIS620 | TIS-620 | Thailand | tis620 tis620.2533 | |
UnicodeBig | X-UnicodeBig | UTF-16 Big Endian with mark | ||
UnicodeLittle | X-UnicodeLittle | UTF-16 Little Endian with mark | ||
UTF-16BE | UTF-16BE | UTF-16 Big-endian | ISO-10646-UCS-2 UnicodeBigUnmarked UTF_16BE UTF16BE X-UTF-16BE | |
UTF-16LE | UTF-16LE | UTF-16 Little-endian | UnicodeLittleUnmarked UTF_16LE UTF16LE X-UTF-16LE | |
UTF16 | UTF-16 | UTF-16 Unicode | UCS-2 Unicode UTF_16 | |
UTF32 | UTF-32 | UTF-32 Unicode | ISO-10646-UCS-4 UCS-4 UTF_32BE | ibm 1.4 nio |
UTF32BE | UTF-32BE | UTF-32 Little-endian | UTF_32BE X-UTF-32BE | ibm 1.4 nio |
UTF32LE | UTF-32LE | UTF-32 Little-endian | UTF_32LE X-UTF-32LE | ibm 1.4 nio |
UTF8 | UTF-8 | UTF-8 Unicode | UTF_8 | |
UTF8J | UTF-8J | UTF-8 Java varient | ibm 1.4 nio | |
CESU8 | CESU-8 | UTF-8 without surrogates | ibm 1.4 nio |