IBM Java 2 converters



These are the converters and Charsets supported by IBM Java 2 version 1.4. JDK 1.4 introduces a new set of character conversion APIs as part of the nio. The names for the nio Charsets are normally the IANA primary name.

Most of the converters are also supported in earlier releases. The notes indicate whether this is an IBM only converter, and the release in which it was introduced. Some of the converters were introduced in service refreshes. This document reflects the current state of these releases.

The nio Charset name can be used as an alias for legacy converters, and the legacy converter name can be used as an alias for nio Charsets. Other available aliases are shown in the aliases column. The alias names are case independent. Some aliases have been added in releases after the one in which the converter was added.

The images of the codepages show the glyphs and unicode mappings of the converter. However, the unicode values may not exactly match the ones used by Java. Only the first part of complex codepages are shown.

Also see: detailed mapping from IBM CCSIDs to Java converters.

Converter Charset Description Aliases Notes
ASCII US-ASCII ASCII 7 bit 646 ANSI_X3.4-1968 ANSI_X3.4-1986 ascii7 Cp367 csASCII default direct ibm-367 iso-646.irv:1983 ISO-646.irv:1991 ISO646-US iso-ir-6 us  
Cp037 IBM-037 EBCDIC United States ibm037  
Cp273 IBM-273 EBCDIC Germany ibm273  
Cp277 IBM-277 EBCDIC Denmark, Norway ibm277  
Cp278 IBM-278 EBCDIC Finland, Sweden ibm278  
Cp280 IBM-280 EBCDIC Italy ibm280  
Cp284 IBM-284 EBCDIC Spain, Latin America ibm284  
Cp285 IBM-285 EBCDIC UK, Ireland ibm285  
Cp290 IBM-290 EBCDIC SBCS Japan Katakana ibm290 ibm 1.3
Cp297 IBM-297 EBCDIC France ibm297  
Cp300 IBM-300 EBCDIC DBCS-only Japan ibm300 ibm 1.3
Cp301 IBM-301 PC DBCS-only Japan ibm301 ibm 1.3
Cp420 IBM-420 EBCDIC Arabic ibm420  
Cp420S IBM-420S EBCDIC Arabic shaped ibm420S ibm 1.2
Cp424 IBM-424 EBCDIC Hebrew ibm424  
Cp437 IBM-437 PC United States cspc8codepage437 ibm437  
Cp500 IBM-500 EBCDIC Latin 1 ibm500  
Cp737 IBM-737 MS-DOS Greek ibm737  
Cp775 IBM-775 MS-DOS Baltic Rim ibm775  
Cp808 IBM-808 PC Russian (euro) ibm808 ibm 1.4
Cp833 IBM-833 EBCDIC SBCS Korean ibm833 ibm 1.3
Cp834 IBM-834 EBCDIC DBCS-only Korean ibm834 ibm 1.3
Cp835 IBM-835 EBCDIC DBCS-only Taiwan ibm835 ibm 1.3
Cp836 IBM-836 EBCDIC SBCS China ibm836 ibm 1.3
Cp837 IBM-837 EBCDIC DBCS-only China ibm837 ibm 1.3
Cp838 IBM-838 EBCDIC Thai ibm838  
Cp850 IBM-850 PC Latin 1 cspc850multilingual ibm850  
Cp852 IBM-852 PC Latin 2 cspcp852 ibm852  
Cp855 IBM-855 PC Cyrillic ibm855  
Cp856 IBM-856 PC Hebrew (old) ibm856  
Cp857 IBM-857 PC Latin 5 csibm857 ibm857  
Cp858 IBM-858 PC Latin 1 (euro) ibm00858 ibm858  
Cp859 IBM-859 PC Latin 9 ibm859 ibm 1.2
Cp860 IBM-860 PC Portugal ibm860  
Cp861 IBM-861 PC Iceland ibm861  
Cp862 IBM-862 PC Israel ibm862  
Cp863 IBM-863 PC Canadian French ibm863  
Cp864 IBM-864 PC Arabic ibm864  
Cp864S IBM-864S PC Arabic (shaped) ibm864S ibm 1.2
Cp865 IBM-865 PC Nordic ibm865  
Cp866 IBM-866 PC Russia ibm866  
Cp867 IBM-867 PC Israel ibm867 ibm 1.2
Cp868 IBM-868 PC Urdu ibm868  
Cp869 IBM-869 PC Greece ibm869  
Cp870 IBM-870 EBCDIC Latin 2 ibm870  
Cp871 IBM-871 EBCDIC Iceland ibm871  
Cp874 IBM-874 PC Thai ibm874  
Cp875 IBM-875 EBCDIC Greek ibm875  
Cp897 IBM-897 PC SBCS-only Japan ibm897 ibm 1.3
Cp918 IBM-918 EBCDIC Urdu ibm918  
Cp921 IBM-921 PC Baltic ibm921  
Cp921 ISO-8859-13 PC Baltic 8859_13 iso8859_13 iso8859-13 iso-8859-13  
Cp922 IBM-922 PC Estonian ibm922  
Cp924 IBM-924 EBCDIC Latin 9 ibm00924 ibm924 ibm 1.2
Cp927 IBM-927 PC DBCS-only Taiwan ibm927 ibm 1.3
Cp930 IBM-930 EBCDIC Japan DBCS ibm930  
Cp933 IBM-933 EBCDIC Korea DBCS ibm933  
Cp935 IBM-935 EBCDIC China DBCS ibm935  
Cp937 IBM-937 EBCDIC Taiwan DBCS ibm937  
Cp939 IBM-939 EBCDIC Japan Extended DBCS ibm939  
Cp942 IBM-942 PC Japan SJIS-78 ibm942  
Cp942C IBM-932 PC Japan SJIS-78 syntax Cp932 ibm932  
Cp942C IBM-942C PC Japan SJIS-78 syntax ibm942C  
Cp943 IBM-943 PC Japan SJIS-90 ibm943  
Cp943C IBM-943C PC Japan SJIS-90 syntax ibm943C  
Cp947 IBM-947 PC DBCS-only Taiwan ibm947 ibm 1.3
Cp948 IBM-948 PC Taiwan SAA ibm948  
Cp949 IBM-949 PC Korea (KS) ibm949  
Cp949C IBM-949C PC Korea (KS) ibm949C  
Cp950 IBM-950 PC Taiwan ibm950  
Cp951 IBM-951 PC DBCS-only Korean ibm951 ibm 1.3
Cp954 IBM-954 EUC Japan ibm954 ibm 1.4
Cp954C IBM-954C EUC Japan with syntax Cp954c ibm-eucjp ibm 1.4
Cp964 IBM-964 EUC Taiwan ibm-euctw  
Cp970 EUC-KR EUC Korea 5601 euc_kr euckr ibm-970 ibm-euckr ks_c_5601-1987 ksc_5601 ksc5601 ksc5601_1987 ksc5601-1987  
Cp971 IBM-971 EUC DBCS-only Korean ibm971 ibm 1.3
Cp1006 IBM-1006 ISO Urdu ibm1006  
Cp1025 IBM-1025 EBCDIC Cyrillic ibm1025  
Cp1026 IBM-1026 EBCDIC Latin 5 (Turkey) ibm1026  
Cp1027 IBM-1027 EBCDIC SBCS Japan ibm1027 ibm 1.3
Cp1041 IBM-1041 PC SBCS-only Japan ibm1041 ibm 1.3
Cp1043 IBM-1043 PC SBCS-only Taiwan ibm1043 ibm 1.3
Cp1046 IBM-1046 EBCDIC Arabic ibm1046  
Cp1046S IBM-1046S EBCDIC Arabic (shaped) ibm1046S ibm 1.2
Cp1047 IBM-1047 EBCDIC Open Edition ibm1047 ibm 1.2
Cp1088 IBM-1088 PC SBCS-only Korean KS ibm1088 ibm 1.3
Cp1097 IBM-1097 EBCDIC Farsi ibm1097  
Cp1098 IBM-1098 ISO Farsi ibm1098  
Cp1112 IBM-1112 EBCDIC Baltic ibm1112  
Cp1114 IBM-1114 PC SBCS-only Taiwan ibm1114 ibm 1.3
Cp1115 IBM-1115 PC SBCS-only China ibm1115 ibm 1.3
Cp1122 IBM-1122 EBCDIC Estonia ibm1122  
Cp1123 IBM-1123 EBCDIC Ukraine ibm1123  
Cp1124 IBM-1124 PC Ukraine ibm1124  
Cp1140 IBM-1140 ECECP United States ibm01140  
Cp1141 IBM-1141 ECECP Germany ibm01141  
Cp1142 IBM-1142 ECECP Denmark, Norway ibm01142  
Cp1143 IBM-1143 ECECP Finland, Sweden ibm01143  
Cp1144 IBM-1144 ECECP Italy ibm01144  
Cp1145 IBM-1145 ECECP Spain ibm01145  
Cp1146 IBM-1146 ECECP UK, Ireland ibm01146  
Cp1147 IBM-1147 ECECP France ibm01147  
Cp1148 IBM-1148 ECECP Multilingual ibm01148  
Cp1149 IBM-1149 ECECP Iceland ibm01149  
Cp1250 windows-1250 Windows Latin 2 ibm-1250  
Cp1251 windows-1251 Windows Cyrillic ibm-1251  
Cp1252 windows-1252 Windows Latin 1 ibm-1252  
Cp1253 windows-1253 Windows Greek ibm-1253  
Cp1254 windows-1254 Windows Latin 5 (Turkey) ibm-1254  
Cp1255 windows-1255 Windows Hebrew ibm-1255  
Cp1256 windows-1256 Windows Arabic ibm-1256  
Cp1256S windows-1256S Windows Arabic (shaped) Cp1256s ibm-1256s ibm 1.2
Cp1257 windows-1257 Windows Latin 4 (Balttic) ibm-1257  
Cp1258 windows-1258 Windows Vietnamese ibm-1129 ibm-1258  
Cp1351 IBM-1351 PC DBCS-only Japan ibm1351 ibm 1.3
Cp1362 IBM-1362 PC DBCS-only Korea KS extended ibm1362 ibm 1.3
Cp1363 IBM-1363 PC Korea KS extended ibm1363 ibm 1.2
Cp1363C IBM-1363C PC Korea KS extended (syntax) ibm1363C ibm 1.2
Cp1364 IBM-1364 EBCDIC Korea KS extended ibm1364 ibm 1.2
Cp1370 IBM-1370 PC Taiwan (euro) ibm1370 ibm 1.2
Cp1371 IBM-1371 EBCDIC Taiwan (euro) ibm1371 ibm 1.2
Cp1380 IBM-1380 PC DBCS-only China GB ibm1380 ibm 1.3
Cp1381 IBM-1381 PC China GB ibm1381  
Cp1382 IBM-1382 EUC DBCS-only China GB ibm1382 ibm 1.3
Cp1383 EUC-CN EUC China euc_cn euccn gb2312 gb2312-1980 gb2312-80 ibm-euccn  
Cp1383 IBM-1383 EUC China ibm1383  
Cp1385 IBM-1385 PC DBCS-only China GBK ibm1385 ibm 1.3
Cp1386 IBM-1386 PC China GBK ibm1386 ibm 1.2
Cp1388 IBM-1388 EBCDIC China GBK ibm1388 ibm 1.2
Cp1390 IBM-1390 EBCDIC Japan Katakana (euro) ibm1390 ibm 1,2
Cp1399 IBM-1399 EBCDIC Japan Latin (euro) ibm1399 ibm 1.2
Cp33722 IBM-33722 EUC Japan ibm33722  
Cp33722C IBM-33722C EUC Japan syntax ibm33722C ibm 1.2
EUC_JP EUC-JP EUC Japan euc_jp euc_jp_linux eucjp euc-jp-linux x-euc-jp x-eucjp  
EUC_TW EUC-TW EUC Taiwan cns11643 euc_tw euctw  
GB18030 GB18030 China GB18030 gb18030-2000 ibm-1392 windows-54936 ibm 1.2, sun 1.4
GBK GBK China GBK    
ISO8859_1 ISO-8859-1 ISO Latin 1 8859-1 Cp819 csisolatin1 ibm819 ibm-819 iso8859_1 iso8859-1 iso-8859-1:1987 iso-ir-100 l1 latin1  
ISO8859_2 ISO-8859-2 ISO Latin 2 (Eastern Europe) 8859-2 Cp912 csisolatin2 ibm912 ibm-912 iso8859_2 iso8859-2 iso-8859-2:1987 iso-ir-101 l2 latin2  
ISO8859_3 ISO-8859-3 ISO Latin 3 (Latin, Esperanto) 8859-3 Cp913 csisolatin3 ibm-913 iso8859_3 iso8859-3 iso-8859-3:1988 iso-ir-109 l3 latin3  
ISO8859_4 ISO-8859-4 ISO Latin 4 (Baltic - old) 8859-4 Cp914 csisolatin4 ibm-914 iso8859_4 iso8859-4 iso-8859-4:1988 iso-ir-110 l4 latin4  
ISO8859_5 ISO-8859-5 ISO Cyrillic 8859-5 Cp915 csisolatincyrillic cyrillic ibm915 ibm-915 iso8859_5 iso8859-5 iso-8859-5:1988 iso-ir-144  
ISO8859_6 ISO-8859-6 ISO Arabic 8859-6 arabic asmo-708 Cp1089 csisolatinarabic ecma-114 ibm-1089 ibm1089 iso8859_6 iso8859-6 iso-8859-6:1987 iso-ir-127  
ISO8859_6S ISO-8859-6S ISO Arabic (shaped) iso8859_6S iso8859-6S ibm 1.2
ISO8859_7 ISO-8859-7 ISO Greek 8859-7 Cp813 csisolatingreek ecma-118 elot-928 greek greek8 ibm813 ibm-813 iso8859_7 iso8859-7 iso-8859-7:1987 iso-ir-126  
ISO8859_8 ISO-8859-8 ISO Hebrew 8859-8 Cp916 csisolatinhebrew hebrew ibm916 ibm-916 iso8859_8 iso8859-8 iso-8859-8:1988 iso-ir-138  
ISO8859_9 ISO-8859-9 ISO Latin 5 8859-9 Cp920 csisolatin5 ibm-920 ibm920 iso8859_9 iso8859-9 iso-ir-148 l5 latin5  
ISO8859_15 ISO-8859-15 ISO Latin 9 (euro) 8859-9 Cp923 csisolatin9 ibm923 ibm-923 iso8859_15 iso8859_15_fdis iso8859-15 l9 latin0 latin9 was ISO8859_15_FDIS before 1.4
JIS0201 JIS0201 Japan JIS 0201    
JIS0208 JIS0208 Japan JIS 0208    
JIS0212 JIS0212 Japan JIS 0212    
KOI8_R KOI8-R Russia Internet cskoi8r koi8 koi8_r  
MacArabic MacArabic Apple Arabic    
MacCentralEurope MacCentralEurope Apple Central Europe ibm-1282  
MacCroatian MacCroatian Apple Croatian ibm-1284  
MacCyrillic MacCyrillic Apple Cyrillic ibm-1283  
MacDingbat MacDingbat Apple Dingbat    
MacGreek MacGreek Apple Greek ibm-1280  
MacHebrew MacHebrew Apple Hebrew    
MacIceland MacIceland Apple Iceland ibm-1286  
MacRoman MacRoman Apple Roman ibm-1275  
MacRomania MacRomania Apple Romania ibm-1285  
MacSymbol MacSymbol Apple PostScript Symbol Adobe-Symbol-Encoding ibm-1038  
MacThai MacThai Apple Thai    
MacTurkish MacTurkish Apple Turkish ibm-1281  
MacUkraine MacUkraine Apple Ukraine    
MS874 windows-874 Windows Thai    
MS932 windows-932 Windows Japan csshiftjis cswindows31j ms_kanji shift_jis s-jis sjis windows-31j x-sjis  
MS936 windows-936 Windows China 936  
MS949 windows-949 Windows Korea Cp1361 ibm-1361 ibm1361 johab ksc5601-1992 ms1361  
MS950 windows-950 Windows Taiwan big5  
Roman8 hp-roman8 HP Roman 8 Cp1051 ibm-1051 r8 roman8 ibm 1.4
TIS620 TIS-620 Thailand tis620 tis620.2533  
UnicodeBig X-UnicodeBig UTF-16 Big Endian with mark    
UnicodeLittle X-UnicodeLittle UTF-16 Little Endian with mark    
UTF-16BE UTF-16BE UTF-16 Big-endian ISO-10646-UCS-2 UnicodeBigUnmarked UTF_16BE UTF16BE X-UTF-16BE  
UTF-16LE UTF-16LE UTF-16 Little-endian UnicodeLittleUnmarked UTF_16LE UTF16LE X-UTF-16LE  
UTF16 UTF-16 UTF-16 Unicode UCS-2 Unicode UTF_16  
UTF32 UTF-32 UTF-32 Unicode ISO-10646-UCS-4 UCS-4 UTF_32BE ibm 1.4 nio
UTF32BE UTF-32BE UTF-32 Little-endian UTF_32BE X-UTF-32BE ibm 1.4 nio
UTF32LE UTF-32LE UTF-32 Little-endian UTF_32LE X-UTF-32LE ibm 1.4 nio
UTF8 UTF-8 UTF-8 Unicode UTF_8  
UTF8J UTF-8J UTF-8 Java varient   ibm 1.4 nio
CESU8 CESU-8 UTF-8 without surrogates   ibm 1.4 nio