########################################################################## # Licensed Materials - Property of IBM and/or HCL # # __INF__ GLS # (c) Copyright IBM Corporation 1996, 2004 All rights reserved. # (c) Copyright HCL Technologies Ltd. 2017. All Rights Reserved. ########################################################################## # # Informix Code-Set Name Mapping File # ================================== # # -- This file maps code-set names to code-set numbers. More than one # code-set name can map to the same code-set number; however, the # same code-set name cannot map to different code-set numbers. # # A code-set name is used in locale specifications. For example, # "8859-1" is the code-set name in the locale specification # "en_us.8859-1@phon". # # A code-set number is based on the IBM CCSID numbering scheme and # is used in locale and code-set conversion filenames. For example, # "0333" is the hexadecimal representation for the code-set number 819 # in the filename "en_us/0333.lco"; and both "0333" and "01b5" are the # hexadecimal representations for the code-set numbers 819 and 437 in # the filename "033301b5.cvo". # # -- A locale specification can contain the decimal representation of the # code-set number as well as the code-set name. For example, if the # code-set name "8859-1" maps to the decimal code-set number 819, then # the locale specification "en_us.819@phon" will have the same effect # as the locale specification "en_us.8859-1@phon". # # -- Not every code set supported by Informix has a code-set name. For # example, most IBM CCSIDs and Microsoft Windows Code Pages are only # known by their number, so the decimal representation of their number # is used in the locale specification. For example, the locale # specification "en_us.1252@phon" specifies that the code set is # Microsoft Windows Code Page 1252. Note that Microsoft Windows Code # Pages are not yet officially registered as IBM CCSIDs, but their # numbers do not conflict with existing IBM CCSIDs, so for each # Microsoft Windows Code Page, the Informix code-set number is the # same as the Microsoft Windows Code Page number. # # IBM CCSIDs and Microsoft Windows Code Pages can have names associated # with them if they are also commonly known by a name. For example, IBM # CCSID 819 is the same code set as ISO 8859-1; therefore, the code-set # name "8859-1" is mapped to the code-set number 819. # # -- IBM CCSIDs are numbers between 0 and 65535, inclusive. However, IBM # has reserved a block of CCSIDs for use outside of IBM (between 57344 # [0xE000] and 61439 [0xEFFF], inclusive). Therefore, the non-IBM CCSIDs # and non-Microsoft Windows Code Pages that Informix will support will # map to numbers in this reserved block. The number 57344 [0xE000] is # reserved by Informix as a sentinel error value, therefore, it cannot # be associated with a code-set name. # # -- The range 61424 [0xEFF0] to 61439 [0xEFFF] is reserved for # customer-specific codesets. That is, codesets which are not # defined by any international or national standards body, operating # system vendor, etc. Informix will not make an entry in the # registry using a value in this range. Customers who add codesets # in this range must themselves maintain those entries. # # -- Given a code-set name, the GLS library functions will determine its # code-set number using the following algorithm. They will first search # for the code-set name in this mapping file. The search is not # case-sensitive. If the code-set name is found, it will return the # code-set number associated with the name. If the code-set name is # not found and the name consists of only decimal digits, they will # convert the name to a number between 0 and 65535, inclusive. # Therefore, IBM CCSIDs and Microsoft Windows Code Pages do not have to # appear explicitely in this mapping file. # # -- The following are the rules for the syntax of this mapping file: # # 1. Comment lines are indicated by a '#' as the first character in the # line. Blank lines are not ignored. # # 2. Each line in this mapping file consists of a mapping between # code-set name and code-set number. The format of each line is # as follows: # # The code-set name begins in the first column. # # The code-set name consists of 1 to 21, inclusive, graphical # ASCII characters (except that a code-set name cannot begin with # the comment character '#'). Case is not significant in code-set # names. # # The code-set name is separated from its code-set number by 1 or # more space or tab characters # # The code-set number consists of 1 or more decimal ASCII digits # that are converted to a number in the range of 0-65535, inclusive. # # An optional comment can follow the code-set number on the line. # This optional comment consists of one whitespace character # followed by any number of characters to the end of the line. # # ASCII 364 # 0x016c C 819 # 0x0333 C locale (os/portable/C) POSIX 819 # 0x0333 POSIX locale (os/portable/POSIX) # UnitedStates 437 # 0x01b5 (Microsoft Code Page 437) CP437 437 # 0x01b5 (Microsoft Code Page 437) # 8859-1 819 # 0x0333 same as IBM CCSID 819 Latin-1 819 # 0x0333 # KorHstExtSBCS 833 # 0x0341 - Korean Host Extended SBCS CCSID833 833 # 0x0341 # KorHstDBCS 834 # 0x0342 - Korean Host DBCS including 1880 UDC CCSID834 834 # 0x0342 # TChHstDBCS 835 # 0x0343 - Trad.Chinese Host DBCS including 6204 UDC CCSID835 835 # 0x0343 # SChHstExtSBCS 836 # 0x0344 - Sim.Chinese Host Extended SBCS CCSID836 836 # 0x0344 # SChHstDBCS 837 # 0x0345 - Sim.Chinese Host DBCS including 1880 UDC CCSID837 837 # 0x0345 # PC-Latin-1 850 # 0x0352 IBM CCSID 850 # PC-Latin-2 852 # 0x0354 IBM CCSID 852 # pc-data-hebrew 856 # 0x0358 IBM CCSID 856 pc-hebrew-saa 856 # 0x0358 # pc857 857 # 0x0359 IBM CCSID 857 cp857 857 # 0X0359 IBM CCSID 857 CCSID857 857 # 0X0359 IBM CCSID 857 PC-Latin-1-we 858 # 0x035a Latin 1 with euro PC868 858 # 0x035a # pc-cyrillic 866 # 0x0362 (Microsoft Code Page 866) CIS-1 866 # 0x0362 (Microsoft Code Page 866) PC866 866 # 0x0362 (Microsoft Code Page 866) # 8859-2 912 # 0x0390 same as IBM CCSID 912 Latin-2 912 # 0x0390 # Cyrillic-8bit 915 # 0x0393 IBM CCSID 915 8859-5 915 # 0x0393 IBM CCSID 915 Latin-Cyrillic 915 # 0x0393 IBM CCSID 915 # # # CCSID932 932 # 0x03a4 IBM CCSID 932 Mixed including 1880 UDC sjis-s 932 # 0x03a4 pc-sjis 932 # 0x03a4 cp932 932 # 0x03a4 Japanese MS Windows Code Page 932 # KorHstMxExSBCS 933 # 0x03a5 - Korean Host Mixed including 1880 UDC CSID933 933 # 0x03a5 Extended SBCS # SChMxExSBCS 935 # 0x03a7 - Sim.Chinese Host Mixed including 1880 UDC CCSID935 935 # 0x03a7 Extended SBCS # TChHstMxExSBCS 937 # 0x03a9 - Trad.Chinese Host Mixed including 6204 UDC CCSID937 937 # 0x03a9 Extended SBCS # TChHstExSBCS 28709 # 0x7025 - Trad.Chinese Host Extended SBCS CCSID28709 28709 # 0x7025 # 8859-3 57346 # 0xe002 Latin-3 57346 # 0xe002 # 8859-4 57347 # 0xe003 Latin-4 57347 # 0xe003 # 8859-6 1089 # 0x0441 same as IBM CCSID 1089 Latin-Arabic 1089 # 0x0441 iso-ir-127 1089 # 0x0441 ASMO-708 1089 # 0x0441 # 8859-7 813 # 0x032d same as IBM CCSID 813 Latin-Greek 813 # 0x032d # 8859-8 916 # 0x0394 same as IBM CCSID 916 Latin-Hebrew 916 # 0x0394 # 8859-9 920 # 0x0398 same as IBM CCSID 920 Latin-5 920 # 0x0398 # ISO-7-German 1011 # 0x03f3 IBM CCSID 1011 646de 1011 # 0x03f3 IBM CCSID 1011 # ISO-7-Danish 1017 # 0x03f9 IBM CCSID 1017 # 8859-13 57390 # 0xe02e Latin-Baltic 57390 # 0xe02e # 8859-15 57391 # 0xe02f Latin-9 57391 # 0xe02f # 8859-16 1173 # 0x0495 Latin-10 1173 # 0x0495 # pc1046 1046 # 0x0416 IBM CCSID 1046 arabic-pc 1046 # 0x0416 # ASMO-449 57389 # 0xe02d Arabic Standard, same as ISO 9036 arabic7 57389 # 0xe02d same as ISO 9036 iso-ir-89 57389 # 0xe02d same as ISO 9036 ISO-9036 57389 # 0xe02d cp708 708 # 0x02c4 MSDOS Code Page 708 (Microsoft) ASMO-449E 709 # 0x02c5 Arabic standard aka ASMO 449+ ASMO-449+ 709 # 0x02c5 IBM CCSID 709 (Arabic Standard 449+) cp864 864 # 0x0360 IBM CCSID 864 (Arabic DOS) cp420 420 # 0x01a4 IBM CCSID 420 (Arabic EBCDIC) ebcdic-cp-ar1 420 # 0x01a4 IBM CCSID 420 (Arabic EBCDIC) # # ISO-10646 is the specification for UCS-4. # ISO-10646 57392 # 0xe030 UCS-4 57392 # 0xe030 # unicode 57349 # 0xe005 # east-europe 1250 # 0x04e2 (Microsoft Code Page 1250) CP1250 1250 # 0x04e2 (Microsoft Code Page 1250) pc-slavic 1251 # 0x04e3 (Microsoft Code Page 1251) CP1251 1251 # 0x04e3 (Microsoft Code Page 1251) west-europe 1252 # 0x04e4 (Microsoft Code Page 1252) CP1252 1252 # 0x04e4 (Microsoft Code Page 1252) pc-greek 1253 # 0x04e5 (Microsoft Code Page 1253) CP1253 1253 # 0x04e5 (Microsoft Code Page 1253) pc-latin5 1254 # 0x04e6 (Microsoft Code Page 1254) CP1254 1254 # 0x04e6 (Microsoft Code Page 1254) pc-turkish 1254 # 0x04e6 (Microsoft Code Page 1254) pc-hebrew 1255 # 0x04e7 (Microsoft Code Page 1255) CP1255 1255 # 0x04e7 (Microsoft Code Page 1255) pc-arabic 1256 # 0x04e8 (Microsoft Code Page 1256) CP1256 1256 # 0x04e8 (Microsoft Code Page 1256) pc-baltic 1257 # 0x04e9 (Microsoft Code Page 1257) CP1257 1257 # 0x04e9 (Microsoft Code Page 1257) CP1258 1258 # 0x04ea (Microsoft Code Page 1258) # big5HKSCS 1375 # 0x055f big5-HKSCS 1375 # 0x055f big-5HKSCS 1375 # 0x055f big-5-HKSCS 1375 # 0x055f # sjis 57350 # 0xe006 Shift-JIS 57350 # 0xe006 # ujis 57351 # 0xe007 # eucJP 5050 # 0x13ba same as IBM CCSID 5050 # big5 57352 # 0xe008 Big-5 57352 # 0xe008 cp950 57352 # 0xe008 Traditional Chinese MS Windows Code Page 950 # sbig5 57353 # 0xe009 Shift-Big-5 57353 # 0xe009 # ccdc 57354 # 0xe00a # cccii 57355 # 0xe00b # big5e 57393 # 0xe031 Big5e locale for DSC # Greek 737 # 0x02E1 - cannot find 737 CCSID but no conflict. greek 737 # Olivetti CP 210 # baltic 775 # 0x0307 Baltic Rim # ksc 57356 # 0xe00c Korean (Wansung) KS C-5601-1987 KS5601 57356 # 0xe00c Korean (Wansung) KS C-5601-1987 cp_949 949 # 0x03b5 Korean MS Windows Code Page 949 # gb 57357 # 0xe00d GB2312-80 57357 # 0xe00d cp936 57357 # 0xe00d Simplified Chinese Microsoft Windows # GB18030-2000 5488 # 0x1570 Simplified Chinese GB18030-2000 # EBCDIC 37 # 0x0025 EBCDIC-US 37 # 0x0025 # # Testing code sets # chngcase 57358 # 0xe00e bangA 57359 # 0xe00f shift1 57360 # 0xe010 rot13 57361 # 0xe011 swap0to9 57362 # 0xe012 pcwin 57363 # 0xe013 qacvtest 57365 # 0xe015 swap09-nl 57366 # 0xe016 qa-unix-client 57367 # 0xe017 qa-unix-server 57368 # 0xe018 qa-win-client 57369 # 0xe019 # # Code set for ALS # Guobiao2312 57357 # 0xe00d # # MIK 57370 # 0xe01a (commonly used in Bulgaria) # utf8 57372 # 0xe01c # thai 57373 # 0xe01d (Thai 620-2533 (1990)) thai620 57373 # 0xe01d (Thai 620-2533 (1990)) # sopstw 57374 # 0xe01e (SOPS code set used in Taiwan) euctw 57375 # 0xe01f (EUC code set for Taiwan) cns 57375 # 0xe01f (EUC CNS code set for Taiwan) cns-extended 57378 # 0xe022 (Extended EUC CNS code set for Taiwan) eucxtw 57378 # 0xe022 (Extended EUC CNS code set for Taiwan) # tst 57376 # 0xe020 (For compatiblity, use qa-* instead) qa-win-950 57377 # 0xe021 (qa codeset using real MS 950 code page) qa-win-932 57379 # 0xe023 (qa codeset using real MS 932 code page) # KOI-8 57382 # 0xe026 (Russian code page) CSK 57383 # 0xe027 (Polish code page) mazovia 57384 # 0xe028 (Polish code page) maz 57384 # 0xe028 (Polish code page) # Roman8 57385 # 0xe029 (HP Roman 8) qa-win-unicode 57386 # 0xe02a (qa codeset using real Unicode code page) qa-unix-big5 57387 # 0xe02b (qa codeset for code page big5) urst2018-91 57388 # 0xe02c (Ukrainian Repulican STandard 2018-91) # # END OF REGISTRY