2 # $NetBSD: UCS%CNS11643-1.src,v 1.4 2007/04/01 18:52:29 tnozaki Exp $
6 SRC_ZONE 0x00A2 - 0xFFE5
13 # Name: CNS 11643-1986 to Unicode table (complete)
14 # Unicode version: 1.1
15 # Table version: 0.0d1
16 # Table format: Format A
17 # Date: 21 October 1994
19 # Copyright (c) 1991-1994 Unicode, Inc. All Rights reserved.
21 # This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
22 # No claims are made as to fitness for any particular purpose. No
23 # warranties of any kind are expressed or implied. The recipient
24 # agrees to determine applicability of information provided. If this
25 # file has been provided on magnetic media by Unicode, Inc., the sole
26 # remedy for any claim will be exchange of defective media within 90
29 # Recipient is granted the right to make copies in any form for
30 # internal distribution and to freely use the information supplied
31 # in the creation of products supporting Unicode. Unicode, Inc.
32 # specifically excludes the right to re-distribute this file directly
33 # to third parties or other organizations whether for profit or not.
38 # This table contains one set of mappings from CNS 11643-1986 into Unicode.
39 # Note that these data are *possible* mappings only and may not be the
40 # same as those used by actual products, nor may they be the best suited
41 # for all uses. For more information on the mappings between various code
42 # pages incorporating the repertoire of CNS 11643-1986 and Unicode, consult the
43 # VENDORS mapping data. Normative information on the mapping between
44 # CNS 11643-1986 and Unicode may be found in the Unihan.txt file in the
45 # latest Unicode Character Database.
47 # If you have carefully considered the fact that the mappings in
48 # this table are only one possible set of mappings between CNS 11643-1986 and
49 # Unicode and have no normative status, but still feel that you
50 # have located an error in the table that requires fixing, you may
51 # report any such error to errata@unicode.org.
54 # WARNING! It is currently impossible to provide round-trip compatibility
55 # between CNS 11643-1986 and Unicode.
57 # (1) Some characters are not currently mapped because
58 # of conflicts with other mappings. They include the following:
60 # CNS Description Comments
62 # 0x12224 SPACING HEAVY OVERSCORE not in Unicode
63 # 0x12226 SPACING HEAVY UNDERSCORE not in Unicode
65 # (2) Some characters are not currently mapped because CNS 11643-1986 includes
66 # duplicate versions of some ideographs which are included in Unicode only
67 # once. They include the following:
69 # 0x1243E through 0x12440, HANGZHOU NUMERAL TEN, TWENTY, and THIRTY
70 # 0x12721 through 0x12939, KangXi radical set
72 # Entries for these characters are not included in this table.
76 # 1. There is an uncertainty in the mapping of the CNS 11643-1986 character
77 # 0x1256D. This character occurs within the CNS 11643-1986 block of tone marks
78 # for bopomofo and is intended to be the tone mark for the first tone in
79 # Mandarin Chinese. We have selected the mapping U+02C9 MODIFIER LETTER
80 # MACRON (Mandarin Chinese first tone) to reflect this semantic.
81 # However, because bopomofo uses the absense of a tone mark to indicate
82 # the first Mandarin tone, most implementations of CNS 11643-1986
83 # represent this character with a blank space, and so a mapping such as
84 # U+2003 EM SPACE might be preferred.
86 # Format: Three tab-separated columns
87 # Column #1 is the CNS 11643-1986 code (in hex as 0xXXXXX)
88 # Column #2 is the Unicode (in hex as 0xXXXX)
89 # Column #3 is the Unicode name (follows a comment sign, '#')
90 # The official names for Unicode characters U+4E00
91 # to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX",
92 # where XXXX is the code point. Including all these
93 # names in this file increases its size substantially
94 # and needlessly. The token "<CJK>" is used for the
95 # name of these characters. If necessary, it can be
96 # expanded algorithmically by a parser or editor.
98 # The entries are in CNS 11643-1986 order
103 # Name: Unihan database
104 # Unicode version: 4.0.1
106 # Date: 31 October 2003
108 # Copyright (c) 1996-2003 Unicode, Inc. All Rights reserved.
114 # The Unicode Character Database (including the Unihan database) is provided
115 # as is by Unicode, Inc. No claims are made as to fitness for any particular
116 # purpose. No warranties of any kind are expressed or implied. The recipient agrees
117 # to determine applicability of information provided. If this file has been
118 # purchased on magnetic or optical media from Unicode, Inc., the sole remedy for
119 # any claim will be exchange of defective media within 90 days of receipt.
121 # This disclaimer is applicable for all other data files accompanying the Unicode
122 # Character Database, some of which have been compiled by the Unicode Consortium,
123 # and some of which have been supplied by other sources.
125 # Limitations on Rights to Redistribute This Data
127 # Recipient is granted the right to make copies in any form for internal
128 # distribution and to freely use the information supplied in the creation
129 # of products supporting the UnicodeTM Standard. The files in the Unicode
130 # Character Database can be redistributed to third parties or other
131 # organizations (whether for profit or not) as long as this notice and
132 # the disclaimer notice are retained. Information can be extracted from
133 # these files and used in documentation or programs, as long as there is
134 # an accompanying notice indicating the source.
137 0x00A2 = 0x2266 # fallback -> 0xFFE0
138 0x00A3 = 0x2267 # fallback -> 0xFFE1
139 0x00A5 = 0x2264 # fallback -> 0xFFE5
202 0x2016 = 0x225D # fallback -> 0x2225
207 0x2022 = 0x2126 # fallback -> 0x2027
260 0x223C = 0x2244 # fallback -> 0xFF5E
380 0x2609 = 0x2254 # fallback -> 0x2299
382 0x2641 = 0x2253 # fallback -> 0x2295
384 0x2F00 - 0x2F5D = 0x2721 -
385 0x2F5F - 0x2FBC = 0x2821 -
386 0x2FBD - 0x2FD5 = 0x2921 -
419 0x3041 - 0x306D = 0x2952 -
420 0x306E - 0x3094 = 0x2A21 -
421 0x3099 - 0x309E = 0x2A48 -
422 0x30A1 - 0x30D1 = 0x2A4E -
423 0x30D2 - 0x30FE = 0x2B21 -
424 0x30FB = 0x2126 # fallback -> 0x2027
462 0x31A0 - 0x31B7 = 0x293A -
943 0x5344 = 0x243F # fallback -> 0x3039
5888 0xFE32 = 0x2138 # fallback -> 0xFE31
6027 0xFF64 = 0x212F # fallback -> 0xFE51