
Commit 9070100

Add back character sets that had characters outside 16 bit plane (#1964)
* Add back character sets that had characters outside 16 bit plane
* Update XCCS-353=SYMBOLS3.TXT: Update title line
* Update UNICODE.TEDIT
* Fix charset names
* Reorganized the tables, added requested interfaces
* Use a single hash
* Top-level array branch beats a single hash
* cleanup UNICODE.TRANSLATE macro
* Fix slug in outcharfn
* Remove a stray line
* Another try, would work for raw
* Remove duplicates, redo hashing
* Getting complete maps in both directions
* Initializing
* Only the latest file versions
* Add back gothic mappings
1 parent db98ea3 commit 9070100

9 files changed, +1943 -1626 lines changed


library/UNICODE

Lines changed: 452 additions & 492 deletions
Large diffs are not rendered by default.

library/UNICODE.LCOM

-1.73 KB
Binary file not shown.

library/UNICODE.TEDIT

451 Bytes
Binary file not shown.

unicode/xerox/INVERTED-UNICODE-MAPPINGS.TXT

Lines changed: 1129 additions & 966 deletions
Large diffs are not rendered by default.

unicode/xerox/UNICODE-MAPPINGS.TXT

Lines changed: 327 additions & 133 deletions
Large diffs are not rendered by default.

unicode/xerox/XCCS-353=SYMBOLS3.TXT

Lines changed: 2 additions & 2 deletions
@@ -42,7 +42,7 @@
 # Any comments or problems, contact <ron.kaplan@post.harvard.edu>
 
 
-# "353" UNKNOWN
+# "353" SYMBOLS3
 0xEB21 0x2119 # ℙ DOUBLE-STRUCK CAPITAL P
 0xEB22 0x210B # ℋ SCRIPT CAPITAL H
 0xEB23 0x2110 # ℐ SCRIPT CAPITAL I
@@ -53,7 +53,7 @@
 0xEB28 0x203D # ‽ INTERROBANG
 0xEB29 0x2318 # ⌘ PLACE OF INTEREST SIGN
 0xEB2B 0x210C # ℌ BLACK-LETTER CAPITAL H
-0xEB2D 0x1D53D # 𝔽 MATHEMATICAL DOUBLE-STRUCK CAPITAL F
+0xEB2D 0x1D53D # MATHEMATICAL DOUBLE-STRUCK CAPITAL F
 0xEB2E 0x21C5 # ⇅ UPWARDS ARROW LEFTWARDS OF DOWNWARDS ARROW
 0xEB2F 0x21F5 # ⇵ DOWNWARDS ARROW LEFTWARDS OF UPWARDS ARROW
 0xEB30 0x21E2 # ⇢ RIGHTWARDS DASHED ARROW
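
The table lines in this diff share one layout: an XCCS code in hex, one or more Unicode code points in hex, then a `#` comment. As a rough illustration only (this is not the Lisp code in library/UNICODE, and the names `Mapping` and `parse_mapping_line` are hypothetical), a reader for such lines might be sketched in Python as follows, assuming whitespace-separated fields and that everything after the first `#` is commentary:

```python
from typing import NamedTuple, Optional, Tuple


class Mapping(NamedTuple):
    xccs: int                  # XCCS code, e.g. 0xEB2D
    unicode: Tuple[int, ...]   # one or more Unicode code points
    comment: str               # text after '#', possibly empty


def parse_mapping_line(line: str) -> Optional[Mapping]:
    """Parse one table line; return None for blank or comment-only lines."""
    # Everything after the first '#' is treated as a comment (an assumption).
    body, _, comment = line.partition("#")
    fields = body.split()
    if not fields:
        return None
    # int(..., 16) accepts the '0x' prefix used in the tables.
    codes = [int(field, 16) for field in fields]
    return Mapping(codes[0], tuple(codes[1:]), comment.strip())


if __name__ == "__main__":
    print(parse_mapping_line("0xEB2D 0x1D53D # MATHEMATICAL DOUBLE-STRUCK CAPITAL F"))
```

Keeping the Unicode side as a tuple lets the same reader handle entries that map one XCCS code to a sequence of code points.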

unicode/xerox/XCCS-51=RUNIC-GOTHIC.TXT

Lines changed: 29 additions & 29 deletions
@@ -42,7 +42,7 @@
 # Any comments or problems, contact <ron.kaplan@post.harvard.edu>
 
 
-# "51" UNKNOWN
+# "51" RUNIC-GOTHIC
 0x2922 0x16A0 # ᚠ RUNIC LETTER FEHU FEOH FE F
 0x2924 0x16A2 # ᚢ RUNIC LETTER URUZ UR U
 0x2927 0x16A6 # ᚦ RUNIC LETTER THURISAZ THURS THORN
@@ -87,31 +87,31 @@
 0x29B5 0x16A3 # ᚣ RUNIC LETTER YR
 0x29B6 0x16E0 # ᛠ RUNIC LETTER EAR
 0x29B8 0x16E1 # ᛡ RUNIC LETTER IOR
-0x29E1 0x10330 # 𐌰 GOTHIC LETTER AHSA
-0x29E2 0x10331 # 𐌱 GOTHIC LETTER BAIRKAN
-0x29E3 0x10332 # 𐌲 GOTHIC LETTER GIBA
-0x29E4 0x10333 # 𐌳 GOTHIC LETTER DAGS
-0x29E5 0x10334 # 𐌴 GOTHIC LETTER AIHVUS
-0x29E6 0x10335 # 𐌵 GOTHIC LETTER QAIRTHRA
-0x29E7 0x10336 # 𐌶 GOTHIC LETTER IUJA
-0x29E8 0x10337 # 𐌷 GOTHIC LETTER HAGL
-0x29E9 0x10338 # 𐌸 GOTHIC LETTER THIUTH
-0x29EA 0x10339 0x0308 # 𐌹̈ GOTHIC LETTER EIS; COMBINING DIAERESIS
-0x29EB 0x10339 # 𐌹 GOTHIC LETTER EIS
-0x29EC 0x1033A # 𐌺 GOTHIC LETTER KUSMA
-0x29ED 0x1033B # 𐌻 GOTHIC LETTER LAGUS
-0x29EE 0x1033C # 𐌼 GOTHIC LETTER MANNA
-0x29EF 0x1033D # 𐌽 GOTHIC LETTER NAUTHS
-0x29F0 0x1033E # 𐌾 GOTHIC LETTER JER
-0x29F1 0x1033F # 𐌿 GOTHIC LETTER URUS
-0x29F2 0x10340 # 𐍀 GOTHIC LETTER PAIRTHRA
-0x29F3 0x10341 # 𐍁 GOTHIC LETTER NINETY
-0x29F4 0x10342 # 𐍂 GOTHIC LETTER RAIDA
-0x29F5 0x10343 # 𐍃 GOTHIC LETTER SAUIL
-0x29F6 0x10344 # 𐍄 GOTHIC LETTER TEIWS
-0x29F7 0x10345 # 𐍅 GOTHIC LETTER WINJA
-0x29F8 0x10346 # 𐍆 GOTHIC LETTER FAIHU
-0x29F9 0x10347 # 𐍇 GOTHIC LETTER IGGWS
-0x29FA 0x10348 # 𐍈 GOTHIC LETTER HWAIR
-0x29FB 0x10349 # 𐍉 GOTHIC LETTER OTHAL
-0x29FC 0x1034A # 𐍊 GOTHIC LETTER NINE HUNDRED
+0x29E1 0x10330 # GOTHIC LETTER AHSA
+0x29E2 0x10331 # GOTHIC LETTER BAIRKAN
+0x29E3 0x10332 # GOTHIC LETTER GIBA
+0x29E4 0x10333 # GOTHIC LETTER DAGS
+0x29E5 0x10334 # GOTHIC LETTER AIHVUS
+0x29E6 0x10335 # GOTHIC LETTER QAIRTHRA
+0x29E7 0x10336 # GOTHIC LETTER IUJA
+0x29E8 0x10337 # GOTHIC LETTER HAGL
+0x29E9 0x10338 # GOTHIC LETTER THIUTH
+0x29EA 0x10339 0x0308 # GOTHIC LETTER EIS; COMBINING DIAERESIS
+0x29EB 0x10339 # GOTHIC LETTER EIS
+0x29EC 0x1033A # GOTHIC LETTER KUSMA
+0x29ED 0x1033B # GOTHIC LETTER LAGUS
+0x29EE 0x1033C # GOTHIC LETTER MANNA
+0x29EF 0x1033D # GOTHIC LETTER NAUTHS
+0x29F0 0x1033E # GOTHIC LETTER JER
+0x29F1 0x1033F # GOTHIC LETTER URUS
+0x29F2 0x10340 # GOTHIC LETTER PAIRTHRA
+0x29F3 0x10341 # GOTHIC LETTER NINETY
+0x29F4 0x10342 # GOTHIC LETTER RAIDA
+0x29F5 0x10343 # GOTHIC LETTER SAUIL
+0x29F6 0x10344 # GOTHIC LETTER TEIWS
+0x29F7 0x10345 # GOTHIC LETTER WINJA
+0x29F8 0x10346 # GOTHIC LETTER FAIHU
+0x29F9 0x10347 # GOTHIC LETTER IGGWS
+0x29FA 0x10348 # GOTHIC LETTER HWAIR
+0x29FB 0x10349 # GOTHIC LETTER OTHAL
+0x29FC 0x1034A # GOTHIC LETTER NINE HUNDRED
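
The Gothic targets (0x10330 through 0x1034A) lie above 0xFFFF, which is what the commit title means by characters outside the 16 bit plane. The sketch below is a generic illustration in Python of why such values need special handling when code units are 16 bits wide. It uses the standard UTF-16 surrogate-pair arithmetic and makes no claim about how library/UNICODE represents these characters internally; the function names are hypothetical.

```python
from typing import Tuple


def is_supplementary(cp: int) -> bool:
    """True when the code point lies beyond the 16-bit range (above U+FFFF)."""
    return 0x10000 <= cp <= 0x10FFFF


def utf16_units(cp: int) -> Tuple[int, ...]:
    """Return the UTF-16 code unit(s) for a code point.

    Values up to 0xFFFF fit in one unit; larger values are split into
    a high and a low surrogate, following the standard UTF-16 scheme.
    """
    if not is_supplementary(cp):
        return (cp,)
    offset = cp - 0x10000            # 20-bit value to split in two halves
    high = 0xD800 + (offset >> 10)   # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)  # bottom 10 bits
    return (high, low)


if __name__ == "__main__":
    for cp in (0x16A0, 0x10330, 0x1D53D):
        print(hex(cp), [hex(unit) for unit in utf16_units(cp)])
```

Any table or hash keyed on 16-bit values cannot hold 0x10330 directly, which is consistent with these mappings having been dropped earlier and restored here.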

unicode/xerox/XCCS-56=UNKNOWN1.TXT renamed to unicode/xerox/XCCS-56=DECORATED-RULES.TXT

Lines changed: 2 additions & 2 deletions
@@ -1,7 +1,7 @@
 #
 # Name: XCCS (XC-3-1-1-0) to Unicode
 # Unicode version: 3.0
-# XCCS charset: 56 UNKNOWN
+# XCCS charset: 56 DECORATED-RULES
 # Table version: 0.1
 # Table format: Format A
 # Date: 9-Aug-2021
@@ -42,7 +42,7 @@
 # Any comments or problems, contact <ron.kaplan@post.harvard.edu>
 
 
-# "56" UNKNOWN
+# "56" DECORATED-RULES
 0x2E21 0x2500 # ─ BOX DRAWINGS LIGHT HORIZONTAL
 0x2E22 0x23AF # ⎯ HORIZONTAL LINE EXTENSION
 0x2E23 0x2501 # ━ BOX DRAWINGS HEAVY HORIZONTAL

unicode/xerox/XCCS-57=UNKNOWN2.TXT renamed to unicode/xerox/XCCS-57=VERTICAL-JAPANESE.TXT

Lines changed: 2 additions & 2 deletions
@@ -1,7 +1,7 @@
 #
 # Name: XCCS (XC-3-1-1-0) to Unicode
 # Unicode version: 3.0
-# XCCS charset: 57 UNKNOWN
+# XCCS charset: 57 VERTICAL-JAPANESE
 # Table version: 0.1
 # Table format: Format A
 # Date: 9-Aug-2021
@@ -42,7 +42,7 @@
 # Any comments or problems, contact <ron.kaplan@post.harvard.edu>
 
 
-# "57" UNKNOWN
+# "57" VERTICAL-JAPANESE
 0x2F24 0xFE33 # ︳ PRESENTATION FORM FOR VERTICAL LOW LINE
 0x2F26 0xFE31 # ︱ PRESENTATION FORM FOR VERTICAL EM DASH
 0x2F2B 0x22EE # ⋮ VERTICAL ELLIPSIS
