MFC r340491, r340492:
author Yuri Pankov <yuripv@FreeBSD.org>
Thu, 6 Dec 2018 10:53:11 +0000 (10:53 +0000)
committer Yuri Pankov <yuripv@FreeBSD.org>
Thu, 6 Dec 2018 10:53:11 +0000 (10:53 +0000)
commit bcf900aae0cb1cb25f99e8691568f9ba85f5407c
tree 2d3eacb35ec567b73002535d73d8010edc077d6f
parent 88ae7be36bae9963e5c52e0b57089a1072eeceb0
Use UnicodeData.txt to create UTF-8 ctype map.

This should provide more complete coverage of the currently defined Unicode
characters than the manually assembled map we use currently.
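
As a rough illustration of deriving class counts from UnicodeData.txt, here
is a minimal Python sketch. It is not the logic of utf8-rollup.pl: the
CLASS_BY_CATEGORY table and the count_classes helper are hypothetical, and
the category-to-class mapping is an assumption covering only a few general
categories. The sample lines follow the semicolon-separated UnicodeData.txt
layout, including a "First>"/"Last>" pair that stands for a whole range.

```python
from collections import Counter

# A few lines in UnicodeData.txt format (code point; name; general category; ...).
SAMPLE = """\
0009;<control>;Cc;0;S;;;;;N;CHARACTER TABULATION;;;;
0030;DIGIT ZERO;Nd;0;EN;;0;0;0;N;;;;;
0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;
0061;LATIN SMALL LETTER A;Ll;0;L;;;;;N;;;0041;;0041
E000;<Private Use, First>;Co;0;L;;;;;N;;;;;
F8FF;<Private Use, Last>;Co;0;L;;;;;N;;;;;
"""

# Assumed, deliberately incomplete mapping from general category to ctype classes.
CLASS_BY_CATEGORY = {
    "Lu": ("upper", "alpha"),
    "Ll": ("lower", "alpha"),
    "Nd": ("digit",),
    "Cc": ("cntrl",),
    "Co": ("cntrl",),  # private use counted as cntrl, as the commit describes
}


def count_classes(lines):
    """Count code points per ctype class, expanding First/Last range pairs."""
    counts = Counter()
    range_start = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        fields = line.split(";")
        code, name, category = int(fields[0], 16), fields[1], fields[2]
        if name.endswith(", First>"):
            range_start = code  # remember where the range begins
            continue
        if name.endswith(", Last>"):
            span = code - range_start + 1  # whole range shares one category
            range_start = None
        else:
            span = 1
        for cls in CLASS_BY_CATEGORY.get(category, ()):
            counts[cls] += span
    return counts


counts = count_classes(SAMPLE.splitlines())
print(dict(counts))
```

Expanding the range pairs is the step that matters for the totals: a single
First/Last pair can contribute thousands of code points to one class.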

Comparison of original and new UTF-8 ctype maps by character class:

TYPE    ORIG    NEW
alnum   94229   126029
alpha   93557   125419
blank   4       2
cntrl   73      137685
digit   469     622
graph   109615  137203
lower   1478    2145
print   109641  137222
punct   3428    797
rune    110481  274907
space   33      24
upper   983     1781
xdigit  469     622

The large number of added cntrl definitions is due to the private-use
planes currently being classified as cntrl; this may change in the future.
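
A quick arithmetic check of the private-use explanation, as a Python sketch.
The three ranges are the private-use areas defined by the Unicode Standard;
that all three are counted as cntrl in the new map is an assumption, and the
names PRIVATE_USE_RANGES and NEW_CNTRL are illustrative only.

```python
# Private-use ranges defined by the Unicode Standard (inclusive bounds).
PRIVATE_USE_RANGES = [
    (0xE000, 0xF8FF),      # Private Use Area in the BMP
    (0xF0000, 0xFFFFD),    # Supplementary Private Use Area-A (plane 15)
    (0x100000, 0x10FFFD),  # Supplementary Private Use Area-B (plane 16)
]

NEW_CNTRL = 137685  # "NEW" cntrl count from the table in this commit message

sizes = [last - first + 1 for first, last in PRIVATE_USE_RANGES]
private_use_total = sum(sizes)
remainder = NEW_CNTRL - private_use_total

print(sizes)              # [6400, 65534, 65534]
print(private_use_total)  # 137468
print(remainder)          # 217
```

Under that assumption, private-use code points account for 137468 of the
137685 cntrl entries. The commit message does not say which characters make
up the remaining 217.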

Discussed with: bapt
Differential revision: https://reviews.freebsd.org/D17842
contrib/netbsd-tests/lib/libc/locale/t_mbstowcs.c
share/ctypedef/C.UTF-8.src
tools/tools/locale/Makefile
tools/tools/locale/etc/common.UTF-8.src [deleted file]
tools/tools/locale/etc/manual-input.UTF-8 [deleted file]
tools/tools/locale/tools/utf8-rollup.pl