diff options
author | glepnir <glephunter@gmail.com> | 2024-02-12 22:14:53 +0100 |
---|---|---|
committer | Christian Brabandt <cb@256bit.org> | 2024-02-12 22:39:40 +0100 |
commit | bd1232a1faf56b614a1e74c4ce51bc6e0650ae00 (patch) | |
tree | e4037f4cfb6ccddad1a599be613fd456e33b094f /src/mbyte.c | |
parent | f2d90a351159fd6843f450850f52004f42e00183 (diff) |
patch 9.1.0101: upper-case of German sharp s should be U+1E9Ev9.1.0101
Problem: upper-case of ß should be U+1E9E (CAPITAL LETTER SHARP S)
(fenuks)
Solution: Make gU, ~ and g~ convert the U+00DF LATIN SMALL LETTER SHARP S (ß)
to U+1E9E LATIN CAPITAL LETTER SHARP S (ẞ), update tests
(glepnir)
This is part of Unicode 5.1.0 from April 2008, so should be fairly safe
to use now and since 2017 is part of the German standard orthography,
according to Wikipedia:
https://en.wikipedia.org/wiki/Capital_%E1%BA%9E#cite_note-auto-12
There is however one exception: UnicodeData.txt for U+00DF
LATIN SMALL LETTER SHARP S does NOT define U+1E9E LATIN CAPITAL LETTER
SHARP S as its upper case version. Therefore, toupper() won't be able
to convert from lower sharp s to upper case sharp s (the other way
around however works, since U+00DF is considered the lower case
character of U+1E9E and therefore tolower() works correctly for the
upper case version).
fixes: #5573
closes: #14018
Signed-off-by: glepnir <glephunter@gmail.com>
Signed-off-by: Christian Brabandt <cb@256bit.org>
Diffstat (limited to 'src/mbyte.c')
-rw-r--r-- | src/mbyte.c | 2 |
1 files changed, 2 insertions, 0 deletions
diff --git a/src/mbyte.c b/src/mbyte.c index 2d18a2796a..0427f0ce3c 100644 --- a/src/mbyte.c +++ b/src/mbyte.c @@ -3454,6 +3454,8 @@ static convertStruct toLower[] = {0x1e900,0x1e921,1,34} }; +// Note: UnicodeData.txt does not define U+1E9E as being the corresponding upper +// case letter for U+00DF (ß), however it is part of the toLower table static convertStruct toUpper[] = { {0x61,0x7a,1,-32}, |