diff options
author | pgen <p.gen.progs@gmail.com> | 2022-02-01 00:12:59 +0100 |
---|---|---|
committer | pgen <p.gen.progs@gmail.com> | 2022-02-01 00:12:59 +0100 |
commit | 947189af997e35824f7e00961fe24116ba6059b9 (patch) | |
tree | 6b2ef76cce1163b2eedcd5898e045a37d15faec8 /smenu.1 | |
parent | a67de73f4557335f783b238c1924e9d4deb36176 (diff) |
Add a note on canonical UTF-8 forms in the manual
Diffstat (limited to 'smenu.1')
-rw-r--r-- | smenu.1 | 10 |
1 files changed, 10 insertions, 0 deletions
@@ -473,6 +473,16 @@ This utility is Unicode aware and should be able to display correctly any Unicode character (even double-width ones) as long as the current encoding is \fBUTF-8\fP (\fBUTF-8\fP in the output of the \fIlocale\fP command). + +Note that smenu will not attempt to normalize words containing UTF-8 glyphs. +Thus \fI\\u61\\ucc88\fP (\fIä\fP) will not be considered equal to +\fI\\uc3a4\fP (canonical normalization of \fIä\fP). +It is nevertheless possible to use an external tool such as uconv from the +ICU project (https://icu.unicode.org) to do this work before using smenu. + +For example: uconv can be used as a filter as in: + +\f(CBcat ... | uconv -x any-nfc | smenu\fP .SS Configuration If a file with adequate permissions and the same name as the executable but prefixed with a dot is present in the current directory |