accented characters

David Bremner david at tethera.net
Mon Nov 13 09:47:01 PST 2017


Stefano Zacchiroli <zack at debian.org> writes:

>
> Unicode has a notion of canonical form that rearrange accented
> characters in a sequence of non-accented characters + modifiers
> https://en.wikipedia.org/wiki/Unicode_equivalence . A bunch of libraries
> use that stuff to normalize-away accents in unicode strings. I'm aware
> of a few in Python for instance, but not in C++ (which I believe is what
> you'd be interested in).
>

Apropos, Rob Browning started looking at canonicalization using glib

in

        id:1440951676-17286-1-git-send-email-rlb at defaultvalue.org
        http://article.gmane.org/gmane.mail.notmuch.general/21004


More information about the notmuch mailing list