regex [X-Z] with non-ascii char returns different results from (X|Y|Z)

Tomi Ollila tomi.ollila at iki.fi
Thu Aug 22 12:53:34 PDT 2019


On Thu, Aug 22 2019, David Bremner wrote:

> "yury.t" <tptlab at tuta.io> writes:
>
>> Thank you for your reply.
>> I confirmed that the issue is reproduced in C program. https://pastebin.com/5NaCM45G <https://pastebin.com/5NaCM45G>
>>
>> Sorry for bothering you...
>
> I'm not sure, but it might be a glibc bug. Since we are already using
> glib, maybe we should use
>
>       https://developer.gnome.org/glib/stable/glib-Perl-compatible-regular-expressions.html
>
> I don't know if it also has this problem with [] and non-ascii
> characters.

Since pcre2 supports \K that gives positive vibe about the above

( 'Resetting the match start' 
  in http://www.pcre.org/current/doc/html/pcre2pattern.html )

Tomi 

>
> d


More information about the notmuch mailing list