utf-8 in author field

Michal Sojka sojkam1 at fel.cvut.cz
Sat Oct 30 05:33:17 PDT 2010


On Sat, 30 Oct 2010, Carl Worth wrote:
> On Mon, 17 May 2010 09:56:27 +0200, Michal Sojka <sojkam1 at fel.cvut.cz> wrote:
> > On Fri, 14 May 2010, Igor Shenderovich wrote:
> > > What should one do to see the true list of authors?
> > 
> > I encounter the same when headers are not encoded properly according to
> > RFC 2047. I commonly see the violation of section 5, paragraph (3),
> > sentence "An 'encoded-word' MUST NOT appear within a 'quoted-string'".
> > That is when the encoded word is enclosed in double quotes. I guess, the
> > "problem" is not only notmuch related, but all users of gmime library
> > must be affected.
> [...]
> My guess is that the best we could do is to come up with some heuristics
> for recognizing a non-RFC-compliant header here and munging it. And the
> heuristics could then fail with messages that were RFC-compliant and
> intentionally including a string of characters that would match the
> heuristic, (which would presumably be rare, but not impossible---so
> perhaps we would then need some configuration).

I think that other, more mature users of the gmime library (Evolution?)
must use such heuristics. We may look for inspiration there.
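
To make the idea concrete, here is a minimal sketch of one such
heuristic, written in Python for brevity rather than in the C that
notmuch and gmime use. It is not what gmime or Evolution actually do;
the function names (decode_word, unquote_encoded_words, decode_lenient)
and the simplified regular expressions are my own assumptions. It drops
the double quotes around any quoted-string that contains something
shaped like an RFC 2047 encoded-word, then decodes the encoded-words.

```python
import base64
import quopri
import re

# RFC 2047 encoded-word: =?charset?encoding?encoded-text?=
ENCODED_WORD = re.compile(r"=\?([^?\s]+)\?([bBqQ])\?([^?\s]*)\?=")
# Simplification (assumption): a quoted-string with no escaped characters.
QUOTED_STRING = re.compile(r'"([^"\\]*)"')


def decode_word(match):
    """Decode one encoded-word regex match to text."""
    charset, encoding, payload = match.groups()
    data = payload.encode("ascii")
    if encoding.lower() == "b":
        raw = base64.b64decode(data)
    else:
        # header=True makes "_" decode to a space, as the "Q" encoding requires.
        raw = quopri.decodestring(data, header=True)
    # An unknown charset raises LookupError; a real implementation
    # would need to handle that case.
    return raw.decode(charset, errors="replace")


def unquote_encoded_words(header):
    """Drop the quotes around quoted-strings that contain an encoded-word."""
    def fix(match):
        inner = match.group(1)
        return inner if ENCODED_WORD.search(inner) else match.group(0)
    return QUOTED_STRING.sub(fix, header)


def decode_lenient(header):
    """Apply the heuristic, then decode every encoded-word found."""
    return ENCODED_WORD.sub(decode_word, unquote_encoded_words(header))


if __name__ == "__main__":
    print(decode_lenient(
        '"=?utf-8?q?Jes=C3=BAs_M=2E?= Navarro" <jesus.navarro at undominio.net>'
    ))
```

As Carl points out, this misfires on an RFC-compliant header whose
quoted-string intentionally contains text that looks like an
encoded-word, so it would probably have to be configurable.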

> Anyway, if one of you could send an example of a misbehaving message, I
> might like to look at it and perhaps add it to the test suite to see if
> there's anything we can safely do about it.

I attach one such message.

-Michal

-------------- next part --------------
An embedded message was scrubbed...
From: "=?utf-8?q?Jes=C3=BAs_M=2E?= Navarro" <jesus.navarro at undominio.net>
Subject: Re: debian can be better
Date: Wed, 27 Oct 2010 16:01:06 +0200
Size: 4231
URL: <http://notmuchmail.org/pipermail/notmuch/attachments/20101030/4f65b090/attachment.eml>

