[PATCH 3/3] lib: Reject multi-message mboxes and deprecate single-message mbox

Tomi Ollila tomi.ollila at iki.fi
Sun Nov 25 11:57:27 PST 2012


On Sun, Nov 25 2012, Austin Clements <amdragon at MIT.EDU> wrote:

> Quoth Tomi Ollila on Nov 25 at  3:26 pm:
>> On Sun, Nov 25 2012, Austin Clements <amdragon at MIT.EDU> wrote:
>> 
>> > Previously, we would treat multi-message mboxes as one giant email,
>> > which, besides the obvious incorrect indexing, often led to
>> > out-of-memory errors for archival mboxes.  Now we explicitly reject
>> > multi-message mboxes.  For historical reasons, we retain support for
>> > single-message mboxes, but official deprecate this behavior.
>> 
>> 
>> The series looks good to me -- but I don't know about deprecating
>> single-message mboxes:
>> 
>> * If we someday support (read-only?) mbox format, then single-message
>>   mboxes are "normal" again.
>
> If notmuch does gain mbox support, then its handling of single-message
> mboxes will *definitely* change because it will stop doing
> maildir-like things to them (flag sync, moving from new to cur, etc),
> which people may currently be depending on.  This was one of the
> motivations for deprecating the current handling of single-message
> mboxes.
>
>> * Some naïve mb2md scripts could leave the 'From ' -line intact: for
>>   example `formail -bz -s head -3 < $MAIL`(*) can be used to demonstrate this
>
> I would call that "buggy", rather than "naïve".  ]:--8)
>
>> * Some people may have large collection of single-file messages starting
>>   with 'From ' currently indexed. If those are to be re-indexed later
>>   without "single-message mbox" support that is somewhat of a burden to
>>   the users (**)
>
> That's why this only deprecates them (with a warning) and doesn't drop
> support for them.  The idea is to keep the historical handling for a
> few releases and then we'll have the flexibility to do what we want
> with single-message mboxes (including supporting them as real mbox).
>
> It's probably a good idea to include a script or a wiki pointer for
> fixing single-message mboxes in the NEWS.  As long as the file name is
> kept the same, notmuch won't reindex it.

Ok, I'm convinced. +1

Tomi

>
>> (*) my "mb2md" wannabe does gnus-like "$formail" -bz -R 'From ' X-From-Line: ...
>> 
>> (**) Something like the following could be used to mangle "single-file mboxes"...
>>      find . -type f | xargs perl -e 'foreach (@ARGV) { open IO, "+<", $_ or
>>      next; sysread IO, $buf, 5; if ($buf eq "From ") { sysseek IO, 0, 0;
>>      syswrite IO, "Fro:"; }}' 
>>      This breaks the multi-message mbox nicely... >;)
>> 
>> 
>> Tomi
> _______________________________________________
> notmuch mailing list
> notmuch at notmuchmail.org
> http://notmuchmail.org/mailman/listinfo/notmuch


More information about the notmuch mailing list