Bug: fatal error with notmuch new, second run starts indexing all over again
Gregor Zattler
telegraph at gmx.net
Thu Jul 13 00:04:44 PDT 2017
Hi David, notmuch developers,
* David Bremner <david at tethera.net> [09. Jul. 2017]:
> Gregor Zattler <telegraph at gmx.net> writes:
>> Short version: Some of the messages on this mailinglist had very
>> weired References: headers mot probably causing notmuch to
>> misbehave while threading the messages. But then there was no
>> xapian exeption involved.
>>
>
> Right. I looks like there are indeed many message-ids in the resulting
> database that look like valid email addresses. So that problem persists,
> despite an attempted fix discussed in that thread.
>
> I was wondering if the exception could result from overflowing some
> internal limit due to threads many thousands of messages long. I will be
> hard to know until I can replicate the exception. Perhaps a more
> complete org-mode list archive would do it.
Quite possible, when I do
notmuch show --entire-thread=true --format=mbox path:Mail/~ml/emacs-orgmode at gnu.org/** date 64 bit 32 > /tmp/emo.mbox
and open the resulting mbox with mutt, it shows 10199 messages.
There is fun to have regarding message ids:
0 grfz at len:/tmp$ grep -c "^Message-ID:[[:space:]]" eom.mbox
8758
0 grfz at len:/tmp$ grep -ci "^Message-ID:[[:space:]]" eom.mbox
10174
I then split the mbox with mutt in a maildir folder. This were
10199 individual files. I then extracted the message ids via
formail from this files and piped this through sort -u. These
were 10105 message ids.
The maildir with the emacs-orgmode at gnu.org emails contained 114563
individual files. I removed every file somewhere containing one
of those message ids. Now there were only 102164 individual files
in the maildir.
And after that I indexed this cleaned up maildir. If somehow
this message threading problem affects the indexing it should now
index the files without xapian exeption:
This is with newest notmuch:
0 grfz at len:/tmp$ /home/grfz/src/notmuch/notmuch --version
notmuch 0.24.2+112~g37d1fa5
0 grfz at len:/tmp$ gdb --args /home/grfz/src/notmuch/notmuch new
GNU gdb (Debian 7.12-6) 7.12.0.20161007-git
Copyright (C) 2016 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
Type "apropos word" to search for commands related to "word"...
Reading symbols from /home/grfz/src/notmuch/notmuch...done.
(gdb) b _notmuch_database_log
Breakpoint 1 at 0x1f6e0: file lib/database.cc, line 426.
(gdb) run
Starting program: /home/grfz/src/notmuch/notmuch new
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Found 102164 total files (that's not much mail).
Processed 102164 total files in 5m 58s (284 files/sec.).
Added 102151 new messages to the database.
[Inferior 1 (process 26339) exited normally]
(gdb)
While indexing the original emacs-org mode mailing list maildir
with it's 114563 files with the very same binary results in a
xapian exeption:
0 grfz at len:/tmp$ gdb --args /home/grfz/src/notmuch/notmuch new
GNU gdb (Debian 7.12-6) 7.12.0.20161007-git
Copyright (C) 2016 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
Type "apropos word" to search for commands related to "word"...
Reading symbols from /home/grfz/src/notmuch/notmuch...done.
(gdb) b _notmuch_database_log
Breakpoint 1 at 0x1f6e0: file lib/database.cc, line 426.
(gdb) run
Starting program: /home/grfz/src/notmuch/notmuch new
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Found 114563 total files (that's not much mail).
Breakpoint 1, _notmuch_database_log (notmuch=0x5555557d2780, format=0x55555558d258 "A Xapian exception occurred adding message: %s.\n") at lib/database.cc:426
426 {
(gdb) bt
#0 _notmuch_database_log (notmuch=0x5555557d2780, format=0x55555558d258 "A Xapian exception occurred adding message: %s.\n") at lib/database.cc:426
#1 0x0000555555577f58 in notmuch_database_add_message (notmuch=notmuch at entry=0x5555557d2780, filename=filename at entry=0x555557ae2070 "/home/grfz/Mail/~ml/emacs-orgmode at gnu.org/cur/1488666090.R8081408151026683680.len:2,",
message_ret=message_ret at entry=0x7fffffffd2c8) at lib/database.cc:2597
#2 0x000055555556802f in add_file (state=0x7fffffffd540, filename=0x555557ae2070 "/home/grfz/Mail/~ml/emacs-orgmode at gnu.org/cur/1488666090.R8081408151026683680.len:2,", notmuch=0x5555557d2780) at notmuch-new.c:264
#3 add_files (notmuch=notmuch at entry=0x5555557d2780, path=path at entry=0x5555557d25f0 "/home/grfz/Mail/~ml/emacs-orgmode at gnu.org/cur", state=state at entry=0x7fffffffd540) at notmuch-new.c:599
#4 0x0000555555567b44 in add_files (notmuch=0x5555557d2780, path=path at entry=0x5555557d4f90 "/home/grfz/Mail/~ml/emacs-orgmode at gnu.org", state=state at entry=0x7fffffffd540) at notmuch-new.c:483
#5 0x00005555555689ed in notmuch_new_command (config=0x5555557ce1d0, argc=<optimized out>, argv=<optimized out>) at notmuch-new.c:1099
#6 0x0000555555561a27 in main (argc=<optimized out>, argv=0x7fffffffda68) at notmuch.c:456
(gdb) continue
Continuing.
Error: A Xapian exception occurred. Halting processing.
Processed 69364 total files in 25m 59s (44 files/sec.).
Added 69350 new messages to the database.
Note: A fatal error was encountered: A Xapian exception occurred
[Inferior 1 (process 27880) exited with code 01]
(gdb) continue
The program is not being run.
(gdb)
The referenced file is the second one I attached in this thread.
So your/Davids intuition was right. The problem has to do with
this threading problem.
I also did extensive memtest and every kind of smartctl test
possible, to be sure this is no hardware problem.
I then downloaded parts of the archive, merged and sliced it and
now I have a sample of 25001 total files (that's not much mail),
on which notmuch new produces a xapian exeption:
0 (master *) grfz at len:~$ gdb --args /home/grfz/src/notmuch/notmuch new
GNU gdb (Debian 7.12-6) 7.12.0.20161007-git
Copyright (C) 2016 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
Type "apropos word" to search for commands related to "word"...
Reading symbols from /home/grfz/src/notmuch/notmuch...done.
(gdb) b _notmuch_database_log
Breakpoint 1 at 0x1f6e0: file lib/database.cc, line 426.
(gdb) run
Starting program: /home/grfz/src/notmuch/notmuch new
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Found 25001 total files (that's not much mail).
Breakpoint 1, _notmuch_database_log (notmuch=0x5555557d27f0, format=0x55555558d258 "A Xapian exception occurred adding message: %s.\n") at lib/database.cc:426
426 {
(gdb) bt
#0 _notmuch_database_log (notmuch=0x5555557d27f0, format=0x55555558d258 "A Xapian exception occurred adding message: %s.\n") at lib/database.cc:426
#1 0x0000555555577f58 in notmuch_database_add_message (notmuch=notmuch at entry=0x5555557d27f0, filename=filename at entry=0x555558dc96c0 "/tmp/reduced-sample/cur/1499897912.R1843131171398763530.len:2,",
message_ret=message_ret at entry=0x7fffffffd2e8) at lib/database.cc:2597
#2 0x000055555556802f in add_file (state=0x7fffffffd560, filename=0x555558dc96c0 "/tmp/reduced-sample/cur/1499897912.R1843131171398763530.len:2,", notmuch=0x5555557d27f0) at notmuch-new.c:264
#3 add_files (notmuch=notmuch at entry=0x5555557d27f0, path=path at entry=0x5555557e7950 "/tmp/reduced-sample/cur", state=state at entry=0x7fffffffd560) at notmuch-new.c:599
#4 0x0000555555567b44 in add_files (notmuch=0x5555557d27f0, path=path at entry=0x5555557cf710 "/tmp/reduced-sample", state=state at entry=0x7fffffffd560) at notmuch-new.c:483
#5 0x00005555555689ed in notmuch_new_command (config=0x5555557ce1d0, argc=<optimized out>, argv=<optimized out>) at notmuch-new.c:1099
#6 0x0000555555561a27 in main (argc=<optimized out>, argv=0x7fffffffda88) at notmuch.c:456
(gdb)
As a tar.xz it weighs 28 MB. I could provide this for download
if someone is interested.
Ciao, Gregor
More information about the notmuch
mailing list