[PATCH 1/7] test: add known broken test for indexing html

David Bremner david at tethera.net
Thu Apr 20 03:05:51 PDT 2017


David Bremner <david at tethera.net> writes:

> 'quite' on IRC reported that notmuch new was grinding to a halt during
> initial indexing, and we eventually narrowed the problem down to some
> html parts with large embedded images. These cause the number of terms
> added to the Xapian database to explode (the first 400 messages
> generated 4.6M unique terms), and of course the resulting terms are
> not much use for searching.
>
> The second test is sanity check for any "improved" indexing of HTML.

pushed the first patch in the series to master.

d


More information about the notmuch mailing list