How to index arbitrary headers?
Dmitrijs Ledkovs
xnox at debian.org
Thu Oct 4 01:17:59 PDT 2012
On 3 October 2012 19:32, Petri Savolainen <petri at koodaamo.fi> wrote:
> Hi,
>
> thanks for your response. I am evaluating notmuch / xapian for building an
> application for analyzing in various ways a fairly large number of emails
> accumulated over several years. I am afraid the number of headers that would
> ultimately need to be indexed is therefore quite a lot larger than what
> notmuch currently indexes.
>
> Petri
>
> 2012/10/1 Austin Clements <amdragon at mit.edu>
>>
>> Quoth Petri Savolainen on Oct 01 at 3:39 pm:
>> > Hello,
>> > I could not find information anywhere in notmuch docs about what is
>> > actually indexed - specifically, what email headers are indexed and
>> > searchable? If a header is not indexed, does searching for its value
>> > still
>> > result in a search hit?
>> > It would be nice if one could just provide the list of headers to be
>> > indexed in some configuration file or something.
>> > Thanks,
>> > Petri
>>
>> notmuch doesn't currently implement this, though it is an
>> oft-requested feature. One (not insurmountable) difficulty is that
>> the database would have to be rebuilt if a user-configured list of
>> headers changed and there are technical limitations that prevent us
>> from simply indexing all headers. Out of curiosity, what headers are
>> you interested in indexing?
>>
>> The currently indexed headers are described in man
>> notmuch-search-terms.
>
Use mapreduce instead: hadoop or discoproject or haddop with dumbo
should be faster.
Regards,
Dmitrijs.
More information about the notmuch
mailing list