[PATCH v3 00/13] Implement and use database "features"
Austin Clements
amdragon at MIT.EDU
Fri Aug 8 11:18:57 PDT 2014
Quoth Tomi Ollila on Aug 08 at 12:55 am:
> On Fri, Aug 01 2014, Austin Clements <amdragon at MIT.EDU> wrote:
>
> > This is v3 of id:1406652492-27803-1-git-send-email-amdragon at mit.edu.
> > This fixes one issue and tidies up another thing in
> > notmuch_database_upgrade I found while working on change tracking
> > support. Most of the patches are logically identical to v2, but the
> > changes ripple through the patch context, so I'm sending a new series.
> >
> > First, this updates notmuch->features before starting the upgrade,
> > rather than after, so that functions called by upgrade will use the
> > new database features instead of the old (this didn't matter in this
> > series because nothing modified the database differently depending on
> > features). Second, this combines multiple _notmuch_message_sync calls
> > into one, which cleans up the code, should further improve upgrade
> > performance, and makes way for additional per-message upgrades.
>
> This series looks good to me. I looked through the diffs a few times and
> notmuch_database_upgrade() in lib/database.cc to see that in full.
> Tests pass (also T530-upgrade now that I downloaded that one test database.)
>
> I googled answers to few questions along the review; one thing still
> interests me -- is there potential to have speed/memory problems
> when doing upgrade transaction in large/very large databases. And
> how long will the (final) commit_transaction() take (i.e how
> many times handle_sigalrm() is called while that is in progress...)
>
> (my guess is that this transaction just builds a new revision and
> if it is never committed the revision is never used -- and data is
> written there in some batches of suitable size -- so memory usage
> and transaction commit time is O(1))
Your guess is basically right. Xapian periodically flushes stuff to
disk during a transaction basically just like it does when not in a
transaction; AFAIK the only difference is when it flushes out the new
revision number and updated free block metadata. Hence, speed and
memory use aren't affected by the use of a transaction, and
commit_transaction isn't particularly sensitive to how big the
transaction is.
> Tomi
>
> >
> > The diff from v2 is below (excluding patch 2, which David pushed).
> >
> > diff --git a/lib/database.cc b/lib/database.cc
> > index b323691..d90a924 100644
> > --- a/lib/database.cc
> > +++ b/lib/database.cc
> > @@ -1252,6 +1252,10 @@ notmuch_database_upgrade (notmuch_database_t *notmuch,
> > /* Perform the upgrade in a transaction. */
> > db->begin_transaction (true);
> >
> > + /* Set the target features so we write out changes in the desired
> > + * format. */
> > + notmuch->features = target_features;
> > +
> > /* Perform per-message upgrades. */
> > if (new_features &
> > (NOTMUCH_FEATURE_FILE_TERMS | NOTMUCH_FEATURE_BOOL_FOLDER)) {
> > @@ -1280,7 +1284,6 @@ notmuch_database_upgrade (notmuch_database_t *notmuch,
> > if (filename && *filename != '\0') {
> > _notmuch_message_add_filename (message, filename);
> > _notmuch_message_clear_data (message);
> > - _notmuch_message_sync (message);
> > }
> > talloc_free (filename);
> > }
> > @@ -1289,10 +1292,10 @@ notmuch_database_upgrade (notmuch_database_t *notmuch,
> > * probabilistic and stemmed. Change it to the current
> > * boolean prefix. Add "path:" prefixes while at it.
> > */
> > - if (new_features & NOTMUCH_FEATURE_BOOL_FOLDER) {
> > + if (new_features & NOTMUCH_FEATURE_BOOL_FOLDER)
> > _notmuch_message_upgrade_folder (message);
> > - _notmuch_message_sync (message);
> > - }
> > +
> > + _notmuch_message_sync (message);
> >
> > notmuch_message_destroy (message);
> >
> > @@ -1348,7 +1351,6 @@ notmuch_database_upgrade (notmuch_database_t *notmuch,
> > }
> > }
> >
> > - notmuch->features = target_features;
> > db->set_metadata ("features", _print_features (local, notmuch->features));
> > db->set_metadata ("version", STRINGIFY (NOTMUCH_DATABASE_VERSION));
> >
More information about the notmuch
mailing list