Synchronizing xs-0.3 and xo-??? --- backups
Tomeu Vizoso
tomeu at tomeuvizoso.net
Tue Apr 29 07:11:40 EDT 2008
On Tue, Apr 29, 2008 at 1:34 AM, Martin Langhoff
<martin.langhoff at gmail.com> wrote:
> On Tue, Apr 29, 2008 at 2:14 AM, Tomeu Vizoso <tomeu at tomeuvizoso.net> wrote:
> > > - Q: Does it make sense to always maintain the CJSON-formatted
> > > metadata anyway? (This would specially make sense if some of the DS
> > > corruption reports being discussed are tracked down to Xapian.)
> >
> > Extracting all the metadata from the index is a relatively expensive
> > process, that could affect very negatively the user experience while
> > it lasts.
>
> Is it? Ugh then. (I thought it'd be a cheap op!)
Right now it's slow, but it's as well one of the first things that we
should change in order to improve robustness.
> > I really recommend to do it in smaller chunks (one hundred
> > entries?). One single metadata file per entry looks best to me, and is
> > already implemented here:
> >
> > http://dev.laptop.org/git?p=users/krstic/ds-backup;a=summary
>
> Ah, ok. I must have mis-read it, I thought it was exporting to a
> single large metadata file.
You are right, sorry. Both things are easily doable.
> > > Full restore:
> > >
> > > We can simplify things here
> > > - If the XS is using rsync, the REST API returns a path, the XO
> > > rsyncs from there
> > > - If the XS is using git internally, there are more options it may
> > > force a delay (to get a temp checkout that will be held there for
> > > 24hs) or perhaps the client can just execute git archive and pipe the
> > > output to itself ;-)
> > > - After retrieval of files from the XS, the XO must rebuild its
> > > Xapian metadata store
> >
> > Should this happen after each entry has been copied?
>
> That would be _very_ costly on the server side. Let the rsync
> complete, and do a bulk import.
Ok, we just have to take in account that the bulk import will block
all activity requiring the DS.
> > > Single-item-restore:
> > >
> > > I'll punt on this in the short-term (as I have to prepare an XS
> > > release too!), but various avenues are open. We can support web-based
> > > zipfile download, specially since Tomeu's implemented the handling,
> > > and we can support Journal-based browse-and-restore. But want to get
> > > the major issues sorted first ;-)
> >
> > Sure. Robson may be interested in lending a hand here. At any point,
> > thinking now about the different possibilities may not do any harm.
>
> Definitely. Right now I am trying to sort out the plumbing that does
> the real job, and do the web-based stuff later ;-)
Cool.
Thanks,
Tomeu
More information about the Devel
mailing list