Re: htdig: Has anyone merged databases?


Scot Needy (scot@finch.engrs.infi.net)
Thu, 23 Oct 1997 14:16:05 -0400


I really do need some input on this..
I have simplified my questions to yes-or-no answers
so you don't have to take the time to come up with a long reply.
I have actually answered them myself; I just need some verification that
I am or am not off my rocker, so to speak.

Please help
Thanks

I can see that the merge is on the "List o things to do"
but I was hoping there was someone out there who has done it.
yes?

y|n htmerge -a will create the *.*.work files, which must be manually moved
    to *.*

y|n db.docdb - Live database of URLs and information on each URL. This file
    is created by htdig and is used by htmerge to create db.docs.index.
    db.docdb cannot be deleted.

y|n db.docdb.work - Copy being built by an htdig -a run. This file must be
    moved to db.docdb to activate the new index.

y|n db.docs.index - The index used to look up all document matches.
    Kind of like an index in an mSQL/Sybase database.

y|n db.docs.index.work - Copy of the index being built. This file must be
    moved into place when the re-index is finished.

y|n db.wordlist.new - Temporary file which usually contains nothing.
    Delete it with extreme prejudice if you find one.

y|n db.wordlist.work - Temporary file used to write sorted data out. If you
    use the -w option on htmerge, you had better have set the environment
    variable $TMPDIR in your shell script and have enough room in TMPDIR
    to hold the wordlist.

y|n db.wordlist.work.new - Parent of db.wordlist.new, come to mess you up
    because you killed her db.wordlist.new. Better kill this one too.

y|n db.words.gdbm - gdbm database created by htmerge and used by htsearch.

y|n db.words.gdbm.work - Built by an htmerge -a run. This file needs to be
    moved to db.words.gdbm to make it live, like the others.
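
For anyone who wants to script the manual step described above, here is a
minimal sketch in Python of moving each .work file onto its live name. It
is not part of ht://Dig. The function name promote_work_files and the list
LIVE_NAMES are made up for illustration, and the list only covers the three
files that the descriptions above say must be moved into place. It assumes
the understanding in those descriptions is correct, which is exactly what
is being asked here.

```python
import os
from pathlib import Path

# Assumption: these are the live names whose ".work" copies must be moved
# into place, taken from the file descriptions in this post.
LIVE_NAMES = ["db.docdb", "db.docs.index", "db.words.gdbm"]


def promote_work_files(db_dir, names=LIVE_NAMES):
    """Rename <name>.work to <name> in db_dir and return the names promoted.

    All work files are checked before anything is renamed, so a missing
    file raises an error without leaving a half-updated database.
    """
    db_dir = Path(db_dir)
    work_paths = {name: db_dir / (name + ".work") for name in names}

    missing = [name for name, path in work_paths.items() if not path.is_file()]
    if missing:
        raise FileNotFoundError(f"missing work files for: {missing}")

    promoted = []
    for name, path in work_paths.items():
        # os.replace overwrites an existing destination; on the same
        # filesystem each individual rename is atomic.
        os.replace(path, db_dir / name)
        promoted.append(name)
    return promoted
```

It would be called with the database directory once the htdig -a and
htmerge -a runs have finished, for example promote_work_files("/path/to/db")
where the path is a placeholder. The three renames are not atomic as a
group, so a search arriving between them could still see a mix of old and
new files.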

 Well how did I do?
 Thanks
 Scot

Scot Needy wrote:

> Hi;
>
> While I am asking all these questions....
>
> Has anyone merged multiple databases?
> It seems I could dig then cat the respective files together to create
> a larger word db then run htmerge on the new file?
>
> Also, forgive me if I have missed this in the documentation, but
> could someone tell me what each of these files really does and
> the process by which htdig/htmerge creates or uses them?
> I understand the .work files are alternate files but...
>
> db.docdb
> db.docdb.work
> db.docs.index
> db.docs.index.work
> db.wordlist.new
> db.wordlist.work
> db.wordlist.work.new
> db.words.gdbm
> db.words.gdbm.work
>
> ----------------------------------------------------------------------
> To unsubscribe from the htdig mailing list, send a message to
> htdig-request@sdsu.edu containing the single word "unsubscribe" in
> the body of the message.




This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:25:11 PST