Subject: Re: [htdig3-dev] readable doc_index
From: Erik Buelinckx (erik@buelinckx.net)
Date: Wed Jun 07 2000 - 14:10:28 PDT
At 08:27 -0500 07-06-2000, Geoff Hutchison wrote:
>At 2:05 AM +0100 6/7/00, Erik Buelinckx wrote:
>>i'm new to this list (hoping i am on the right place) and new to htdig. I would like to know if it's possible (and if so, how) i could get a readable version of the by doc_index generated db file. Like doc_list does for doc_db.
>
>I'm not sure why you'd want one. The file is simply a lookup from URLs to DocIDs. So the db.docs file will include this information.
>
>-Geoff
>
Well, someone asked me to create a tabled list of words and the documents they occur in. This is exactly the kind of list "db.wordlist" is. But in that file, documents are referred to by an id number and i wanted an easy readable list of the indexed documents and their matching id's which are i believe in this "doc_index" file.
I'm talking about 3000 documents, so i'm not to eager to check them by hand. Now if i would have a readable list i could put that in a some; standard database program and play with it to create the (huge huge) word list my client asks for.
I checked out the berkeley db initiative but since i'm not a programmer i didn't find an easy converter tool for this format to something humanly readable.
So maybe this clears things a bit up and someone could come up with an easy solution for my problem. This is a one time operation. I would appreciate any input.
-Erik
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
htdig3-dev-unsubscribe@htdig.org
You will receive a message to confirm this.
This archive was generated by hypermail 2b28 : Wed Jun 07 2000 - 11:17:21 PDT