Re: [htdig] Deleting files


Geoff Hutchison (ghutchis@wso.williams.edu)
Mon, 1 Nov 1999 18:24:15 -0600


At 5:08 PM +0200 10/28/99, J. op den Brouw wrote:
>I'm short of disk space.
>See below:
>
><msql@pluto:187> ls -al
>total 131656
>drwxr-xr-x 2 msql www 512 Oct 9 21:28 ./
>drwxrwxr-x 7 msql www 512 Oct 6 1998 ../
>-rw-r--r-- 1 msql www 29069312 Oct 9 21:26 db.docdb
>-rw-r--r-- 1 msql www 22797624 Oct 9 21:10 db.docs
>-rw-r--r-- 1 msql www 935936 Oct 9 21:26 db.docs.index
>-rw-r--r-- 1 msql www 63430 Oct 9 21:28 db.images.sorted.gz
>-rw-r--r-- 1 msql www 128175 Oct 9 21:27 db.urls.sorted.gz
>-rw-r--r-- 1 msql www 37314560 Oct 9 21:25 db.wordlist
>-rw-r--r-- 1 msql www 44359680 Oct 9 21:24 db.words.db
><msql@pluto:188>

I don't think anyone got back to you. You don't need db.urls.* or
db.images.* or db.docs. These are all auxilliary files generated by
various options to htdig. Basically, they're a list of the URLs
observed during the dig, the images observed, and an ASCII version of
the document DB.

I wouldn't recommend removing db.wordlist, but you don't need it for
searches. It's useful since it allows you to do an update instead of
an initial dig for subsequent indexing.

-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig@htdig.org containing the single word unsubscribe in
the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Mon Nov 01 1999 - 16:47:29 PST