RE: [htdig] Can't get my search to update correctly..


Subject: RE: [htdig] Can't get my search to update correctly..
From: Rivera, Tony (RiveraL@umsystem.edu)
Date: Fri Oct 06 2000 - 06:34:30 PDT


Thank you all for your help!!

I think I have *sorta* figured out what my problem is. I believe it's the
.work files. By default in htdig.conf the database_dir defaults to
/opt/www/htdig/db however, I didn't have enough room on that partition for
the db files, so I had to move them to a partition where I had enough free
space database_dir /home/htdig_db

Here is an example of what I am seeing:

[utopia@freya utopia]$ ll /opt/www/htdig/db
total 64424
-rw-rw-r-- 1 root root 27250688 Aug 29 09:26 db.docdb.work
-rw-r--r-- 1 root root 21328574 Aug 29 04:34 db.docs
-rw-rw-r-- 1 root root 696320 Aug 29 08:49 db.docs.index.work
-rw-rw-r-- 1 root root 8585456 Aug 29 09:26 db.wordlist.work
-rw-rw-r-- 1 root root 7845888 Aug 29 08:47 db.words.db.work
[utopia@freya utopia]$ ll /home/htdig_db
total 98628
-rw-r--r-- 1 root root 26348544 Oct 6 04:29 db.docdb
-rw-r--r-- 1 root root 654336 Oct 6 04:29 db.docs.index
-rw-r--r-- 1 root root 38597295 Oct 6 04:28 db.wordlist
-rw-r--r-- 1 root root 35264512 Oct 6 04:28 db.words.db
[utopia@freya utopia]$

By this I am seeing that all of the work files are still in the old default
directory /opt/www/htdig/db and have not been updated since aug-29. It was
Aug-29 that I made the change to my htdig.conf moving my database_dir to
/home/htdig_db. I can see my crontab is working because it's updating
everything in /home/htdig_db everynite like it should be.

My question now is...do I need to move those .work files from
/opt/www/htdig/db to /home/htdig_db to get my search to finally update?

Once again, thank you for you help!!

-----Original Message-----
From: Gilles Detillieux [mailto:grdetil@scrc.umanitoba.ca]
Sent: Thursday, October 05, 2000 12:39 PM
To: RiveraL@umsystem.edu
Cc: htdig@htdig.org
Subject: Re: [htdig] Can't get my search to update correctly..

According to Rivera, Tony:
> However, that's not working...about 5 days ago I added a new
> directory /www/itss and have made numerous links to it from my index page
> and various other pages on the server and it is still not getting picked
up
> when I do a search for it.

I assume these are HTML links and not JavaScript ones. One possibility is
that the pages you modified are actually dynamic content (SSI, PHP, etc.)
and so the server isn't returning a Last-Modified header. If this is the
case, htdig won't realize the pages have been modified. You can set the
modification_time_is_now attribute to true, but then htdig will reindex
all dynamic pages every time it runs.

> I am not quite sure what I am missing here...I have read through all the
> archives on the site, ran /opt/www/htdig/bin/htdig -v -a -s and
> /opt/www/htdig/bin/htmerge -v -a -s but is still doesn't update.

I assume you know about how to maintain the .work files and non-.work
files before and after running htdig and htmerge with the -a option?
If you're not copying the updated .work files to their non-.work locations,
then htsearch won't see the updates. See the contrib/examples/rundig.sh
script for an example of using the -a option for updates.

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:
http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this. List archives: <http://www.htdig.org/mail/menu.html> FAQ: <http://www.htdig.org/FAQ.html>



This archive was generated by hypermail 2b28 : Fri Oct 06 2000 - 06:38:50 PDT