Re: htdig: ht://dig doesn't work.


Phillip Morgan (admin@clam.ehcs.com.au)
Mon, 28 Sep 1998 18:10:39 +1000


Hello webmaster,

> > I'm having terrible problems getting ht://dig to work on my SLackware Linux system
> >(kernel 2.0.30).

> You might try setting your start point to
>
> http://www.ehcs.com.au/index.htm
>
> and leave your limit_urls at http://www.ehcs.com.au. Use rundig to
> rebuild from scratch.
>
> I found I needed to point my htdig at my main navigation page, rather
> than the home page address. If this works, I think you are then going to
> have trouble with your frames...

Sigh. It never ends.... :-(

Initially, not knowing anything about search engines, I thought it would simply index all
pages in the document rout specified as start_utl. Then it became obvious htdig actually
follows [some] links through the web pages.

What's irking me is why it is only some, not others. I have two systems.
http://www.ehcs.com.au is the primary machine, ftp://ftp.ehcs.com.au is another. The web
pages are currently stored at http://www.ehcs.com.au, which is really
/usr/local/etc/httpd/htdocs.

Here's an extract from the chain of web files.. The first is index.htm, which points to
ftpmenu.htm which has four buttons.

[index.htm]
a href="ftpmenu.htm" TARGET="main" onMouseOver="window.status='Visit our HUGE FTP site and
download everything for free!' ;return true" onMouseOut="window.status='';return true">
  <IMG SRC="./pics/idxbut10.gif" BORDER=0 ALT="Huge FTP Archive."></A>

--

[ftpmenu.htm] <a href="htdigsrch.html"><img src="./pics/idxbut23.gif" border=0 alt="Use our search engine to locate the files you need"></a>

<a href="filelogo.htm"><img src="./pics/idxbut22.gif" border=0 alt="Browse through the files by category"></a>

<a href="ftp://ftp.ehcs.com.au"><img src="./pics/idxbut24.gif" border=0 alt="Browse through the files via text directory listings"></a>

<a href="ftpindex.htm"><img src="./pics/idxbut21.gif" border=0 alt="View the HUGE list of files by filename."></a>

The third button (idxbut24.gif), brings up the text based directory listing, which is probably obvious from the code. The last (idxbut21.gif), is a list of every file on the ftp machine, as shown by the next few lines...

--

[ftpindex.htm] <A HREF="ftp://ftp.ehcs.com.au/lists/allfiles.zip"><B>allfiles.zip</B></A> <A HREF="ftp://ftp.ehcs.com.au/lists/xxxfiles.zip"><B>xxxfiles.zip</

htdig won't catalog these pages. In fact, it won't even catalog ftpindex.htm, but it does catalog ftpmenu.htm

Has me baffled :-(

-- 
cheers,

Phillip Morgan,

email: admin@ehcs.com.au fax (03) 9876 5294 vox 0419 874 804 (03) 9876 5295 ---------------------------------------------------------------------- To unsubscribe from the htdig mailing list, send a message to htdig-request@sdsu.edu containing the single word "unsubscribe" in the body of the message.



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:27:52 PST