[htdig] Duplicate results with directories & some questions/bugs?


Subject: [htdig] Duplicate results with directories & some questions/bugs?
From: Evelio Martinez (evelio.martinez@testanet.com)
Date: Fri Jan 19 2001 - 03:27:29 PST


Hello!

Finally I have configured htdig. It works great.

I have 2 doubts. I do not know if it is a bug or something bad
configured.

1) If I search for a word that match the name of a directory and that is
inside a file under that
     directory I get "duplicate" results, ( 6,8, 9 times the same link).

     How can it be fix ?
      What does it mean the links endings? ?N=D ?M=D ?S=D ...
     By the way, the subdirectory doc is a symlink. I do not know it it
has something to do with this wrong result.

     As an example you have some following:

Index of /recursos/doc/tecnica/inktomi
      Index of /recursos/doc/tecnica/inktomi Name Last modified Size
Description [DIR] Parent Directory 16-Jan-2001 14:47 - [ ]
robots_white_paper.pdf 09-Jan-2001 14:20 246k [TXT]
      instalar 10-Jan-2001 12:31 3k [ ] database.doc 09-Jan-2001 14:15
58k [ ] 4.0_Admin_Guide.pdf 09-Jan-2001 09:40 1 ...
      http://correo.testanet.com/recursos/doc/tecnica/inktomi/?N=D ,
1027 bytes

Index of /recursos/doc/tecnica/inktomi
      Index of /recursos/doc/tecnica/inktomi Name Last modified Size
Description [DIR] Parent Directory 16-Jan-2001 14:47 - [TXT] instalar
10-Jan-2001 12:31 3k [ ]
      robots_white_paper.pdf 09-Jan-2001 14:20 246k [ ] database.doc
09-Jan-2001 14:15 58k [ ] 4.0_Admin_Guide.pdf 09-Jan-2001 09:40 1 ...
      http://correo.testanet.com/recursos/doc/tecnica/inktomi/?M=D ,
1027 bytes

Index of /recursos/doc/tecnica/inktomi
      Index of /recursos/doc/tecnica/inktomi Name Last modified Size
Description [DIR] Parent Directory 16-Jan-2001 14:47 - [ ]
4.0_Admin_Guide.pdf 09-Jan-2001 09:40 1.4M [ ]
      robots_white_paper.pdf 09-Jan-2001 14:20 246k [ ] database.doc
09-Jan-2001 14:15 58k [TXT] instalar 10-Jan-2001 12:31 ...
      http://correo.testanet.com/recursos/doc/tecnica/inktomi/?S=D ,
1027 bytes

Index of /recursos/doc/tecnica/inktomi
      Index of /recursos/doc/tecnica/inktomi Name Last modified Size
Description [DIR] Parent Directory 16-Jan-2001 14:47 - [ ]
robots_white_paper.pdf 09-Jan-2001 14:20 246k [TXT]
      instalar 10-Jan-2001 12:31 3k [ ] database.doc 09-Jan-2001 14:15
58k [ ] 4.0_Admin_Guide.pdf 09-Jan-2001 09:40 1 ...
      http://correo.testanet.com/recursos/doc/tecnica/inktomi/?D=D ,
1027 bytes

2) The apache DocumentRoot is /home/httpd/html and I have the
/home/httpd/recursos as "brother" not "son" and password-protected.

The thing is that documents in directories under /home/httpd/html are
not indexed by htdig unless I write down expressly in the htdig.conf
start_url: http://correo.testanet.com \
                        http://correo.testanet.com/recursos/ \
                        http://correo.testanet.com/manual/ \
                        ... etc.

BUT if I use the -u user:password flag it behaves in a recursive way
and I do not need to write all the subdirectories in the conf file.

Is this normal??

Thanks in advance

--
Evelio Martínez
Testanet. Dept. desarrollo software.
Av. Reino de Valencia, 15 - 5
46005 Valencia (Spain)
Tel: +34 96 395 90 00
Fax: +34 96 316 23 19



This archive was generated by hypermail 2b28 : Fri Jan 19 2001 - 03:42:22 PST