Re: [htdig] Digging problem, probably css-related?


Subject: Re: [htdig] Digging problem, probably css-related?
From: Thomas Rother (t.rother@gaia.de)
Date: Mon Sep 04 2000 - 14:53:46 PDT


Geoff Hutchison wrote:

> I can't imagine why the text parser would do anything strange with
> CSS. However, you probably don't want to index CSS files, so I'd add
> .css to bad_extensions in your config file:
>

Thanks for that hint, Geoff, but I now have the following directives in
htdig.conf:

    exclude_urls: /cgi-bin/ .cgi .css /suchdb/ Msgs mh.rsc suck htdig mhonarc
    bad_extensions: .css .htaccess
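
As far as I understand it, exclude_urls is meant to reject any URL that contains one of the listed patterns as a substring. A minimal Python sketch of that assumption follows; it is not htdig's actual code, and the name first_excluded is made up purely for illustration:

```python
# Sketch only: mimics a substring-based exclude list, not htdig's real code.
# The patterns are copied from the exclude_urls directive above.
EXCLUDE_URLS = "/cgi-bin/ .cgi .css /suchdb/ Msgs mh.rsc suck htdig mhonarc".split()


def first_excluded(url, patterns=EXCLUDE_URLS):
    """Return (1-based position, pattern) of the first pattern found in url.

    Returns None when no pattern occurs anywhere in the URL.
    """
    for position, pattern in enumerate(patterns, start=1):
        if pattern in url:  # plain substring test (assumed behaviour)
            return position, pattern
    return None


if __name__ == "__main__":
    print(first_excluded("http://intraweb.gaia.de/gaiaev/css/gaia.css"))
    print(first_excluded("http://intraweb.gaia.de/gaiaev/images/gaialgs.jpg"))
```

Under that assumption, the gaia.css URL is caught by the third pattern (.css, four characters long), while the image URL matches nothing in the list.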

and still I get:

    href: http://intraweb.gaia.de/gaiaev/css/gaia.css ()

      Rejected: Item in the exclude list: item # 3 length: 4

    url rejected: (level 1)http://intraweb.gaia.de/gaiaev/css/gaia.css

    title: GAIA WEB INTERN Titelseite
    image: http://intraweb.gaia.de/gaiaev/images/gaialgs.jpg
    image: http://intraweb.gaia.de/gaiaev/images/minimize.gif
     size = 2986
    pick: intraweb.gaia.de, # servers = 1
    data:/opt/www/htdig/bin #

I don't understand at all ...

??

Thomas

--
-----------------------------------------------------------
Thomas M. ROTHER  -- 73728 Esslingen -- EU/Germany
mailto:t.rother@netzwissen.de - http://www.netzwissen.de
Public PGP key at http://www.keyserver.net
-----------------------------------------------------------
