Re: htdig: excluding java script text


Taylor Marvin (taylor@mail.gkg-com.com)
Wed, 8 Jul 1998 14:26:01 -0500 (CDT)


I stand corrected. I did misunderstand you.

Thanks very much for the info. I'm going to give it a whirl and see how it
turns out.

Best Regards,

Taylor

> I think you may have misunderstood my suggestion... If you set up Muffin to
> strip javascript and then let htdig use Muffin as a proxy server, you will get
> an index of all the pages, but all the javascript crap will be gone for
> indexing purposes. Once you have built the index, any search results will
> point to the original, non-filtered pages.
>
> The fact that your pages are on the same machine that Muffin or htdig is
> running on makes no difference.
>
> To let htdig use a proxy server for indexing, set the 'http_proxy' attribute
> in your config file. Check the docs for http_proxy for the details.
>
> Taylor Marvin wrote:
> >
> > Andrew...
> >
> > I wish it were that easy. I cannot remove the javascript; it is the
> > driving force behind the website, meaning the site is menu driven
> > through javascript.
> >
> > Anyway, the htdig I'm running is on the same server as the site that
> > contains the javascript, so this wouldn't work. Even if it did, it
> > would not be a solution, since the majority of the pages contain
> > javascript. That would leave very few pages to index and defeat the
> > purpose.
> >
> > Thank you for your input though...I do welcome any other suggestions you
> > may have.
> >
> > Do you know if it is possible to exclude certain tags, such as comment
> > tags?
> >
> > Thanks again,
> >
> > Taylor
> >
> > > Taylor Marvin wrote:
> > > >
> > > > Hello,
> > > >
> > > > I was just wondering if anybody knew how to exclude the text inside
> > > > the javascript tags in an HTML doc.
> > > >
> > > > Thanks in advance,
> > > >
> > > > Taylor
> > >
> > > I hate javascript! :-(
> > > The simplest solution I know to get rid of javascript in documents that are
> > > being indexed is to use Muffin as a proxy server. You can find Muffin at
> > > http://muffin.doit.org/
> > >
> > > --
> > > Andrew Scherpbier <andrew@contigo.com>
> > > Contigo Software <http://www.contigo.com/>
> > > ----------------------------------------------------------------------
> > > To unsubscribe from the htdig mailing list, send a message to
> > > htdig-request@sdsu.edu containing the single word "unsubscribe" in
> > > the body of the message.
> > >
>
> --
> Andrew Scherpbier <andrew@contigo.com>
> Contigo Software <http://www.contigo.com/>
>
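
For illustration only, here is a minimal Python sketch of what the filtering
step in the suggestion above amounts to: the indexer sees the page text with
everything inside the script tags dropped, while the original page on the
server is left untouched. This is not Muffin's or htdig's code. The names
ScriptStripper and indexable_text are hypothetical, and the sketch assumes
the javascript lives in ordinary <script> elements.

```python
from html.parser import HTMLParser


class ScriptStripper(HTMLParser):
    """Collect page text, skipping everything inside <script> elements.

    Hypothetical helper for illustration; not part of Muffin or htdig.
    """

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.in_script = False
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        # Tag names arrive lowercased, so <SCRIPT> is caught as well.
        if tag == "script":
            self.in_script = True

    def handle_endtag(self, tag):
        if tag == "script":
            self.in_script = False

    def handle_data(self, data):
        # Keep only text that sits outside script elements.
        if not self.in_script:
            self.chunks.append(data)


def indexable_text(html):
    """Return the whitespace-normalized text an indexer would see."""
    parser = ScriptStripper()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())
```

Because the parser never collects HTML comments outside script elements,
those are left out of the result too, which touches on the comment-tag
question raised earlier in the thread. A filtering proxy would apply this
kind of stripping to each page it hands to the indexer; htdig itself would
only need its 'http_proxy' attribute pointed at that proxy, as described
above.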



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:26:51 PST