Subject: RE: [htdig] How do I access parse_doc.pl.gz?
From: Wayne Larmon (email@example.com)
Date: Sat Dec 11 1999 - 18:50:35 PST
> -----Original Message-----
> From: Geoff Hutchison [mailto:firstname.lastname@example.org]
> Sent: Saturday, December 11, 1999 9:16 PM
> To: Wayne Larmon
> Cc: email@example.com
> Subject: Re: [htdig] How do I access parse_doc.pl.gz?
> On Sat, 11 Dec 1999, Wayne Larmon wrote:
> > http://www.htdig.org/files/contrib/parsers/ directory because
> is is stored
> > as parse_doc.pl.gz.
> > "gunzip parse_doc.pl.gz" gives this error:
> > gunzip: parse_doc.pl.gz: not in gzip format
> This sounds like there was some sort of transfer error (like it was
> transferred as ASCII instead of binary). How did you download it? Have you
> tried going through ftp?
Yeah, that was it. I was downloading it with Internet Explorer 5 and it
must have transferred as ASCII. I switched to WS_FTP and then it
"gunzipped" without complaint.
This is peculiar, because I use IE5 to download tar.gz files all the time
with no problems.
Anyway, I configured parse_doc.pl to use the xpdf programs and it indexes
fine. The text shows up in the search results as text, not displayed as
binary gibberish like Acroread does.
To unsubscribe from the htdig mailing list, send a message to
You will receive a message to confirm this.
This archive was generated by hypermail 2b28 : Sat Dec 11 1999 - 19:05:12 PST