RE: [htdig] parsing PDF with NT

Subject: RE: [htdig] parsing PDF with NT
From: Stéphane Baudet (
Date: Mon Mar 06 2000 - 23:57:38 PST

Well, to compile it, I just did it from the cygwin bash shell the classic
unix way, I mean : "sh configure" to use autoconf, and then "make". I didn't
use "make install", but i copied the binaries and the htdig.conf file at the
place where they should be. I compiled it on a Pentium pro 200 server with
NT server 4.0. You must first install cygwin B20.1 that you can find on . Actually, only the cygwin1.dll may be
I zipped my files and I also put my own htdig.conf that other users may
modify to put their own path for the external parser. In my search, I found
a slightly modified dubbed as, which works well
with NT and xPDF 9.0. I also put pdfinfo.exe and pdftotext.exe from xPDF
9.0. I put my binaries in c:\opt\www\cgi-bin and c:\opt\www.htdig\bin .The
zip file contains c:\opt\www\cgi-bin and c:\opt\www\htdig . NT users should
change the database path in htdig.conf. I put it on c:\opt\www\htdig\db, but
with cygwin syntax, if the DB must be, for instance, on d:\www\mydb, just
change the database dir line in htdig.conf with the following path :
d:/www/mydb or //d/www/mydb .
I think that's all, but how can I upload my zip file on ?

Stephane Baudet.

-----Message d'origine-----
De : Gilles Detillieux []
Envoyé : lundi, mars 06, 2000 8:50 PM
À : Stephane_Baudet
Cc : Gilles Detillieux;
Objet : Re: [htdig] parsing PDF with NT

According to Stéphane Baudet:
> Yess it works now !! I just added "wb" as the second argument, then
> recompiled htdig. And now it works perfectly ! HtDig retrieves the correct
> size for temporary files and all works well in the best world !
> To parse the PDF files, I used with the following line in
> htdig.conf :
> external_parsers: application/pdf->text/html "d:/perl/bin/perl.exe
> /opt/www/htdig/bin/"

Hi, Stéphane. Glad that worked. To build htdig under Cygwin, did
you need to make any other changes than that one fopen() call? If so,
could you send patches, for the record.

Also, if your binaries were built for a fairly generic configuration,
I think it would be great to put them up on the ftp/web site for others
to use. Would you be willing to package them up to distribute to others?

Gilles R. Detillieux              E-mail: <>
Spinal Cord Research Centre       WWW:
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

------------------------------------ To unsubscribe from the htdig mailing list, send a message to You will receive a message to confirm this.

------------------------------------ To unsubscribe from the htdig mailing list, send a message to You will receive a message to confirm this.

This archive was generated by hypermail 2b28 : Tue Mar 07 2000 - 00:11:23 PST