Re: [htdig3-dev] modified doc2html.pl does PDF subject and keywords


Subject: Re: [htdig3-dev] modified doc2html.pl does PDF subject and keywords
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Fri Sep 08 2000 - 09:00:20 PDT


According to GregHolmes@aol.com:
> Don't know if this would be of interest to anybody, but I have (with
> help) modified the doc2html.pl script to support extracting the
> "subject" and "keywords" of a PDF and contruct the appropriate meta
> tags so PDF excerpts will look nice and keywords exist.
>
> Or is this so obvious it has been done already?

I think it might only be obvious to someone who's done a lot of work with
PDF files and external parsers or converters. I had thought of adding that
capability to conv_doc.pl, but as I don't use those fields in my PDFs, I
didn't think it was worth my time to do it. I suspect other users may feel
differently, so your patches would probably be appreciated.

The author and maintainer of doc2html is David Adams
<D.J.Adams@soton.ac.uk>, so you may want to give him a copy of your
changes, and/or post the patch to htdig@htdig.org.

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

------------------------------------ To unsubscribe from the htdig3-dev mailing list, send a message to htdig3-dev-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Fri Sep 08 2000 - 09:02:17 PDT