parsepdf.pl -- yet another pdf parser
-------------------------------------

Hi,

maybe you will find this useful. This Perl script can be used by htdig
as an external parser for PDF files.

cu, Stefan
-- 
Stefan Nehlsen | ParlaNet Administration | sn@parlanet.de | +49 431 988-1260

# parse pdf files for htdig
#
# - generate anchor tags
# - do site-specific rewriting of the URL into a title
#   for missing or bad titles
# - I suppose it is faster than parse_doc.pl
#
# based on:
# - htdig documentation
# - parse_doc.pl
# - pdftodig.py (http://po.gaillard.free.fr/pdftodig.py)
#
# Stefan Nehlsen sn@parlanet.de