[htdig] Indexing Images using external parsers


Subject: [htdig] Indexing Images using external parsers
From: Rzepa, Henry (h.rzepa@ic.ac.uk)
Date: Sun Mar 05 2000 - 09:04:37 PST


We noted with interest that htdig version 3.2 does not appear to have any
configuration flags for invoking an external image parser. Are we correct?

Given that GIF and particularly PNG can have hidden text content fields,
it might be of interest to include the indexing of these types of files
for some sites.
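
To illustrate the kind of hidden text meant here, the following is a minimal sketch (not the parser we actually use) that pulls the uncompressed tEXt chunks out of a PNG file. The function names are hypothetical, chunk CRCs are not verified, and the compressed zTXt and international iTXt chunk types are ignored for brevity.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def iter_png_chunks(data):
    """Yield (chunk_type, chunk_body) pairs from raw PNG bytes.

    Each chunk is: 4-byte big-endian length, 4-byte type, body, 4-byte CRC.
    The CRC is skipped, not checked, in this sketch.
    """
    if data[:8] != PNG_SIGNATURE:
        raise ValueError("not a PNG file")
    pos = 8
    while pos + 8 <= len(data):
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        yield ctype, body
        if ctype == b"IEND":
            break
        pos += 12 + length  # length field + type + body + CRC


def extract_png_text(data):
    """Return a list of (keyword, text) pairs from tEXt chunks.

    A tEXt body is a Latin-1 keyword, a NUL separator, then Latin-1 text.
    """
    pairs = []
    for ctype, body in iter_png_chunks(data):
        if ctype == b"tEXt":
            keyword, _, text = body.partition(b"\x00")
            pairs.append((keyword.decode("latin-1"), text.decode("latin-1")))
    return pairs
```

Reading a file in binary mode and passing its bytes to extract_png_text() gives keyword/text pairs that could then be handed to an indexer as ordinary words.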

We have in fact modified the htdig source code to do this, invoking an
external parser for the purpose. We are not yet sure how this will scale to
sites containing a very large number of images. It is of course also possible
to pass the content extracted from a GIF to other parsers for "added"
value.

-- 

Henry Rzepa. Imperial College, Chemistry Dept. +44 171 594 5774 (Office) +44 171 594 5804 (Fax)



