Re: [htdig] avoiding binary attachments when indexing email archive s


Subject: Re: [htdig] avoiding binary attachments when indexing email archive s
From: David Robley (huntsman@www.nisu.flinders.edu.au)
Date: Wed Feb 02 2000 - 15:46:25 PST


On 2 Feb, Brett Dikeman wrote:
> what's the best way to avoid attachments in archived email?
> Otherwise, fuzzy searches end up including random "words" made up of
> many random characters, drawn from what htdig considered "text"; I
> can find it in emails people sent that included binary attachments.

If you are using Mhonarc; it seems it saves binary attachments as files
named binnnnnn.bin and Word docs as docnnnnn.doc (where n is an
integer). You could then exclude such files from the search.

Cheers

-- 
David Robley                        | WEBMASTER & Mail List Admin
RESEARCH CENTRE FOR INJURY STUDIES  | http://www.nisu.flinders.edu.au/
AusEinet                            | http://auseinet.flinders.edu.au/
            Flinders University, ADELAIDE, SOUTH AUSTRALIA

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Wed Feb 02 2000 - 15:49:08 PST