[htdig3-dev] external parser for HTDig-3.1.2


kimsg (kimsg@mail.techsoft.co.kr)
Wed, 30 Jun 1999 17:35:36 +0900


Hi Digger,

This is an external parser for HTDig 3.1.2. It is very useful if you want to
parse MS-OFFICE document files.

* Supported file formats.
  - MS-WORD 8.0 (versions above MS-WORD 5.0)
  - MS-EXCEL 95 & 97
  - MS-POWERPOINT 97 (the PPT 95 format cannot be handled. If you know this
format, please tell me.)
  - English & Korean versions of MS-OFFICE are supported.

* How to use the external parser with HTDig.
  1. Please add the following MIME types to the /etc/mime.types file, if you
use the Apache web server.
        application/msword doc
        application/msexcel xls
        application/mspowerpoint ppt
  2. Please add the external parser to the htdig.conf file.
        external_parsers: "application/msword" "/opt/www/htdig/parsers/htparsedoc"\
                                "application/msexcel" "/opt/www/htdig/parsers/htparsedoc"\
                                "application/mspowerpoint" "/opt/www/htdig/parsers/htparsedoc"
  3. Please check the max_doc_size value in the htdig.conf file. If this value
is lower than the actual document file size, the external parser may go into
an endless-execution state.

  4. Please add the following lines at line 532 of Document.cc.
        <----
                    mystrncasecmp("application/msword", token, 18) != 0 &&
                    mystrncasecmp("application/msexcel", token, 19) != 0 &&
                    mystrncasecmp("application/mspowerpoint", token, 24) != 0 &&
        --->
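
The three lines in step 4 extend a Content-Type test with case-insensitive
prefix comparisons. A minimal sketch of that comparison follows, under some
assumptions: POSIX strncasecmp stands in for ht://Dig's mystrncasecmp, and the
helper names is_office_content_type and self_check are hypothetical, not part
of Document.cc. The helper returns true on a match, which is the opposite
sense of the "!= 0 &&" chain in the patch.

```cpp
#include <cassert>
#include <strings.h>  // POSIX strncasecmp, a stand-in for mystrncasecmp

// The lengths hard-coded in the patch must equal the length of each string.
static_assert(sizeof("application/msword") - 1 == 18, "msword length");
static_assert(sizeof("application/msexcel") - 1 == 19, "msexcel length");
static_assert(sizeof("application/mspowerpoint") - 1 == 24, "mspowerpoint length");

// Hypothetical helper: true when the Content-Type token begins with one of
// the three MIME types from the patch (case-insensitive prefix match).
bool is_office_content_type(const char *token)
{
    if (token == nullptr)
        return false;
    return strncasecmp("application/msword", token, 18) == 0 ||
           strncasecmp("application/msexcel", token, 19) == 0 ||
           strncasecmp("application/mspowerpoint", token, 24) == 0;
}

// Small usage check; call it from main() when trying the sketch.
void self_check()
{
    assert(is_office_content_type("application/msword"));
    assert(is_office_content_type("Application/MSExcel"));
    assert(!is_office_content_type("text/html"));
}
```

Because only a prefix is compared, a token with trailing parameters after the
MIME type would still match in this sketch.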

* License
  - This tsparser is not released under the GNU license; it is commercial
software. However, you may use it free of charge on a non-commercial web site.
  - This tsparser is only allowed to be used with HTDig. If you want to use it
with another program, you have to contact techsoft.

Powered by GNU & FSF.

HomePage : www.techsoft.co.kr



