htdig: Can't indes postscript files with htdig 3.1.0b4


Herve Lefebvre (hlefebvre@easynet.fr)
Sun, 24 Jan 1999 01:14:34 +0100


Hello,

I tried a lot of things, but htdig refuses to index
the content of ps files :-(

Size of .ps files ares less than the max_doc_size in the .conf file

I ran htding with -v -v -v option, here are some of the outputs:
-------------------------------

New server: 127.0.0.1, 80
Retrieval command for http://127.0.0.1/robots.txt: GET /robots.txt
HTTP/1.0^M
User-Agent: htdig/3.1.0b4 (unconfigured@htdig.searchengine.maintainer)^M
Host: 127.0.0.1^M
^M
New server: 127.0.0.1, 80
Retrieval command for http://127.0.0.1/robots.txt: GET /robots.txt
HTTP/1.0^M
User-Agent: htdig/3.1.0b4 (unconfigured@htdig.searchengine.maintainer)^M
Host: 127.0.0.1^M
^M
Header line: HTTP/1.1 404 Not Found
Header line: Date: Sat, 23 Jan 1999 23:49:33 GMT
Header line: Server: Apache/1.3.3 (Unix) (Red Hat/Linux)
Header line: Connection: close
Header line: Content-Type: text/html
Header line:
returnStatus = 1
pick: 127.0.0.1:80, # servers = 1
0:0:0:http://127.0.0.1/: Retrieval command for http://127.0.0.1/: GET /
HTTP/1.0^M
User-Agent: htdig/3.1.0b4 (unconfigured@htdig.searchengine.maintainer)^M
Host: 127.0.0.1^M

(......snip.........)

+A tag: pos = 2, position = ="./b2.ps">
href: http://127.0.0.1/b2.ps (book 2)
resolving 'http://127.0.0.1/b2.ps'
                                                                                  
pushing http://127.0.0.1/b2.ps

(.....snip...........)

pick: 127.0.0.1:80, # servers = 1
5:3:1:http://127.0.0.1/b2.ps: Retrieval command for
http://127.0.0.1/b2.ps: GET /b2.ps HTTP/1.0^M
User-Agent: htdig/3.1.0b4 (unconfigured@htdig.searchengine.maintainer)^M
Referer: http://127.0.0.1/^M
Host: 127.0.0.1^M
^M Header line: HTTP/1.1 200 OK
Header line: Date: Sat, 23 Jan 1999 23:49:40 GMT
Header line: Server: Apache/1.3.3 (Unix) (Red Hat/Linux)
Header line: Last-Modified: Fri, 22 Jan 1999 19:27:30 GMT
Translated Fri, 22 Jan 1999 19:27:30 GMT to ven, 22 jan 1999 19:27:30
(99)
And converted to ven, 22 jan 1999 19:27:30
Header line: ETag: "8fbc-1f50e-36a8d122"
Header line: Accept-Ranges: bytes
Header line: Content-Length: 128270
Header line: Connection: close
Header line: Content-Type: application/postscript
Header line:
returnStatus = 0

Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 5390 from document
Read a total of 128270 bytes
 size = 128270
pick: 127.0.0.1:80, # servers = 1

(.........)

and after, the htsearch doesn't find any word contained in the .ps file,
I tried with various .ps , including the sybase documentation.
Some .ps were in french, and other in english.

I've no problem to index .html file

Any ideas ?

Thanks

-- 
Herve Lefebvre
aegir@mail.dotcom.fr
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-request@sdsu.edu containing the single word "unsubscribe" in
the body of the message.



This archive was generated by hypermail 2.0b3 on Mon Jan 25 1999 - 08:15:24 PST