Re: [htdig] converter


Subject: Re: [htdig] converter
From: D.J.Adams@soton.ac.uk
Date: Tue Aug 29 2000 - 09:15:03 PDT


>
> Hello David,
>
> thanX for your feedback.
>
> I have not played with ppHtml, as this was no requirement in my project -
> thus I do not have any experience of it failing...

I found out that the "Not enough space" message was only produced by
files so large that they exceeded the maximum document size set in the
config file, and so htdig was supplying a truncated copy. A strange
message from pptHtml, but otherwise not a fault.

>
> But you might want to talk to the author of xlHtml, Steve Grubb
> (linux_4ever@yahoo.com) and tell him about those problems you experienced.
> Perhaps you can even persuade him to include Hyperlink support....
>

If I had immediate plans to index .xls files then I would contact the
author, but as I'm not using xlHtml at the moment I don't think it
appropriate for me to do so.

> If you could send me the new version of doc2html.pl, I'd be very
> gratefull - as my XLS support is not very elegantly coded...

I'll send you a copy (and upload it to the htdig site) just as soon as
I have stopped tinkering with it. Probably next week.

>
> Regards,
> Sven
>
> ----- Original Message -----
> From: <D.J.Adams@soton.ac.uk>
> To: "Sven Haberer" <svenl@haberer-online.de>
> Cc: <htdig@htdig.org>
> Sent: Tuesday, August 29, 2000 2:10 PM
> Subject: Re: [htdig] converter
>
>
> .......
> > > Hi,
> > >
> > > just having gone through the same problem, I used xlHtml to index Excel
> > > files. For this I had to change the parse_doc file, it can be found
> together
> > > with the instructions at the adress below:
> > > http://www.haberer-online.de/htdig/default.htm
> > >
> > > xlHtml also has an option to convert MS powerpoint, but I did not take a
> > > look at this.
> > >
> > > Hope that helps,
> > > Sven
> > >
> >
> > Sven,
> > Thanks for this very useful tip.
> >
> > I've tried xlHtml (version 0.2.7.2) and it seems at least as good
> > xls2csv, the converter that comes with catdoc, though it could be
> > better:
> >
> > option handling seems flaky.
> >
> > HTML output can be generated, but hyperlinks in spread sheets
> > are not marked up as links.
> >
> > I've also tried ppHtml which converts PowerPoint files to HTML, and it
> > seems adequate as a converter. While indexing our web pages using
> > doc2html.pl it processed about a hundred .ppt files ok, and failed on
> > three with the message "Not enough space". (As I'm using a sizable IRIX
> > system with plenty of memory and disk space I don't know why I should
> > get such a message.)
> >
> > The next version of doc2html.pl will include examples of using both
> > pptHTML and xlHtml as converters. I should be releasing it sometime in
> > September.
> >
> > --
> >
> > David J Adams
> > <D.J.Adams@soton.ac.uk>
> > Computing Services
> > University of Southampton
>
>

-- 
 
David J Adams
<D.J.Adams@soton.ac.uk>
Computing Services
University of Southampton

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this. List archives: <http://www.htdig.org/mail/menu.html> FAQ: <http://www.htdig.org/FAQ.html>



This archive was generated by hypermail 2b28 : Tue Aug 29 2000 - 09:16:12 PDT