Subject: [htdig] Re: indexing Flash (was: excluding page section...)
Date: Wed Jan 17 2001 - 08:41:03 PST
Theoretically, Flash is supposed to put links and text into the HTML
file if you check those options. Unfortunately, it sticks them in
comment fields. I've had inconsistent behavior with getting it to do
Macromedia did publish a Flash file access API or something, but it's
not open source as far as I know.
I'm working on a report on indexing Flash, so if anyone has a
text-heavy example, I'd love to see it!
At 1:04 PM +0100 1/17/01, Torsten Neuer wrote:
> > > > Is there a possibility to index Shockwave Flash files?
>> > This is a bit harder. I searched the web for an existing parser but
>> > only
>> > found some more-or-less useful docs and one generic parser.
>> > This generic parser (see attachment) can easily be used within a wrapper
>> > script to at least extract links from a flash menu, which in my opinion
>> > is
>> > the most requested feature.
>> Thanx for this one, but I'll need a bit more time to check it.
>> Anway, extracting links is not enough, i think. keywords or full text
>> index are needed.
>Well, full text index should also be possible, but requires some more
>work on the parser. The attached one is just a very generic one which
>dumps all the different record entries of a flash file. It is not de-
>signed to be an axternal parser for Ht://Dig, but it works well with
>the shell wrapper to extract links from flash menus. With some addi-
>tional work it shoudl be possible to produce a fully fledged external
>parser out of it (yet, I haven't found the time nor did I have some
>projects depending on that).
-- _________________________________________________ Complete Guide to Search Engines for Web Sites, Intranets, and Portals: <http://www.searchtools.com>
------------------------------------ To unsubscribe from the htdig mailing list, send a message to email@example.com You will receive a message to confirm this. List archives: <http://www.htdig.org/mail/menu.html> FAQ: <http://www.htdig.org/FAQ.html>
This archive was generated by hypermail 2b28 : Wed Jan 17 2001 - 08:55:43 PST