htdig for the www virtual library?

Gabriel Fenteany
Mon, 26 Apr 1999 23:09:10 -0400

Hello. The WWW Virtual Library is a distributed-responsibility Web index
of 288 separate sites linked to a central catalog
spanning a huge range of topics, academic and not. It was started by Tim
Berners-Lee himself and was once partly served from the W3C. It has been
around since 1991 and comprises all sorts of individual sites: some are
static HTML and some, far fewer, are databases such as Links2. We would
like to implement a search engine that indexes all of the HTML and text
files on all 288 sites, tens of thousands of documents in all. There is little
hope that the sites will be unified around one format in the near future,
and this may not be desirable for us anyway, since each site serves a core
of specialists with differing needs. However, we'd like to put in place at
least a broad, top-level search engine that gets the user to the right page
of links on the right site.

We'd like to know whether you think ht://Dig would be a good solution for
this.

Any and all comments would be very much appreciated.



Gabriel Fenteany, Ph.D.
Maintainer, WWW Virtual Library: Cell Biology

