loic@ceic.com
Fri, 16 Jul 1999 14:02:52 +0200 (MEST)
Gabriele Bartolini writes:
> je suis Gabriele ... and I write you from Italy. I am building up a
Ciao Gabriele :-)
> program that checks a site and stores all the tags it finds in every
> document crawled. I have argued with Geoff many times about the best
> database to use ... We concluded that the best solution was to use
> MySQL. I read in your previous message to htdig3-dev that you have
> encountered some problems with it. Can you please give me more details about
> them? I have already written a simple MySQL interface on top of the C
> API.
There are three main problems with MySQL:
. No transactions
. No locks
. Indexed fields are limited to 255 characters
Also, you must take care to use the 'binary' keyword if you want your
searches to be case sensitive. If you can cope with these three limitations,
MySQL is very fast and compact.
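
To make the last two points concrete (the 255-character limit on indexed fields and the 'binary' keyword), here is a minimal Python sketch. It only builds the SQL text and never talks to a server. The table name tag, the column name and both helper functions are hypothetical, and the exact syntax and limits should be checked against the MySQL version in use.

```python
# Sketch only: builds MySQL statements as strings, no server connection.
# The table and column names (tag, name) are made up for illustration.

MAX_INDEXED_CHARS = 255  # limit on indexed fields mentioned above


def create_table_sql(case_sensitive=True):
    """Return a CREATE TABLE statement for a hypothetical tag table."""
    # The BINARY column attribute asks for case-sensitive comparisons.
    binary = " BINARY" if case_sensitive else ""
    return (
        "CREATE TABLE tag ("
        f"name VARCHAR({MAX_INDEXED_CHARS}){binary} NOT NULL, "
        "INDEX (name))"
    )


def index_key(value):
    """Truncate a value so that it fits in the indexed column."""
    return value[:MAX_INDEXED_CHARS]


print(create_table_sql())
print(len(index_key("x" * 300)))
```

Truncating is the simplest way to stay under the limit, but two long values that share their first 255 characters end up with the same key, so the full value would still have to be stored and compared separately.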
> As I am at the beginning of the work, and Geoff and I are now focusing our
> attention on implementing a persistent connection mechanism in ht://dig
> (and ht://check), I would appreciate any suggestions from you.
I didn't know about ht://check. Is it available somewhere?
Are you planning to put some effort into modifying ht://dig so that it
can use an SQL database instead of Berkeley DB?
Cheers,
-- 
Loic Dachary
ECILA
100 av. du Gal Leclerc
93500 Pantin - France
Tel: 33 1 56 96 09 80, Fax: 33 1 56 96 09 61
e-mail: Loic@Dachary.org
URL: http://www.senga.org/