[Chicago-talk] search indexers

JT Smith jt at plainblack.com
Thu Jan 19 15:30:34 PST 2006


What search indexing systems do you guys use?

Here are my requirements:

  - Must be easily installable on all platforms (which rules out stuff like xapian).

  - Must be open source on all platforms (which rules out stuff like mnogo search).

  - Must be able to index more than just web pages (rules out htdig)

  - Must be able to index more than just databases (rules out DBIx::FullTextSearch)

  - Would like it to be able to store Meta Data (rules out MyConText)

  - Is either pure perl or has perl bindings (rules out Lucene)

The only thing I've found so far that matches is Plucene, the Perl port of the Java 
based Lucene engine. I wrote a test to see how it would work out. The API and 
functionality is exactly what I need, but it's very slow. After having indexed 17,000 
documents, a search takes 2 seconds to return 20 results. And that's on hardware that 
looks like:

Dual Xeon 3.4 Ghz with hyperthreading
2 GB RAM
SCSI Hard Disks

So I'm not skimping on hardware.

So my question to you is, have you used anything that would fit the above criteria?


JT ~ Plain Black
ph: 703-286-2525 ext. 810
fax: 312-264-5382
http://www.plainblack.com

I reject your reality, and substitute my own. ~ Adam Savage


More information about the Chicago-talk mailing list