Stemming and analyzers for eXist
As part of the Moses project search engine, I've been looking at the possibility of using a more sophisticated Analyzer for Lucene, to enable stemming in indexing and searching. There is a Lucene analyzer called the SnowballAnalyzer which does stemming for many languages, but unfortunately it can't be used directly in eXist because its constructor requires different parameters than the default constructor for an Analyzer that eXist is expecting. This post suggests a way around this by creating a wrapper class, but this seems a bit complicated, especially in contexts where we expect to rebuild eXist regularly.
However, this post refers to a package that contains stemming-capable analyzers that will work with eXist as it is. However, none of them appears to handle English. So for the moment, we're a bit stuck, but it may be that someone patches eXist so it can use the Snowball analyzer.