Skip to main content.
home | support | download

Back to List Archive

Re: Re: Bug in stemmer.c

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Tue Oct 19 1999 - 17:20:25 GMT
At 09:53 AM 10/19/99 -0700, Roy Tennant wrote:
>All patches are put in the Patch directory at:
>
>http://sunsite.berkeley.edu/SWISH-E/Patches/
>
>(Bill's is "stemmer.c" for example)

Note that the above Patch only contains a bug fix.  The version of
stemmer.c I use has other "adjustments."

For example, I don't stem any words that stem to one or less characters. 
I also call Stem() in a loop until a word no longer stems.  

If this isn't done then, for example,

  "playing" stems to "play"
but
  "play" stems to "plai"

So searching for "playing" fails to find "play" (and "plays", "played").

I guess it's debatable if that's a bug or not.


Bill Moseley
mailto:moseley@hank.org
Received on Tue Oct 19 10:21:23 1999