Skip to main content.
home | support | download

Back to List Archive

Re: index file word-list and fuzzy searching

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Thu Sep 18 2003 - 23:08:06 GMT
On Thu, Sep 18, 2003 at 03:27:51PM -0700, Masoud Pirnazar wrote:
> two related questions
> 
> (1)
> is there a way of searching the list of terms in the index file, e.g. to see
> that
> "MyIndex" has the words (apple, pear, watermelon) in it?  (treating the
> "index" as if it was a dictionary or thesarus)

That's exactly what swish-e does.

> some kind of api such as "start at word >= 'banana'", and "read next" would
> do it (maybe a "give a count of total # words in the index")

The header of each search will tell you the number of words in the 
index.

  # Total Words: 15209
  # Total Files: 1252

> (2)
> is there any kind of fuzzy searching, e.g. "apple" with one spelling error
> accepatable, e.g. "appie" would still match.

Yes.

> (3)
> any support for "near", e.g. "apple" within 3 words of "banana"

No, not yet.


-- 
Bill Moseley
moseley@hank.org
Received on Thu Sep 18 23:08:15 2003