Re: [swish-e] count number of times a word occurs in an index[resolved]

From: Eric Jobidon <eric(at)not-real.NeoPaper.net>
Date: Sat Dec 29 2007 - 04:05:43 GMT
Is it appropriate to interpret the "position data" as a page number? So
"(5/9)" would indicate that the word occurs (at least once) on page 5 of a
nine-page document?
Thanks for the clarification.
Eric

-----Original Message-----
From: users-bounces@lists.swish-e.org
[mailto:users-bounces@lists.swish-e.org] On Behalf Of Eric Lease Morgan
Sent: Friday, December 28, 2007 9:48 PM
To: Swish-e Users Discussion List
Subject: Re: [swish-e] count number of times a word occurs in an
index[resolved]




On Dec 28, 2007, at 7:17 PM, Bill Moseley wrote:
>> How can I use swish-e to count the number of times a word occurs in 
>> an index?
>
>     -T index_words
>     -T index_words_full
>
> probably will tell you in a roundabout way.
>
> hello [1 1 2 (5/9 6/9)] [1 2 1 (5/9)]
> hi [1 2 1 (6/9)]
> there [1 1 1 (7/9)]
>
> Where [<meta id> <file num> <count in file> (position data)]
>
> hello [1 1 2 (5/9 6/9)] [1 2 1 (5/9)]
>
> is in meta 1, file 1 (twice) and file 2 (once).


Okay, this is what I thought. The third space-delimited value in each
square-bracketed element is the number of times the word occurs in that
file. Sum these values across all of a word's elements and you get the total
number of times the word appears in the index.
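
The summing described above could be sketched roughly as follows. This is
only an illustration, assuming the -T index_words output looks exactly like
the sample lines quoted in this thread (a word followed by one or more
"[<meta id> <file num> <count in file> (position data)]" elements). The
names POSTING and total_counts are made up for this example and are not part
of swish-e.

```python
import re
from collections import Counter

# One bracketed element, e.g. "[1 1 2 (5/9 6/9)]":
# meta id, file number, count in file, then position data in parentheses.
# Assumption: this layout is inferred from the sample output in the thread.
POSTING = re.compile(r"\[(\d+)\s+(\d+)\s+(\d+)\s+\(([^)]*)\)\]")


def total_counts(lines):
    """Sum the per-file counts (third value of each element) for every word."""
    totals = Counter()
    for line in lines:
        line = line.strip()
        if "[" not in line:
            continue  # skip blank lines and anything that is not a word entry
        word = line.split("[", 1)[0].strip()
        if not word:
            continue
        for _meta, _file_num, count, _positions in POSTING.findall(line):
            totals[word] += int(count)
    return totals


sample = """\
hello [1 1 2 (5/9 6/9)] [1 2 1 (5/9)]
hi [1 2 1 (6/9)]
there [1 1 1 (7/9)]
"""

totals = total_counts(sample.splitlines())
for word, n in sorted(totals.items()):
    print(word, n)
```

For the sample above, "hello" sums to 3 (2 in file 1 plus 1 in file 2), and
"hi" and "there" each sum to 1. Note that the sketch adds counts across all
meta ids, so whether the result is the number you want may depend on how
metanames are used in your index.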

--
Eric Lease Morgan

_______________________________________________
Users mailing list
Users@lists.swish-e.org
http://lists.swish-e.org/listinfo/users




Received on Fri Dec 28 22:47:19 2007