Skip to main content.
home | support | download

Back to List Archive

Re: NOT broken with RankScheme(1)

From: Peter Karman <peter(at)not-real.peknet.com>
Date: Fri Feb 11 2005 - 21:24:30 GMT
hmm.
this is very surprising to me (as the author of the rank schemes) because 
ranking is done (more or less) after the query has been eval'd.

can you duplicate this behaviour with a small, reproducable test set?

I just tried it like this, without seeing what you're describing.

karpet@cartermac 46% swish-e -i foo.html bar.html -c c
..
2 files indexed.  298 total bytes.  50 total words.


karpet@cartermac 47% swish-e -w 'ocean not sherry' -R 0
# SWISH format: 2.4.3
# Search words: ocean not sherry
# Removed stopwords:
# Number of hits: 1
# Search time: 0.004 seconds
# Run time: 0.032 seconds
1000 foo.html "foo.html" 147
.
karpet@cartermac 48% swish-e -w 'ocean not sherry' -R 1
# SWISH format: 2.4.3
# Search words: ocean not sherry
# Removed stopwords:
# Number of hits: 1
# Search time: 0.031 seconds
# Run time: 0.060 seconds
1000 foo.html "foo.html" 147
.
karpet@cartermac 49% swish-e -w 'ocean' -R 1
# SWISH format: 2.4.3
# Search words: ocean
# Removed stopwords:
# Number of hits: 2
# Search time: 0.004 seconds
# Run time: 0.036 seconds
1000 bar.html "bar.html" 151
1000 foo.html "foo.html" 147
.
karpet@cartermac 50% swish-e -w 'ocean' -R 0
# SWISH format: 2.4.3
# Search words: ocean
# Removed stopwords:
# Number of hits: 2
# Search time: 0.005 seconds
# Run time: 0.032 seconds
1000 bar.html "bar.html" 151
1000 foo.html "foo.html" 147
.

karpet@cartermac 52% cat foo.html
<html>
<body>
my bonny lies over the ocean
my bonny lies over the sea
my bonny refuses to search well
oh bring back my bonny to me
</body>
</html>
karpet@cartermac 53% cat bar.html
<html>
<body>
my sherry lies over the ocean
my sherry lies over the sea
my sherry refuses to search well
oh bring back my sherry to me
</body>
</html>
karpet@cartermac 54% cat c
IgnoreTotalWordCountWhenRanking 0



Mark Maunder wrote on 2/11/05 3:09 PM:
> Hi,
> 
> Firstly tip of the hat to the swish team - the new site rocks, and as
> usual so does the little miracle that it supports. 
> 
> I'm indexing using XML with metanames and
> IgnoreTotalWordCountWhenRanking no
> So when I do a query I'll do something like:
> job=(CEO not assistant)
> 
> It looks like NOT is not notting when using RankScheme(1). I'm switching
> back to RankScheme(0) for now but I'm going to miss RankScheme(1)
> because the sizes of the chunks of text that I index vary wildly and I
> find that the larger chunks float to the top with (0). 
> 
> Regards,
> 
> Mark.
> 
> 

-- 
Peter Karman  .  http://peknet.com/  .  peter(at)not-real.peknet.com
Received on Fri Feb 11 13:24:31 2005