Bill Moseley <moseley@hank.org> wrote:
> On Tue, 17 Dec 2002, Nikolaus Rath wrote:
>
>> I have some questions regarding the ranking of search results:
>>
>> - Does the context of the match influence its ranking? (e.g.,
>> "<h1>match</h1>" ranks higher than "<p>match" when searching for
>> "match"). And if so, which tags are used?
>
> Yes. It's a hole in the documentation, it seems.
[..]
That is what I wanted to know. Thanks a lot.
>> - Does swish-e search the given pattern also in all the meta tags? If
>> so, does a meta match count more than a "normal" match?
>
> Yes.
Hmm. I installed it and tried the following:
----snip----
nikratio:~/test$ ls
config test2.html test.html
nikratio:~/test$ cat config
IndexDir .
IndexReport 1
MetaNames key
nikratio:~/test$ cat test.html
<html>
<head>
<meta name="key" content="pattern">
</head>
<body>
some words blub foo bar
</body>
</html>
nikratio:~/test$ cat test2.html
<html>
<head>
<meta name="key" content="pattern">
</head>
<body>
some words blub foo bar
pattern
</body>
</html>
nikratio:~/test$ swish-e -c config
Indexing Data Source: "File-System"
Indexing "."
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 14 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
14 unique words indexed.
4 properties sorted.
5 files indexed. 401435 total bytes. 23 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!
nikratio:~/test$ swish-search -w "pattern"
# SWISH format: 2.2.1
# Search words: pattern
# Number of hits: 1
# Search time: 0.000 seconds
# Run time: 0.032 seconds
1000 ./test2.html "test2.html" 113
.
nikratio:~/test$ swish-search -w "key=pattern"
# SWISH format: 2.2.1
# Search words: key=pattern
# Number of hits: 2
# Search time: 0.000 seconds
# Run time: 0.032 seconds
1000 ./test2.html "test2.html" 113
1000 ./test.html "test.html" 105
.
----snip----
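It seems that swish-e does not search the meta fields by default: the
plain query "pattern" finds only test2.html (where the word is in the
body), while "key=pattern" finds both files. So a meta match only
happens when the meta name is explicitly requested in the query.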
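
To search the body and the meta field in one query, something like the
following might work. This is only a sketch: build_query is a
hypothetical helper (not part of swish-e), and it assumes the query
syntax accepts "or" and the metaname=(word) form. I have not run the
resulting query against the index above.

```shell
# Hypothetical helper, not part of swish-e: build a query string that
# looks for WORD in the default (body) index and in each given metaname.
# Usage: build_query WORD [METANAME...]
build_query() {
    word=$1
    shift
    query=$word
    for meta in "$@"; do
        # Assumption: swish-e accepts "or" and the metaname=(word) form.
        query="$query or $meta=($word)"
    done
    printf '%s\n' "$query"
}

# Print the combined query for the word "pattern" and the metaname "key".
build_query pattern key
```

The string it prints could then be passed to the -w option in place of
the plain "pattern" query used in the transcript.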
--Nikolaus
Received on Wed Dec 18 00:37:25 2002