Joseph Norris wrote on 6/29/12 6:12 PM:
> I have the following:
>
> swish-e -w "China Confirms Gold Price Suppression" -s names desc title
> desc -f /home/cs/sw/swishe/gold/i
> ndex.swish-e
>
> No hits
>
> however when I do this:
>
> grep -l "China Confirms Gold Price Suppression" /home/cs/sw/swishe/gold/*
>
> I get a hit
Try reducing your sample size to one doc, with the -T options as David suggests.
Here's a small self-contained example. I suggest you put together a similar
example of your own, which IME will often reveal the problem.
[karpet(at)not-real.pekmac:~/tmp/s]$ swish-e -c conf
Indexing Data Source: "File-System"
Indexing "docs"
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 8 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
8 unique words indexed.
6 properties sorted.
1 file indexed. 96 total bytes. 8 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!
[karpet(at)not-real.pekmac:~/tmp/s]$ swish-e -w "China Confirms Gold Price Suppression" -s
names desc title
# SWISH format: 2.5.8
# Search words: China Confirms Gold Price Suppression
# Removed stopwords:
# Number of hits: 1
# Search time: 0.002 seconds
# Run time: 0.009 seconds
1000 docs/china.xml "china.xml" 96
.
[karpet(at)not-real.pekmac:~/tmp/s]$ cat docs/china.xml
<doc>
<title>China Confirms Gold Price Suppression</title>
<names>foo bar baz</names>
</doc>
[karpet(at)not-real.pekmac:~/tmp/s]$ cat conf
IndexReport 1
IndexOnly .xml
IndexContents XML* .xml
IndexDir docs
IndexFile index.swish-e
ParserWarnLevel 1
IgnoreTotalWordCountWhenRanking yes
PropertyNames names title
--
Peter Karman . http://peknet.com/ . peter(at)not-real.peknet.com
_______________________________________________
Users mailing list
Users(at)not-real.lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Sun Jul 08 2012 - 01:33:49 GMT