Re: [swish-e] Ok - I do not understand what the problem is

From: Joseph Norris
Date: Thu, 12 Jul 2012 14:14:58 -0700
Thank you for this suggestion - I will do just this.

On 07/07/2012 06:33 PM, Peter Karman wrote:
> Joseph Norris wrote on 6/29/12 6:12 PM:
>> I have the following:
>>
>> swish-e -w "China Confirms Gold Price Suppression" -s names desc title desc -f /home/cs/sw/swishe/gold/index.swish-e
>>
>> No hits
>>
>> however when I do this:
>>
>> grep -l "China Confirms Gold Price Suppression" /home/cs/sw/swishe/gold/*
>>
>> I get a hit
> Try reducing your sample size to one doc, with the -T options as David suggests.
>
> Here's a small self-contained example. I suggest you put together a similar
> example of your own, which IME will often reveal the problem.
>
> [karpet@pekmac:~/tmp/s]$ swish-e -c conf
> Indexing Data Source: "File-System"
> Indexing "docs"
> Removing very common words...
> no words removed.
> Writing main index...
> Sorting words ...
> Sorting 8 words alphabetically
> Writing header ...
> Writing index entries ...
>    Writing word text: Complete
>    Writing word hash: Complete
>    Writing word data: Complete
> 8 unique words indexed.
> 6 properties sorted.
> 1 file indexed.  96 total bytes.  8 total words.
> Elapsed time: 00:00:00 CPU time: 00:00:00
> Indexing done!
>
> [karpet@pekmac:~/tmp/s]$ swish-e -w "China Confirms Gold Price Suppression" -s names desc title
> # SWISH format: 2.5.8
> # Search words: China Confirms Gold Price Suppression
> # Removed stopwords:
> # Number of hits: 1
> # Search time: 0.002 seconds
> # Run time: 0.009 seconds
> 1000 docs/china.xml "china.xml" 96
> .
>
> [karpet@pekmac:~/tmp/s]$ cat docs/china.xml
> <doc>
>   <title>China Confirms Gold Price Suppression</title>
>   <names>foo bar baz</names>
> </doc>
>
> [karpet@pekmac:~/tmp/s]$ cat conf
> IndexReport 1
> IndexOnly .xml
> IndexContents XML*  .xml
> IndexDir docs
> IndexFile index.swish-e
> ParserWarnLevel 1
> IgnoreTotalWordCountWhenRanking yes
> PropertyNames names title
>
>
>
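
For anyone wanting to reproduce the quoted one-document test case, here is a rough shell sketch. The document and config contents are copied from Peter's message; the scratch directory name swish-test is my own assumption, and the swish-e steps run only if the binary is on the PATH.

```shell
#!/bin/sh
# Recreate the single-document test case from the quoted message.
# "swish-test" is a hypothetical scratch directory, not from the thread.
set -eu

workdir=swish-test
mkdir -p "$workdir/docs"

# The one sample document, as shown in the quoted `cat docs/china.xml`.
cat > "$workdir/docs/china.xml" <<'EOF'
<doc>
  <title>China Confirms Gold Price Suppression</title>
  <names>foo bar baz</names>
</doc>
EOF

# The config, as shown in the quoted `cat conf`.
cat > "$workdir/conf" <<'EOF'
IndexReport 1
IndexOnly .xml
IndexContents XML*  .xml
IndexDir docs
IndexFile index.swish-e
ParserWarnLevel 1
IgnoreTotalWordCountWhenRanking yes
PropertyNames names title
EOF

if command -v swish-e >/dev/null 2>&1; then
    # Run in a subshell so the relative paths in conf resolve
    # and the caller's working directory is left unchanged.
    (
        cd "$workdir"
        swish-e -c conf
        swish-e -w "China Confirms Gold Price Suppression" -s names desc title
    )
else
    echo "swish-e not found; test files created in $workdir"
fi
```

If the search finds the document here but not against the real index, the difference between this config and the real one is a reasonable place to start looking.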

-- 
#Joseph Norris (Linux/Apache/Mysql/Perl - what else is there?)
print @c=map chr $_+100,(6,17,15,16,-68,-3,10,11,
16,4,1, 14,-68,12,1,14,8, -68,4,-3,-1,7,1,14,-68,
-26,11,15,1,12, 4,-68,-22,11,14,14,5,15,-90);



Received on Thu Jul 12 2012 - 21:15:02 GMT