Skip to main content.
home | support | download

Back to List Archive

Re: Problem swish-e not finding words present in index solved???

From: John P. Rouillard <rouilj(at)not-real.cs.umb.edu>
Date: Wed Sep 03 2003 - 18:49:12 GMT
I wrote in a prior message:
> I have tried the simple example in the manual:
>[...] 
> % /tools/swish_e-2.4.0_pr1/bin/swish-e -i test.html -T indexed_words
> 
>   Indexing Data Source: "File-System"
>   Indexing "test.html"
>   ...
>     Adding:[1:swishdefault(1)]   'guest'   Pos:172  Stuct:0x9 ( BODY FILE )
>     Adding:[1:swishdefault(1)]   'guest'   Pos:206  Stuct:0x9 ( BODY FILE )
>     Adding:[1:swishdefault(1)]   'guest'   Pos:235  Stuct:0x9 ( BODY FILE )
>     Adding:[1:swishdefault(1)]   'guest'   Pos:245  Stuct:0x9 ( BODY FILE )
>   ...
>   Removing very common words...
>   no words removed.
>   Writing main index...
>   Sorting words ...
>   Sorting 141 words alphabetically
>   Writing header ...
>   Writing index entries ...
>     Writing word text: Complete
>     Writing word hash: Complete
>     Writing word data: Complete
>   141 unique words indexed.
>   4 properties sorted.                                              
>   1 file indexed.  3085 total bytes.  241 total words.
>   Elapsed time: 00:00:00 CPU time: 00:00:00
>   Indexing done!
> 
> Which shows that guest is being indexed as swishdefault.
> 
>   % /tools/swish_e-2.4.0_pr1/bin/swish-e -w guest
>   # SWISH format: 2.4.0-pr1
>   # Search words: guest
>   # Removed stopwords: 
>   # Number of hits: 1
>   # Search time: 0.001 seconds
>   # Run time: 0.023 seconds
>   1000 test.html "TWiki security setup." 3085
> 
> So the simple test case works. Doing a guest search on the entire 
> directory tree returns no hits, my config file is:
> 
>   IndexName "Majordomo Mailing list archives"
>   IndexDescription "Index of Majordomo mailing list archives"
>   IndexPointer "http://XXXXXX/mailing-lists"
>   IndexAdmin "admin@example.com
> 
>   IndexDir /tools/swish_e-2.1dev25/lib/swish-e/progs/index_hypermail.pl
>   IndexFile /data/www/swish-e/hypermail.idx
> 
>   SwishProgParameters /data/www/mailing-lists/*
>   ReplaceRules replace "/data/www/" "http://XXXXX/"
>   MetaNames swishtitle name email
>   PropertyNames name email
>   PropertyNamesDate sent
>   IndexContents HTML2 .html
>   StoreDescription HTML2 <body> 100000
>   UndefinedMetaTags  ignore
> 
>   IncludeConfigFile /home/jrouilla/develop/search/filters.cf
> 
>   MetaNames swishdocpath
>   MetaNameAlias swishtitle title
>   MetaNameAlias swishdocpath path

For some reason the MetaNameAlias swishtitle title is causing a
problem.  If I comment it out, the indexing (and searching) works as I
expect, and guess is indexed into the swishdefault metaname. With that
line there, it is put into the swishtitle metaname.

Any idea why this is? Am I missing something in my config file?

				-- rouilj
John Rouillard
===========================================================================
My employers don't acknowledge my existence much less my opinions.

------- End of Forwarded Message
Received on Wed Sep 3 18:49:39 2003