At 04:30 AM 12/21/2001 -0800, Zambra - Michael wrote:
>The index contains a page with the word "Camarón".
>If I search for "Camarón" the search engine shows the hit, but without the
accented character. Bill pointed out that the indexer might still working
wrong because it was indexing "camar" and "n" and interpreting "ó" as a
blank. I don't think so, because the engine is unable to find "camar" or "n".
It was never an issue with the script as far as I know, but rather wrong
default WordCharacters in swish. I updated that on Dec 7th.
Are you sure you don't have any WordCharacters settings in your config?
Are you sure you are really using a newer version of swish?
Doe you get the same results:
~/swish_archive %./swish-e -T index_header | fgrep WordCh
The swish-e list archive script is the one that's in the current swish-e
distribution, too. Go there and search for Camarón
Here's the entire config file for indexing the archives:
MetaNames swishtitle name email
PropertyNames name email
IndexContents HTML2 .html
StoreDescription HTML2 <body> 100000
So I'm using the default settings in swish for WordChars.
Check your versions once again.
I really doubt that your locale setting would effect this, but if all else
Received on Fri Dec 21 15:26:58 2001