Re: Question on searching for double quote chars...

From: David Wood <dwood(at)not-real.inter.nl.net>
Date: Wed Oct 15 2003 - 14:06:44 GMT
Hi folks,

Argh, I think I can answer my own question...  The double quote is in 
WordCharacters (see the config below), but it isn't in EndCharacters. I 
haven't specified EndCharacters, so it takes its default from src/config.h. 
As a result the word '17"', which ends in a double quote, is considered 
illegal and gets rejected.
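
If that diagnosis is right, one possible fix is to set EndCharacters 
explicitly in the config file so that a word may end in a double quote. This 
is an untested sketch: the character list is an illustrative assumption, not 
the actual default from src/config.h, and should be adjusted to match the 
WordCharacters in use.

```
# Assumption: illustrative list, not the src/config.h default.
# Let words end in a letter, a digit, or a double quote.
EndCharacters abcdefghijklmnopqrstuvwxyz0123456789"
```

The index would need to be rebuilt after the change. The syntax error reported 
for the '17"' search ("missing end quote") suggests the query parser also 
treats the double quote specially, so the search string may still need 
separate handling.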

Cheers,

David



At 15:48 Wednesday 15-10-2003, David Wood wrote:
>Using SWISH-E 2.4.0-pr4 on HP-UX 11, with the following config file:
>
>IndexDir /var/opt/web/rrc-web/htdocs/rrc/performance/html_src/cbo/products
>IndexFile /var/opt/web/rrc-web/swish-e.v2/cbo_products.index
>IndexName "Index file of HPIS-sourced content."
>IndexReport 3
>FileRules pathname contains /CVS
>FollowSymLinks yes
>ReplaceRules remove "/var/opt/web/rrc-web/htdocs"
>IndexOnly .htm .html
>MetaNames keywords
>MinWordLimit 2
>MaxWordLimit 30
>WordCharacters abcdefghijklmnopqrstuvwxyz0123456789_\|/-+?!@$%^'"`~.[]{}()
>
>
>this search:
>
>swish-e -w 'vf17' -f cbo_products.index
>
>gives the following result:
>
># SWISH format: 2.4.0-pr4
># Search words: vf17
># Removed stopwords:
># Number of hits: 1
># Search time: 0.000 seconds
># Run time: 0.060 seconds
>1000 /rrc/performance/html_src/cbo/products/7DD2278DE65B38AD85256D8E006EB286/7DD2278DE65B38AD85256D8E006EB286_1.html "hp pavilion vf17 17" LCD flat panel display" 16196
>.
>
>
>But if I try to search on:
>
>swish-e -w '17"' -f cbo_products.index
>
>even though the string '17"' is in the title, I get:
>
># SWISH format: 2.4.0-pr4
># Search words: 17"
># Removed stopwords:
>err: Syntax error in query (missing end quote or unbalanced parenthesis?)
>.
>
>
>and if I try to search on:
>
>swish-e -w '17\"' -f cbo_products.index
>
>I get:
>
># SWISH format: 2.4.0-pr4
># Search words: 17\"
># Removed stopwords:
>err: No search words specified
>.
>
>
>Do these results make sense?  The double quote char is listed in
>WordCharacters, so shouldn't one or both of these search strings return a
>result?
>
>
>Thanks for any assistance,
>
>David Wood
Received on Wed Oct 15 14:06:46 2003