Hi... 2 queries
I am trying to index an intranet site using version 2.2.3. All was going
well, but suddenly I have been receiving "err: the index file format is
unknown" errors whenever I try to search the index files from the
command line. I have reverted to the last working version of the config
file, but still get the message. Any clues as to what I'm doing wrong?
(I do want the StoreDescription, etc., in eventually, but have remmed it
out for now.)
Also, is there any way to stop the parsing of .gif and .jpg files? I
have the NoContents directives towards the end, but they aren't stopping
it. Is it possible to specifically block them from the parser?
Many thanks in advance,
IndexContents HTML* .htm .html .shtml
IndexContents TXT .txt .log .text
IndexContents XML* .xml
Metanames swishdocpath swishtitle
#PropertyNamesMaxLength 1000 swishdescription
#PropertyNameAlias swishdescription body
#StoreDescription TXT 2000
#StoreDescription HTML* <body> 2000
ReplaceRules replace "/home/httpd/html" "http://smpnet"
ReplaceRules replace "/home/wasters/PCfiles/public_html"
IgnoreLimit 50 1000
# This option allows the user decide if to index the comments in the
# default is 1. Set to 0 if comment indexing is not required.
NoContents .pps .EXE .xls .XLS .chm .CHM
NoContents .gif .xbm .au .mov .mpg .ps .jpg .zip .exe .ppt .avi
FileFilter .pdf /usr/local/bin/pdf-filter.sh
FileFilter .doc /usr/local/bin/docfilter.sh
FileFilter .dot /usr/local/bin/docfilter.sh
Received on Mon Oct 6 13:27:00 2003