Skip to main content.
home | support | download

Back to List Archive

(no subject)

From: Chad Day <CDay(at)not-real.mindshare.net>
Date: Fri Dec 02 2005 - 14:56:41 GMT
No idea why either. :-(

=20

>From the indexing process (swish-e -c swish.conf -v 3 -S http)

=20

retrieving
http://dev.website.org/files/Joomla%20Quick%20Start.pdf?PHPSESSID=3D413c0=
4
013e7c3505db9a68bedf8a8951 (3)...

sleeping 1 seconds before fetching
http://dev.website.org/files/Joomla%20Quick%20Start.pdf?PHPSESSID=3D413c0=
4
013e7c3505db9a68bedf8a8951

Now fetching
[http://dev.website.org/files/Joomla%20Quick%20Start%201.0.pdf?PHPSESSID
=3D413c04013e7c3505db9a68bedf8a8951]...Status: 200. application/pdf

=20

$ cat swish.conf

# Example configuration file

=20

# Tell Swish-e what to index (same as -i switch above)

IndexDir http://dev.website.org/index.php

IndexFile /usr/local/apache/htdocs/webiste.index=20

IndexOnly .php .txt .html .htm .pdf .xml .htm .shtml

=20

# Index the PDF files

FileFilter .pdf /usr/X11R6/bin/pdftotext '"%p" -'

=20

# Tell Swish-e that .txt files are to use the text parser.

IndexContents TXT* .txt .pdf

IndexContents XML* .xml

IndexContents HTML* .htm .html .shtml .php

=20

PropertyNamesMaxLength 1000 swishdescription

PropertyNameAlias swishdescription body

=20

StoreDescription TXT* 250000

Delay 1

=20

# Otherwise, use the HTML parser

DefaultContents HTML*

=20

Any ideas?=20

Chad Day

Developer

Mindshare Interactive Campaigns, LLC
202.654.0832 - www.mindshare.net <http://www.mindshare.net/> =20

=20




*********************************************************************
Due to deletion of content types excluded from this list by policy,
this multipart message was reduced to a single part, and from there
to a plain text message.
*********************************************************************
Received on Fri Dec 2 06:56:44 2005