Re: [swish-e] partial indexing

From: Peter Karman <peter(at)>
Date: Thu Mar 26 2009 - 21:23:30 GMT

Zhou Xiang wrote on 03/26/2009 03:29 PM:
> Hi David,
> Thank you for your reply!
> I tested it again today. It shows that the crawler can only index the
> webpages within "". It cannot crawl the pages
> on "" or any other websites, even though I used real
> URLs instead of queries.
> Any ideas about it?

don't use the old spider.

Use instead with -S prog.

See this documentation:


Note that with there are 2 config files: 1 for swish-e, and 1

Your swish-e config file can remain unchanged with the exception of

MaxDepth 2
TmpDir /usr/local/swish-e-2.4.5/tmp

since those are ignored with the -S prog method.
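
As a rough sketch of that setup (not part of the original message): with
-S prog, swish-e runs the external program named by IndexDir and passes it
whatever SwishProgParameters lists. The paths below are hypothetical
placeholders, and it is an assumption that the program takes its own config
file as its only parameter.

```
# swish.conf (sketch only; every path here is a hypothetical placeholder)
# With -S prog, IndexDir names the external program that feeds documents to swish-e.
IndexDir /path/to/spider-program
# Arguments handed to that program; assumed here to be its own config file.
SwishProgParameters /path/to/spider-program.config
# Where swish-e writes the resulting index.
IndexFile /path/to/index.swish-e
```

Indexing would then be started with something like
"swish-e -S prog -c swish.conf".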

Peter Karman  .  peter(at)  .

Received on Thu Mar 26 17:23:27 2009