Skip to main content.
home | support | download

Back to List Archive

Re: swish-e only spiders the server it started on

From: Cas Tuyn <cas.tuyn(at)not-real.gmail.com>
Date: Mon May 15 2006 - 14:38:43 GMT
Hi,

Since http://aaa.company.com, http://bbb.company.comand and
http://ccc.company.com offer different content, the same_hosts cannot
be used (only seems useful for removing "www." from the url).

So we'll make an array out of the servers:

  base_url => [qw! http://aaa.company.com/intranet/index.html
http://bbb.company.com/ http://ccc.company.com/ !],

And see what happens tonight.

Thanks,

Cas




On 5/15/06, Bill Moseley <moseley@hank.org> wrote:
> On Mon, May 15, 2006 at 02:20:57AM -0700, Cas Tuyn wrote:
>
> >    base_url    => 'http://aaa.company.com/intranet/index.html',
>
> That's the only domain that will be spidered.
>
> See the docs about base_url and same_hosts.
>
>
> --
> Bill Moseley
> moseley@hank.org
>
> Unsubscribe from or help with the swish-e list:
>    http://swish-e.org/Discussion/
>
> Help with Swish-e:
>    http://swish-e.org/current/docs
>    swish-e@sunsite.berkeley.edu
Received on Mon May 15 07:38:45 2006