I added my email address to the SwishSpiderConfig.pl. The output of this is below.
cat SwishSpiderConfig.pl
my %main_site = (
base_url => 'http://10.10.10.10/',
email => 'allen.lung@ftb.ca.gov',
);
my %news_site = (
base_url => 'http://10.10.10.11/doc',
email => 'allen.lung@ftb.ca.gov',
);
@servers = ( \%main_site, \%news_site );
1;
swish-e -S prog -c swish.conf
Indexing Data Source: "External-Program"
Indexing "/app/swish/lib/swish-e/spider.pl"
External Program found: /app/swish/lib/swish-e/spider.pl
/app/swish/lib/swish-e/spider.pl: Reading parameters from '/var/www/cgi-bin/SwishSpiderConfig.pl'
Warning: Substituted 6741 embedded null character(s) in file 'http://10.10.10.10/ghgmemo.doc' with a newline
Warning: Unknown header line: 'Â Â
&Â9Ã5ÃY¾j!f\0Â8ÂîV³ÂF ' from program /app/swish/lib/swish-e/spider.pl
}' from program /app/swish/lib/swish-e/spider.pl
Ã¥ÃÃH(¿Â]ôhPå±bà çäÃe: 'Ãñõ©úT
Warning: Unknown header line: '&a÷'n8DÃÃ
ÃÃÂ4¤ª¢ñ«-B^ÿõ$`ÂJ?DóÃ
Warning: Unknown header line: '#<' from program /app/swish/lib/swish-e/spider.pl
err: External program failed to return required headers Path-Name:
.
root@localhost cgi-bin# 6c
-----Original Message-----
From: Bill Moseley [mailto:moseley@hank.org]
Sent: Monday, March 22, 2004 4:40 PM
To: Lung.Allen
Cc: Multiple recipients of list
Subject: Re: Multiple web sites
On Mon, Mar 22, 2004 at 04:34:49PM -0800, Lung.Allen wrote:
> root(at)not-real.localhost /#spider.pl default http://10.10.10.10/
Are you sure you want to run that as root?
> External Program found: /app/swish/lib/swish-e/spider.pl
> /app/swish/lib/swish-e/spider.pl: Reading parameters from '/var/www/cgi-bin/SwishSpiderConfig.pl'
> LWP::RobotUA from address required at /app/swish/lib/swish-e/spider.pl line 262
You need to supply a from email address in the spider config:
email => 'Allen.Lung@ftb.ca.gov',
--
Bill Moseley
moseley@hank.org
Received on Tue Mar 23 09:25:11 2004