Skip to main content.
home | support | download

Back to List Archive

Re: Trying to run spider.pl with self-defined config file...

From: Justin Tang <justin.tang(at)not-real.positionresearch.com>
Date: Mon Mar 08 2004 - 19:11:53 GMT
Hi Bill:
  All I got is,
swish-e]# /usr/local/lib/swish-e/spider.pl config.pl > output
/usr/local/lib/swish-e/spider.pl: Reading parameters from 'config.pl'
swish-e]#

and output is an empty file...

-Justin



----- Original Message ----- 
From: "Bill Moseley" <moseley@hank.org>
To: "Justin Tang" <justin.tang@positionresearch.com>
Cc: "Swish-e" <swish-e@sunsite.berkeley.edu>
Sent: Monday, March 08, 2004 10:58 AM
Subject: Re: [SWISH-E] Re: Trying to run spider.pl with self-defined config
file...


> On Mon, Mar 08, 2004 at 10:44:21AM -0800, Justin Tang wrote:
> > Enabled debugging... all of it... still got nothing.... here's what the
file
> > looks like now.
>
> Got nothing?  Are you using the correct config file path?  You should
> have got something printed to stdout.
>
>
> >
> > my %serverA = (
> >         base_url => 'http://www.mysite.com',
> >         same_hosts => [qw/www.mysite.com/],
> >         email => 'justin@truepath.com',
> >         debug => DEBUG_INFO | DEBUG_URL | DEBUG_HEADERS | DEBUG_ERRORS |
> > DEBUG_FAILED | DEBUG_SKIPPED | DEBUG_LINKS,
> >         test_url => sub { print 'TESTING\n\n';},
> >         test_response => sub { print 'TESTING RESPONSE\n\n';},
> >         filter_content => sub {
> >                 print '\n\n\n\n\n';
> >                 print $_[0];
> >                 print '\n';
> >                 print $_[1];
> >                 print '\n';
> >                 print $_[2];
> >                 print '\n***************************************\n';
> >                 print $_[3];
> >                 return 1;
> >         },
> >         );
> >
> > @server = (\%serverA);
> > 1;
> >
> >
> >
> >
> > ----- Original Message ----- 
> > From: "Bill Moseley" <moseley@hank.org>
> > To: "Multiple recipients of list" <swish-e@sunsite.berkeley.edu>
> > Sent: Friday, March 05, 2004 10:25 PM
> > Subject: [SWISH-E] Re: Trying to run spider.pl with self-defined config
> > file...
> >
> >
> > > On Fri, Mar 05, 2004 at 05:13:30PM -0800, Justin Tang wrote:
> > > > Hi:
> > > >   I tried to run the following config file for spider.pl, but I got
no =
> > > > result.  Is there something wrong with my config file? This the
command
> > =
> > > > line I ran:=20
> > >
> > > Enable debugging?
> > >
> > > (BTW -- anyone know why do I see =3D> in some posts?)
> > > >
> > > >
> > > > /usr/local/lib/swish-e/spider.pl config.pl > output
> > >
> > > >
> > > >
> > > > output is an empty file.
> > > >
> > > > Here is the config.pl I passed in:
> > > >
> > > >
> > > > my %serverA =3D (
> > > >         base_url =3D> 'http://www.mysite.com',
> > > >         same_hosts =3D> [qw/www.mysite.com/],
> > > >         email =3D> 'test@mysite.com',
> > > >         test_url =3D> sub { print 'TESTING\n\n';},
> > > >         test_response =3D> sub { print 'TESTING RESPONSE\n\n';},
> > > >         filter_content =3D> sub {
> > > >                 print '\n\n\n\n\n';
> > > >                 print $_[0];
> > > >                 print '\n';
> > > >                 print $_[1];
> > > >                 print '\n';
> > > >                 print $_[2];
> > > >                 print '\n***************************************\n';
> > > >                 print $_[3];
> > > >                 return 1;
> > > >         },
> > > >         );
> > > >
> > > > @server =3D (\%serverA);
> > > > 1;
> > > >
> > > >
> > > > Thanks!
> > > >
> > > > -Justin
> > > >
> > > >
> > > >
*********************************************************************
> > > > Due to deletion of content types excluded from this list by policy,
> > > > this multipart message was reduced to a single part, and from there
> > > > to a plain text message.
> > > >
*********************************************************************
> > > >
> > >
> > > -- 
> > > Bill Moseley
> > > moseley@hank.org
> > >
> > >
> > >
> >
> >
> >
>
> -- 
> Bill Moseley
> moseley@hank.org
>
>
>
Received on Mon Mar 8 11:11:54 2004