Re: Problems with spider.pl on windows 98 SE

From: Adam Edelman <aedelma(at)not-real.tulane.edu>
Date: Wed Feb 13 2002 - 17:55:36 GMT
> Try running with some trace options turned on:
> 
>       -T parsed_words

I tried it with your config files and with -T parsed_words, and got:

Indexing Data Source: "External-Program"
Indexing "c:\perl\bin\perl.exe"
c:\swish-e\spider.pl: Reading parameters from 'SwishSpiderConfig.pl'
-- Starting to spider: http://arena.internet2.edu/sample.htm --
?Testing 'test_url' user supplied function #1
'http://arena.internet2.edu:80/sample.htm'
+Passed all 1 tests for 'test_url' user supplied function
?Testing 'test_response' user supplied function #1
'http://arena.internet2.edu:80/sample.htm'
+Passed all 1 tests for 'test_response' user supplied function
>> +Fetched 0 Cnt: 1 http://arena.internet2.edu:80/sample.htm 200 OK text/html 33 parent:
! Found 0 links in http://arena.internet2.edu:80/sample.htm
Path-Name: http://arena.internet2.edu:80/sample.htm
Content-Length: 33
Last-Mtime: 1013569857
<HTML>Sample document</HTML>c:\swish-e\spider.pl: Max indexed files Reached
Summary for: http://arena.internet2.edu/sample.htm
Total Bytes: 33 (33.0/sec)
Total Docs:   1 (1.0/sec)
Unique URLs:   1 (1.0/sec)
Removing very common words...
no words removed.
Writing main index...
err: No unique words indexed!
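
For anyone comparing against the trace above: the Path-Name, Content-Length and Last-Mtime lines are the headers spider.pl writes ahead of each document it hands to swish-e. Below is a minimal Python sketch of that record layout. It assumes the headers are followed by a blank line and then exactly Content-Length bytes of content, and format_record and parse_record are hypothetical helper names used only for illustration, not part of swish-e or spider.pl.

```python
def format_record(path_name, body, last_mtime):
    """Build one document record: header lines, a blank line, then the body.

    Assumption: the reader expects exactly Content-Length bytes after the
    blank line, so the length is taken from the encoded bytes, not from a
    character count.
    """
    headers = (
        f"Path-Name: {path_name}\n"
        f"Content-Length: {len(body)}\n"
        f"Last-Mtime: {last_mtime}\n"
        "\n"
    )
    return headers.encode("ascii") + body


def parse_record(record):
    """Split a record back into (headers dict, body bytes)."""
    head, _, rest = record.partition(b"\n\n")
    headers = {}
    for line in head.decode("ascii").splitlines():
        # Split on the first colon only, so URLs in the value stay intact.
        name, _, value = line.partition(":")
        headers[name.strip()] = value.strip()
    length = int(headers["Content-Length"])
    return headers, rest[:length]


if __name__ == "__main__":
    body = b"<HTML>Sample document</HTML>"
    record = format_record(
        "http://arena.internet2.edu:80/sample.htm", body, 1013569857
    )
    headers, parsed = parse_record(record)
    print(headers)
    print(parsed == body)
```

A round trip like this is one way to check that the declared Content-Length and the bytes actually delivered agree before looking further at the indexer.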
Received on Wed Feb 13 17:56:08 2002