Skip to main content.
home | support | download

Back to List Archive

Re: Spider on Windows

From: Ron Klatchko <ron(at)not-real.library.ucsf.edu>
Date: Mon Jul 26 1999 - 16:20:22 GMT
At 06:02 PM 7/23/99 -0700, Pebbs wrote:
>Swishspider running under Windows NT does not crawl pages. It only fetches
>the page I specify in the config file. has anyone else had this problem or
>have ideas?
>
>I've tested debth 2,3,5 and it doesn't matter. Swish builds the index fine
>so I know the single fetch is working. No errors are reported by Swish-e.

Try running swishspider manually.  You can invoke it using:
  swishspider.pl /path/filename_without_extensions url

It should generate three files:
  /path/filename.response
  /path/filename.links
  /path/filename.contents

filename.response should contain the HTTP status code and the mime
content-type.
filename.links should have the URLs from all <A> and <IMG> tag found in the
contents.
filename.contents should have the HTML of the page.

Please post the results of your test and we can start tracking it down.

moo




----------------------------------------------------------------------
          Ron Klatchko - Manager, Advanced Technology Group           
           UCSF Library and Center for Knowledge Management           
                        ron@library.ucsf.edu                
Received on Mon Jul 26 09:13:28 1999