Skip to main content.
home | support | download

Back to List Archive

Re: spider.pl

From: Z <techlistreader(at)not-real.yahoo.com>
Date: Wed Aug 16 2006 - 18:18:10 GMT
Bill,

I am sorry for the misunderstanding. The server exist, but only on my internal network.

I am trying to set up the search, before putting it on an external machine, for fear that one or two of the things that I am unclear about will take down the site.

These results are formed from command line requests of the internal machine, that hosts dev.site.com

##############################################
 E:\INETPUB\WWWROOT\SITE\WINDOWS>perl spider | swish-e.exe -S prog -c test.conf  
 Indexing Data Source: "External-Program" Indexing "spider.pl" 
 External Program found: E:\INETPUB\WWWROOT\SITE\WINDOWS\lib\swish-e/spider.pl spider.pl: Reading parameters from 'SwishSpiderConfig_test.pl' Skipping Server Config: http://dev.site.com/index.html E:\INETPUB\WWWROOT\SITE\WINDOWS\lib\swish-e\spider.pl: Reading parameters from 'default'  

Summary for: http://dev.site.com/
Connection: Close: 1  (0.2/sec)
      Unique URLs: 1  (0.2/sec)

 
 Removing very common words... 
 no words removed. 
 Writing main index... 
 err: No unique words indexed! 
 .  

##############################################

Z




Bill Moseley <moseley@hank.org> wrote: On Wed, Aug 16, 2006 at 10:57:26AM -0700, Z wrote:
> I'm sorry I don't understand. Does that mean that Swish-e can not work with a url that is dev.something or is there another issue?

You can't spider a web server that doesn't exist.

-- 
Bill Moseley
moseley@hank.org

Unsubscribe from or help with the swish-e list: 
   http://swish-e.org/Discussion/

Help with Swish-e:
   http://swish-e.org/current/docs
   swish-e@sunsite.berkeley.edu



 		
---------------------------------
Stay in the know. Pulse on the new Yahoo.com.  Check it out. 


*********************************************************************
Due to deletion of content types excluded from this list by policy,
this multipart message was reduced to a single part, and from there
to a plain text message.
*********************************************************************
Received on Wed Aug 16 11:18:11 2006