RE: Swish-E and HTML documents with frames

From: PropheZine Owner <bob(at)not-real.prophezine.com>
Date: Thu Feb 24 2000 - 19:25:16 GMT
Chris:

This should certainly be useful to everyone, I would think.  I'll take a
copy, thank you.  Please email it to

bob@prophezine.com

Thanks again.  I'll let you know how well it works.

Bob


-----Original Message-----
From: swish-e@sunsite.berkeley.edu
[mailto:swish-e@sunsite.berkeley.edu]On Behalf Of Chris Humphries
Sent: Thursday, February 24, 2000 1:51 PM
To: Multiple recipients of list
Subject: [SWISH-E] Swish-E and HTML documents with frames


I have been testing a new version of the Swish-E spider that can handle
HTML documents with frames.

If the spider program detects that the document is a framed HTML document,
it recursively builds the content by reading through the <frame src>
pointers, and builds up a list of all the <a href> links that it finds. It
then passes *this* back to the C program, which indexes the document as if
it were one big HTML file. The spidering will work as if all the <a href>
links found in the frameset HTML files were at level 1.
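
The approach described above can be sketched roughly as follows. This is not the actual Swish-E spider code; it is a minimal Python illustration, and the names `flatten_frames`, `_FrameScanner`, and the `fetch` callback are hypothetical. It assumes the combined document consists only of the frame contents joined together, and that a caller-supplied `fetch(url)` returns the HTML text for a URL.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class _FrameScanner(HTMLParser):
    """Collects <frame src> and <a href> targets from one HTML document."""

    def __init__(self):
        super().__init__()
        self.frames = []
        self.links = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "frame" and attr.get("src"):
            self.frames.append(attr["src"])
        elif tag == "a" and attr.get("href"):
            self.links.append(attr["href"])


def flatten_frames(url, fetch, seen=None):
    """Return (combined_html, links) for url, following <frame src> recursively.

    fetch is a hypothetical callback mapping a URL to its HTML text.
    seen guards against framesets that point back at each other.
    """
    if seen is None:
        seen = set()
    if url in seen:
        return "", []
    seen.add(url)

    html = fetch(url)
    scanner = _FrameScanner()
    scanner.feed(html)

    # Resolve relative links against the document they were found in.
    links = [urljoin(url, href) for href in scanner.links]

    # An ordinary (non-framed) document is returned unchanged.
    if not scanner.frames:
        return html, links

    # A framed document: pull in each frame and merge content and links.
    parts = []
    for src in scanner.frames:
        body, sub_links = flatten_frames(urljoin(url, src), fetch, seen)
        parts.append(body)
        links.extend(sub_links)
    return "\n".join(parts), links
```

The combined HTML would then be handed to the indexer as a single document, and every link in the returned list would be treated as belonging to the frameset page itself, which is what puts them all at the same level for spidering.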

Would this be of use to anyone?

Chris Humphries
Received on Thu Feb 24 14:29:21 2000