On Fri, Dec 12, 2003 at 01:15:43PM -0800, Francis Hwang wrote:
> Hi, I'm trying to evaluate SWISH-E as a possible search engine to use
> in crawling my site. I had a few questions where I couldn't find the
> answers in the archives:
> 1. Can SWISH-E do any searching by URL? For example, if I wanted to
> search for all pages containing the word "butter" and whose URLs
> contained the substring "recipe.html" would SWISH-E support that?
Yes. Check out this in the FAQ:
How do I limit searches to just parts of the index?
> 2. Is there any way to change the behavior of the spider based on what
> pages change more quickly? Let's say I wanted to spider my site once an
> hour but only index a few pages that change all the time, and then do a
> weekly spider of the entire site, would htdig let me do that?
htdig might, I'm not sure.
Swish-e doesn't have "incremental" indexing that will remove old data.
But indexing is quite fast.
Received on Sat Dec 13 00:09:15 2003