Yes I have a similar config and I added this to my conf.
ReplaceRules replace /public_web_sites/www.mysite.com/reflex/
And all the results from the index are prefixed with www.mysite.com.
Hope it helps..
From: Peter Karman [mailto:email@example.com]
Sent: Tuesday, March 15, 2005 2:22 PM
To: Multiple recipients of list
Subject: [SWISH-E] Re: problem with returned URL?
I assume that the 'generated URLs' are coming from swish.cgi. Can you see
the cmd line returns?
% swish-e -w shell
You probably need to configure swish.cgi to set the correct base URL, and/or
play with the ReplaceRules config option.
John Young scribbled on 3/15/05 1:02 PM:
> I am trying to set up Swish-e 2.4.3 on a Solaris box to
> use with local copy of the UNIXhelp web pages from the University
> of Edinburgh (http://unixhelp.ed.ac.uk/index.html) on a non-public
> web server. The server is running Apache httpd 2.0.53.
> My swish.conf file looks like this:
> # SWISH configuration file
> IndexDir /usr/opt/MSMBweb/htdocs/Edinburgh
> IndexFile /usr/opt/MSMBweb/htdocs/Edinburgh/index.swish
> IndexName "Index of UNIXhelp 1.3"
> IndexDescription "This is a full index of UNIXhelp release 1.3."
> IndexPointer "http://msmb.larc.nasa.gov/cgi-bin/unixhelp_search"
> IndexAdmin "webmaster"
> IndexOnly .html
> IndexReport 3
> NoContents .gif .xbm .au .mov .mpg
> IgnoreLimit 70 200
> I tried indexing the pages by doing:
> % ../cgi-bin/swish-e -i ./Edinburgh -c ./Edinburgh/swish.conf
> Which resulted in a great deal of output ending with:
> Removing very common words...
> Getting IgnoreLimit stopwords: Complete
> 13 words removed by IgnoreLimit:
> a, gov, by, of, to, larc, the, site, maintained, this, msmb, nasa, help,
> Writing main index...
> Sorting words ...
> Sorting 3,952 words alphabetically
> Writing header ...
> Writing index entries ...
> Writing word text: Complete
> Writing word hash: Complete
> Writing word data: Complete
> 3,952 unique words indexed.
> 4 properties sorted.
> 884 files indexed. 1,307,022 total bytes. 87,710 total words.
> Elapsed time: 00:00:07 CPU time: 00:00:07
> Indexing done!
> It seems to have worked, and searching for, say, "shell"
> produces a reasonable list of pages. But the generated URLs
> are incorrect, e.g. the first result is for
> "1. Using the Bourne shell to interpret a shell script"
> and the associated URL is
> but it *should* be
> I cannot figure out why "cgi-bin" is being incorrectly inserted into
> the results. I have searched the documentation on http://swish-e.org
> but I still have not found the answer. Any suggestions? Is anyone
> else on this list using the Edinburgh UNIXhelp pages with Swish-e?
> John E. Young B1148/R202
> Analytical Services and Materials, Inc. (757) 864-8659
Peter Karman . http://peknet.com/ . peter(at)not-real.peknet.com
This electronic transmission contains confidential information intended only
for the person(s) named. Any use, distribution, copying, or disclosure by
any other person is strictly prohibited. If you received this transmission
in error, please notify the sender by return e-mail and delete all copies of
Received on Tue Mar 15 11:27:00 2005