So would the following work?
(delete the DefaultContents line)
IndexOnly HTML* .htm .html .cfm .doc .pdf .ppt
NoContents .doc .pdf .ppt
The Docs, under NoContents, say that having different file types in each property won't work. So from what I am understanding, using spider.pl and this config file, IndexOnly means the spider will only index the files in which I specify the extension, and then the contents of those files will be indexed, EXCEPT for the file types in NoContents.
Is this a correct assumption? I'm not sure I understand how using IndexOnly and DefaultContents would work.
Thanks so much!
-Alan
>
> From: Peter Karman <karman@cray.com>
> Date: 2004/06/17 Thu PM 12:03:10 EDT
> To: Multiple recipients of list <swish-e@sunsite3.berkeley.edu>
> Subject: [SWISH-E] Re: PPT & swish.cgi (trying again)
>
> You likely want the IndexOnly config in addition to DefaultContents.
> Either that or NoContents, which will still index the name of the .ppt
> file but not the contents.
>
> adivey1@cox.net wrote on 06/17/2004 08:58 AM:
> > No one responded so I'm assuming the is because the message looked ridiculous coming from my webmail app.
> >
> > Here's a link of my message... I know this is an extra step but I would really appreciate the help.
> >
> > http://members.cox.net/adivey1/swishhelp.txt
> >
> > Thanks,
> > Alan
>
> --
> Peter Karman - Software Publications Programmer - Cray Inc
> phone: 651-605-9009 - mailto:karman@cray.com
>
>
Received on Thu Jun 17 17:00:53 2004