On Wed, Aug 25, 2004 at 02:43:42PM -0700, mpeters wrote:
> I have the following rules in my swish-e.conf file..
>
> ExtractPath category regex !^/web/htdocs/([^/]+)/.*$!$1!
> ExtractPathDefault category unkown
>
> When I index the files, the files found at /web/htdocs/*.html don't
> get 'unknown' as the value of their category, they get a null string
> (""). Am I missing something?
Yes, an example.
moseley@bumby:~$ cat c
ExtractPath category regex !^/usr/share/doc/apache-doc/([^/]+)/.*$!$1!
ExtractPathDefault category unknown
moseley@bumby:~$ swish-e -c c -i /usr/share/doc/apache-doc/manual/programs/other.html -T indexed_words | grep category
Adding:[1:category(10)] 'manual' Pos:1 Stuct:0x1 ( FILE )
moseley@bumby:~$ swish-e -c c -T indexed_words -i /usr/share/doc/apache-doc/manual.html | grep category
Adding:[1:category(10)] 'unknown' Pos:1 Stuct:0x1 ( FILE )
--
Bill Moseley
moseley@hank.org
Unsubscribe from or help with the swish-e list:
http://swish-e.org/Discussion/
Help with Swish-e:
http://swish-e.org/current/docs
swish-e@sunsite.berkeley.edu
Received on Wed Aug 25 17:43:15 2004