Skip to main content.
home | support | download

Back to List Archive

Re: 8-bit chars

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Wed Dec 10 2003 - 20:06:17 GMT
On Wed, Dec 10, 2003 at 11:26:11AM -0800, David L Norris wrote:
> On Wed, 2003-12-10 at 14:11, John Angel wrote:
> > So how to use pure HTML parser instead of HTML2 with prog script?
> 
> Modify the script to specify HTML instead of HTML* or HTML2.  If you are
> using a SWISH::Filter based prog-bin script then you'll need to modify
> Filter.pm.
> 
> $ grep -n 'HTML*' lib/swish-e/perl/SWISH/Filter.pm
> 21:    'text/html'     => 'HTML*',

That's one way, but you have to ask SWISH::Filter for that info.

For example, in spider.pl:

            # let's see if we can set the parser.
            $server->{parser_type} = $doc->swish_parser_type || '';

so it's the -S prog program, not really the filter that's setting the parser
type.

In other words.  If you don't want your program to set the parser type
then don't set the parser type.

-- 
Bill Moseley
moseley@hank.org
Received on Wed Dec 10 20:06:53 2003