Skip to main content.
home | support | download

Back to List Archive

RE: Mailman pipermail archive HOWTO

From: Fletcher Cocquyt <fcocquyt(at)not-real.stanford.edu>
Date: Wed Sep 07 2005 - 20:44:03 GMT
And how do you generate the hypermail archive from the mailman pipermail
archives?

The pipermail archive is divided into Year-Month.txt files
 
Thanks!

-----Original Message-----
From: Bill Moseley [mailto:moseley@hank.org] 
Sent: Wednesday, September 07, 2005 11:02 AM
To: Fletcher Cocquyt
Cc: Multiple recipients of list
Subject: Re: Mailman pipermail archive HOWTO

On Wed, Sep 07, 2005 at 10:44:22AM -0700, Fletcher Cocquyt wrote:
> Hello,
> I am looking to integrate swish-e search into my mailman list archive's
web
> interface for searching the pipermail archives.
> 
> I would like to copy this site's interface:
> 
> http://swish-e.org/search_archive/swish.cgi
> 
> I've seen some old threads but nothing definitive
> Something like a howto or cookbook in the wiki would be great!


That's just the default swish.cgi setup as it comes packaged with
swish-e.

Indexing is done with the index_hypermail.pl script that's also part
of the swish-e distribution.

All the files are thrown together in the same place on the website:

$ ls -a
                   .htaccess           index.swish-e
index_hypermail.pl  swish.conf
.                  .swishcgi.conf      index.swish-e.prop  swish.cgi

$ cat swish.conf
IndexDir ./index_hypermail.pl
SwishProgParameters ../archive

MetaNames swishtitle name email
PropertyNames name email
IndexContents HTML* .html
StoreDescription HTML* <body> 100000
UndefinedMetaTags  ignore


$ cat .swishcgi.conf 
    $ENV{TZ} = 'UTC'; # display dates in UTC format
    $ENV{LD_LIBRARY_PATH} = '/usr/local/lib';  # until path added with crle

    return {
        title           => "Search the Swish-e List Archive",
        display_props   => [qw/ name email swishlastmodified /],
        sorts           => [qw/swishrank swishtitle email
swishlastmodified/],
        metanames       => [qw/swishdefault swishtitle name email/],
        name_labels     => {
            swishrank           =>  'Rank',
            swishtitle          =>  'Subject Only',
            name                =>  "Poster's Name",
            email               =>  "Poster's Email",
            swishlastmodified   =>  'Message Date',
            swishdefault        =>  'Subject & Body',
        },

        highlight       => {
            package         => 'SWISH::PhraseHighlight',

            highlight_on    => '<font style="background:#FFFF99">',
            highlight_off   => '</font>',

            meta_to_prop_map => {   # this maps search metatags to display
properties
                swishdefault    => [ qw/swishtitle swishdescription/ ],
                swishtitle      => [ qw/swishtitle/ ],
                email           => [ qw/email/ ],
                name            => [ qw/name/ ],
                swishdocpath    => [ qw/swishdocpath/ ],
            },
        },
    };


And here's the crontab entry:

$HOME/.bashrc && rm time_to_index 2>/dev/null && cd swish/search && perl
index_hypermail.pl ../archive | swish-e -c swish.conf -S prog -i stdin -v0


That's all there is too it.

-- 
Bill Moseley
moseley@hank.org

Unsubscribe from or help with the swish-e list: 
   http://swish-e.org/Discussion/

Help with Swish-e:
   http://swish-e.org/current/docs
   swish-e@sunsite.berkeley.edu
Received on Wed Sep 7 13:44:10 2005