I have a newbie question.
I have started to create hypermail archives of our majordomo lists in
order to be able to search them via Swish-E. (swish-e 2.2.3)
The only thing that still makes me unhappy about the way I have the
Swish-E index generated is that it grabs the header and footer html from
the hypermail message, actually everything that falls within the <body>
tag. So, for instance if I am searching for my name "Paul Kissman", the
search brings back results where the only mention of my name is in the
footer pointing to the next or previous message, but not in the current
The hypermail conversion assigns the following tag to the part of my
email messages that I want to index as the swishdescription
Body of message goes here.
I can't figure out if there is a way to have swish-e just index this
part of the document or not.
PropertyNameAlias swishdescription <div class="mail"> doesn't work (not
I suppose I could have hypermail paste in some arbitrary xml tag like
Around the <div class="mail"> tags.
But since the documents coming out of hypermail are not really
well-formed xhtml, I didn't think I could use xml parsing.
Paul J. Kissman
Library Information Systems Specialist
Massachusetts Board of Library Commissioners
648 Beacon St.
Boston, MA 02215
www.mlin.lib.ma.us or www.mlin.org
617-267-9400 / 800-952-7403 (in-state)
Received on Thu Oct 9 18:35:52 2003