> -----Original Message-----
> From: swish-e@sunsite3.berkeley.edu
[mailto:swish-e@sunsite3.berkeley.edu]
> On Behalf Of Bill Moseley
> Sent: Wednesday, June 22, 2005 5:23 PM
> To: Multiple recipients of list
> Subject: [SWISH-E] Re: swish-e 2.4.3 windows 2003 iis success!
>
> On Wed, Jun 22, 2005 at 05:05:53PM -0400, Revillini, James wrote:
> > RTF's are killing it now. As soon as it runs into one, the output
file
> > from dirtree.pl goes like this:
>
> $ perl -MMIME::Types -le 'print
MIME::Types->new->mimeTypeOf("foo.rtf")'
> text/rtf
>
> I think catdoc will handle rtr somewhat. So add text/rtf to the list
> of content types that it will handle.
>
> I thought I added and easy way to add mime-types from the calling
> program (ie DirTree.pl), but don't see it right now. Try adding it to
> Filters/Doc2HTML.pm:
>
> mimetypes => [
> qr!application/(x-)?msword!,
> qr!text/rtf!,
> ]
Tried this, but it didn't seem to take. When I ran Dirtree.pl in debug
mode, the output was
>> Starting to process new document: text/rtf
++Checking filter [SWISH::Filters::Doc2txt=HASH(0x1bf1000)] for
text/rtf
++Checking filter [SWISH::Filters::Pdf2HTML=HASH(0x1c03c68)] for
text/rtf
Final Content type for //fileservername/folder/path/to/the/ArTeeEff.rtf
is text/rtf
*No filters were used
I did end up getting it to work, though. Here's how:
1. download RTF2html.pm from the link posted on swish-e.org:
http://swish-e.org/archive/2005-03/9136.html . Place this in your
filter directory - for me it's (I'm on windows 2003)
\swish-e\lib\swish-e\perl\SWISH\Filters\
2. download and install unrtf:
http://www.gnu.org/software/unrtf/unrtf.html
NOTE: windoze users, you want the executable install from
http://gnuwin32.sourceforge.net/packages/unrtf.htm (complete package,
except sources - you can dl the sources using the exe actually)
3. copy unrtf.exe to your swish-e bin directory, or else add the GnuWin
directory to your path. I did the former (copy \GnuWin\bin\unrtf.exe to
\swish-e\lib\swish-e\
4. rerun dirtree (ON A DIR WITH ONLY A FEW RTFs - they take a long time
to process!) in debug mode and the output from rtf's should be something
like:
>> Starting to process new document: text/rtf
++Checking filter [SWISH::Filters::Doc2txt=HASH(0x1bf1000)] for
text/rtf
++Checking filter [SWISH::Filters::Pdf2HTML=HASH(0x1c03c68)] for
text/rtf
++Checking filter [SWISH::Filters::Rtf2html=HASH(0x1c0e1b8)] for
text/rtf
This is UnRTF, version 0.19.3
By Dave Davey and Marcos Serrou do Amaral
Original Author: Zach T. Smith
Processing //fileservername/folder/path/to/the/ArTeeEff.rtf...
++ text/rtf *WAS* filtered by SWISH::Filters::Rtf2html=HASH(0x1c0e1b8)
Final Content type for //fileservername/folder/path/to/the/ArTeeEff.rtf
is text/
html
>Filter SWISH::Filters::Rtf2html=HASH(0x1c0e1b8) converted from
[text/rtf] to
[text/html]
I also tested the import of the new out.txt into swish-e and it worked
beautifully. I'm really close to having the right setup.
-Jim
>
>
>
> --
> Bill Moseley
> moseley@hank.org
>
> Unsubscribe from or help with the swish-e list:
> http://swish-e.org/Discussion/
>
> Help with Swish-e:
> http://swish-e.org/current/docs
> swish-e@sunsite.berkeley.edu
>
>
>
>
------------------------------------------------------------------------
--
> -----
> WARNING: The sender of this email is different from the email address
> shown in the headers. The real sender of this message is: swish-
> e@sunsite3.berkeley.edu
> If you want to add this sender to your Safe or Blocked Senders List,
you
> would need to add swish-e@sunsite3.berkeley.edu
>
------------------------------------------------------------------------
--
> -----
Received on Thu Jun 23 07:04:56 2005