Re: [swish-e] pdftotext

From: Thomas Dowling <tdowling(at)>
Date: Tue Mar 10 2009 - 10:52:10 GMT

On 03/10/2009 06:23 AM, Michelangelo Rezzonico wrote:
> Hi all,
> I use pdftotext to index PDF files.
> This works OK.
> The only problem is that the pdftotext output contains many extra spaces.
> If the PDF file contains the string "2001", then the output of
> pdftotext contains "2 0 0 1".

I don't see this behavior with pdftotext 3.02.

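The original PDF may actually contain space characters between the
digits, used as a way to fake letter spacing.  What happens if you copy
the text from the PDF file and paste it into a text editor?  If the
pasted text also reads "2 0 0 1", the spaces are in the document itself
rather than being added by pdftotext.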
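
If the spaces do turn out to be in the PDF itself, one possible
workaround is to clean up the extracted text before it is indexed.  The
following is only a rough sketch in Python, not part of swish-e or
pdftotext: the function name collapse_letter_spacing and the threshold
of three characters are assumptions made for illustration.  It joins
runs of three or more single characters separated by single spaces,
so "2 0 0 1" becomes "2001".  It is a heuristic and can merge text that
was legitimately spaced, so check it against your own files.

```python
import re

# Hypothetical helper, not part of swish-e or pdftotext.
# Matches a run of at least three single non-space characters that are
# separated by exactly one space each, e.g. "2 0 0 1".
_SPACED_RUN = re.compile(r"(?<!\S)(?:\S ){2,}\S(?!\S)")


def collapse_letter_spacing(text):
    """Join letter-spaced runs such as "2 0 0 1" into "2001"."""
    return _SPACED_RUN.sub(lambda m: m.group(0).replace(" ", ""), text)


if __name__ == "__main__":
    import sys

    # Read extracted text on stdin and write the cleaned text to stdout.
    sys.stdout.write(collapse_letter_spacing(sys.stdin.read()))
```

Saved under a name of your choice, the script can sit as a filter
between pdftotext and the indexer, reading the extracted text on stdin.
Runs of only two single characters (for example "a b") are left alone
to reduce false matches.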

Thomas Dowling

Received on Tue Mar 10 06:52:13 2009