Re: xsane & tesseract
- Date: Fri, 25 Aug 2017 22:00:56 -0600
- From: Joe Pfeiffer <pfeiffer@xxxxxxxxxxx>
- Subject: Re: xsane & tesseract
Doug <dmcgarrett@xxxxxxxxxxxxx> writes:
> On 08/25/2017 08:31 PM, Stephen Grant Brown wrote:
>> Hi All,
>> How do I set up xsane to use the tesseract OCR engine?
>> I see gocr under preferences->setup->ocr.
>> Yours Sincerely
>> Stephen Grant Brown.
> Unless it has been vastly improved, you might as well copy the document by hand! Finding and fixing all the mistakes is not worth the
> Abbyy for Windows does an excellent job. One of only two programs I will boot Windows for. (The other one is a phono-to-CD program.)
My experience OCRing a 16-page document with tesseract last spring was
quite good. I didn't try to set xsane up to do it, as I thought it
would be a *long* time before I did it again. Instead, I scanned the
document to ppm files, ran tesseract on them, put tesseract's output
into a .txt file, and cleaned up from there. While it wasn't perfect, it was
far better than retyping the whole thing would have been.
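
For anyone wanting to repeat the scan-then-OCR steps described above, here is
a rough sketch in Python. It is not the exact procedure used for that
document. It assumes the tesseract command-line program is installed and on
the PATH, that the scanned pages are named so that they sort in page order
(for example page01.ppm, page02.ppm), and that English is the right language
model. The function names, the file names, and the output name document.txt
are made up for illustration.

```python
import subprocess
from pathlib import Path


def tesseract_command(image, lang="eng"):
    """Build a tesseract call that prints the recognized text to stdout.

    Passing "stdout" as the output base makes tesseract write the text to
    standard output instead of creating <outputbase>.txt.
    """
    return ["tesseract", str(image), "stdout", "-l", lang]


def ocr_pages(images, out_path, run=subprocess.run):
    """OCR each image in sorted order and write the combined text to out_path.

    `run` is injectable only so the function can be tried without tesseract;
    by default it is subprocess.run, which raises if tesseract fails.
    """
    texts = []
    for image in sorted(images):
        result = run(
            tesseract_command(image),
            capture_output=True,
            text=True,
            check=True,
        )
        texts.append(result.stdout)
    # One text file for the whole document, ready for manual cleanup.
    Path(out_path).write_text("\n".join(texts), encoding="utf-8")
    return out_path


# Hypothetical usage, assuming the scans sit in the current directory:
#   ocr_pages(Path(".").glob("page*.ppm"), "document.txt")
```

The result still needs proofreading by hand, as noted above. Keeping each
page's text in scan order just makes that cleanup pass easier to do against
the paper original.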
"Erwin, have you seen the cat?" -- Mrs. Schrödinger