Scanning and OCR with Mint

Questions about applications and software
Forum rules
Before you post please read how to get help
alfreddo
Level 1
Level 1
Posts: 20
Joined: Wed Jun 24, 2009 3:28 pm

Scanning and OCR with Mint

Postby alfreddo » Fri Jul 24, 2009 8:00 am

Using my Windows XP PC, I scan images from my Epson V350 into Paintshop Pro, and documents (typed or duplicated) using the OCR program Abbyy Finereader 6.
What would be the best equivalents using Mint? I've read that the Epson V350 Linux driver is not easy to use and will only scan to 2400 dpi instead iof the 4800 dpi available on Windows. Perhaps I should think of buying a new scanner to work with Mint - if so, which scanner would you recommend.
Paintshop Pro isn't available in Linux. Can I scan using Gimp? Scanning isn't mentioned in the Gimp help files. If not Gimp, then what?
Abbyy Finereader does an excellent job of producing editable text from old stencil duplicated originals, but also isn't available in Linux. Can you recommend a competent OCR program to use with Mint.

Many thanks. I'm hoping to move from XP to Mint completely, but not unless the scanning issue can be resolved.

kwisher
Level 5
Level 5
Posts: 745
Joined: Wed Mar 05, 2008 12:54 pm
Location: Greentown, Indiana USA

Re: Scanning and OCR with Mint

Postby kwisher » Fri Jul 24, 2009 12:21 pm

Xsane is a good scanning application. I believe it will do OCR also.
The instructions suggested Windows XP or better, so I installed Linux :)

User avatar
DrHu
Level 17
Level 17
Posts: 7560
Joined: Wed Jun 17, 2009 8:20 pm

Re: Scanning and OCR with Mint

Postby DrHu » Fri Jul 24, 2009 2:11 pm

alfreddo wrote:Abbyy Finereader does an excellent job of producing editable text from old stencil duplicated originals, but also isn't available in Linux. Can you recommend a competent OCR program to use with Mint.

OCR..
--try a few different engines on your source material to determine for yourself which works best, as different sources will produce different results across the board..
    p tesseract-ocr - Command line OCR tool
    p tesseract-ocr-eng - tesseract-ocr language files for English t
--I don't know if you need the language files or not, it is originally a German product..
http://code.google.com/p/tesseract-ocr/

Scanning..
mint menu>All Applications>Graphics>Scanner Utility
http://projects.gnome.org/gnome-scan/index
http://www.associatedcontent.com/articl ... thats.html

There is a gimp plug-in to acquire (scan) into gimp
--however you can use any application that works to scan and maniplulate the image within Gimp..
In Terminal, check
    apt search gimp
    p gimp2.0-quiteinsane - A Qt based SANE plugin for GIMP 2.0

An ocr scan is a separate problem, in Linux there are several OCR engines available
--translating a page from text/image and extracting the text content..
In Terminal, check
    apt search ocr
    p gocr - A command line OCR
    p ocrad - Optical Character Recognition program
These are the two that are usually included with sane scanning package or the graphical scanning package on the desktop

alfreddo
Level 1
Level 1
Posts: 20
Joined: Wed Jun 24, 2009 3:28 pm

Re: Scanning and OCR with Mint

Postby alfreddo » Fri Jul 24, 2009 4:42 pm

Many thanks for the suggestions. I'll let you know the outcome.

lexon
Level 5
Level 5
Posts: 868
Joined: Sat Jan 31, 2009 10:53 pm
Location: MA USA

Re: Scanning and OCR with Mint

Postby lexon » Fri Jul 24, 2009 8:18 pm

I have scanned with Xsane but when I try to convert to text, the results are quite lousy. No solution.

lexon
Lindows, Linspire, Freespire, Ubuntu, Mint 15 Cinnamon, Mint 16 XFCE, Mint 17 Cinnamon 64 bit.

julianvb
Level 4
Level 4
Posts: 229
Joined: Sat Dec 14, 2013 9:43 pm
Location: Virginia Beach, VA

Re: Scanning and OCR with Mint

Postby julianvb » Fri Oct 24, 2014 3:59 pm

Has anyone used gocr, a command line OCR application. When I try it with a couple of simple recipes, I see no system errors but the results are just a bunch of random characters. I first converf the jpg file to pnm using convert since gocr requires pnm as one of the input formats.

Because it's so simple to use, I really would like to see it work in Lnux Mint 17 for me. Any help will be appreciated.

julianvb
Julianvb: Linux Mint 17.1 KDE "Rebecca" 32bit, Ubuntu 12.04. HP Compaq Presario: 32bit, 2gb Ram, 80gb hdd.

julianvb
Level 4
Level 4
Posts: 229
Joined: Sat Dec 14, 2013 9:43 pm
Location: Virginia Beach, VA

Re: Scanning and OCR with Mint

Postby julianvb » Sat Oct 25, 2014 1:32 am

I think I am very satisfied with Tesseract as an OCR as it is simple to use and its performance is more than adequate for me. It is such a pleasure to edit its output with LibreOffice writer. In no time I end up with an excellent .odt text file. I highly recommend it to any Linux user.

In case one is new to Tesseract in LInux after installing it, one only needs to convert the image file to .tif format and then issue the following command:

tesseract filename.tif filename

julianvb
Julianvb: Linux Mint 17.1 KDE "Rebecca" 32bit, Ubuntu 12.04. HP Compaq Presario: 32bit, 2gb Ram, 80gb hdd.

User avatar
Bolle1961
Level 4
Level 4
Posts: 429
Joined: Tue Jun 29, 2010 12:59 pm

Re: Scanning and OCR with Mint

Postby Bolle1961 » Sat Oct 25, 2014 3:29 am

I use ocrfeeder (uses tesseract as backend) and that works good.


Return to “Software & Applications”

Who is online

Users browsing this forum: gonzodavis, Yahoo [Bot] and 8 guests