Scanning and OCR with Mint

Questions about applications and software
Forum rules
Before you post please read how to get help

Scanning and OCR with Mint

Postby alfreddo on Fri Jul 24, 2009 8:00 am

Using my Windows XP PC, I scan images from my Epson V350 into Paintshop Pro, and documents (typed or duplicated) using the OCR program Abbyy Finereader 6.
What would be the best equivalents using Mint? I've read that the Epson V350 Linux driver is not easy to use and will only scan to 2400 dpi instead iof the 4800 dpi available on Windows. Perhaps I should think of buying a new scanner to work with Mint - if so, which scanner would you recommend.
Paintshop Pro isn't available in Linux. Can I scan using Gimp? Scanning isn't mentioned in the Gimp help files. If not Gimp, then what?
Abbyy Finereader does an excellent job of producing editable text from old stencil duplicated originals, but also isn't available in Linux. Can you recommend a competent OCR program to use with Mint.

Many thanks. I'm hoping to move from XP to Mint completely, but not unless the scanning issue can be resolved.
alfreddo
Level 1
Level 1
 
Posts: 20
Joined: Wed Jun 24, 2009 3:28 pm

Linux Mint is funded by ads and donations.
 

Re: Scanning and OCR with Mint

Postby kwisher on Fri Jul 24, 2009 12:21 pm

Xsane is a good scanning application. I believe it will do OCR also.
The instructions suggested Windows XP or better, so I installed Linux :)
User avatar
kwisher
Level 5
Level 5
 
Posts: 643
Joined: Wed Mar 05, 2008 12:54 pm
Location: Greentown, Indiana USA

Re: Scanning and OCR with Mint

Postby DrHu on Fri Jul 24, 2009 2:11 pm

alfreddo wrote:Abbyy Finereader does an excellent job of producing editable text from old stencil duplicated originals, but also isn't available in Linux. Can you recommend a competent OCR program to use with Mint.

OCR..
--try a few different engines on your source material to determine for yourself which works best, as different sources will produce different results across the board..
    p tesseract-ocr - Command line OCR tool
    p tesseract-ocr-eng - tesseract-ocr language files for English t
--I don't know if you need the language files or not, it is originally a German product..
http://code.google.com/p/tesseract-ocr/

Scanning..
mint menu>All Applications>Graphics>Scanner Utility
http://projects.gnome.org/gnome-scan/index
http://www.associatedcontent.com/articl ... thats.html

There is a gimp plug-in to acquire (scan) into gimp
--however you can use any application that works to scan and maniplulate the image within Gimp..
In Terminal, check
    apt search gimp
    p gimp2.0-quiteinsane - A Qt based SANE plugin for GIMP 2.0

An ocr scan is a separate problem, in Linux there are several OCR engines available
--translating a page from text/image and extracting the text content..
In Terminal, check
    apt search ocr
    p gocr - A command line OCR
    p ocrad - Optical Character Recognition program
These are the two that are usually included with sane scanning package or the graphical scanning package on the desktop
User avatar
DrHu
Level 17
Level 17
 
Posts: 7021
Joined: Wed Jun 17, 2009 8:20 pm

Re: Scanning and OCR with Mint

Postby alfreddo on Fri Jul 24, 2009 4:42 pm

Many thanks for the suggestions. I'll let you know the outcome.
alfreddo
Level 1
Level 1
 
Posts: 20
Joined: Wed Jun 24, 2009 3:28 pm

Re: Scanning and OCR with Mint

Postby lexon on Fri Jul 24, 2009 8:18 pm

I have scanned with Xsane but when I try to convert to text, the results are quite lousy. No solution.

lexon
Lindows, Linspire, Freespire, Ubuntu, Mint 15 Cinnamon, Mint 16 XFCE, Mint 17 Cinnamon 64 bit.
lexon
Level 5
Level 5
 
Posts: 726
Joined: Sat Jan 31, 2009 10:53 pm
Location: MA USA

Re: Scanning and OCR with Mint

Postby julianvb on Fri Oct 24, 2014 3:59 pm

Has anyone used gocr, a command line OCR application. When I try it with a couple of simple recipes, I see no system errors but the results are just a bunch of random characters. I first converf the jpg file to pnm using convert since gocr requires pnm as one of the input formats.

Because it's so simple to use, I really would like to see it work in Lnux Mint 17 for me. Any help will be appreciated.

julianvb
julianvb
Level 1
Level 1
 
Posts: 17
Joined: Sat Dec 14, 2013 9:43 pm

Re: Scanning and OCR with Mint

Postby julianvb on Sat Oct 25, 2014 1:32 am

I think I am very satisfied with Tesseract as an OCR as it is simple to use and its performance is more than adequate for me. It is such a pleasure to edit its output with LibreOffice writer. In no time I end up with an excellent .odt text file. I highly recommend it to any Linux user.

In case one is new to Tesseract in LInux after installing it, one only needs to convert the image file to .tif format and then issue the following command:

tesseract filename.tif filename

julianvb
julianvb
Level 1
Level 1
 
Posts: 17
Joined: Sat Dec 14, 2013 9:43 pm

Re: Scanning and OCR with Mint

Postby Bolle1961 on Sat Oct 25, 2014 3:29 am

I use ocrfeeder (uses tesseract as backend) and that works good.
Bolle1961
Level 4
Level 4
 
Posts: 212
Joined: Tue Jun 29, 2010 12:59 pm

Linux Mint is funded by ads and donations.
 

Return to Software & Applications

Who is online

Users browsing this forum: No registered users and 36 guests