Tag Archives: ocr

OCR Comparison By Marco Arment

If there was some way to find out what the most used non-Apple app on my iPhone and iPad is, it would almost certainly be Instapaper by Marco Arment. I am either saving or reading articles in there every single day. Between my Instapaper use and listening to the Build and Analyze podcast, I spend […]

Comments ( 5 )

How To Find PDFs That Are Not Searchable

Sometimes, especially when you are a doing a big OCR project, you might want to find all the PDFs that are not searchable. That is to say, you want to find the PDFs that have not been OCR-ed. It turns out that this is not as easy as you might think. Here are a few […]

Comments ( 3 )

Free Online OCR With RICOH Innovations

Most scanners these days come with an Optical Character Recognition, or OCR, program of some sort to make PDFs searchable. However, what if you don’t have an OCR program or you just want to do a quick and dirty file conversion without messing around with an application? A few DocumentSnap commenters have pointed out that […]

Comments ( 3 )

OCR Smackdown: ABBYY FineReader vs. Adobe Acrobat

A very common request that I get here at DocumentSnap is to compare the Optical Character Recognition (OCR) capabilities of ABBYY FineReader with Adobe Acrobat. Why? Well, for starters, both of them come included with models the Fujitsu ScanSnap as well as other scanners. I decided to do a quick test comparing the OCR of […]

Comments ( 23 )

OCR And Orphan Works

As I have written about before, I always find it fascinating to read about different scanning projects, especially when it comes to scanning old stuff. Over at the GalleyCat blog, Jason Boog writes about using Optical Character Recognition software to dig through orphan works. What the heck are “orphan works”? I didn’t know either. According […]

Comments ( 0 )

Hazel Rule To OCR Documents Using PDFPen

The other day I posted an Applescript to OCR documents using PDFPen. In the comments, awesome DocumentSnap reader Josh requested that it be done as a Hazel rule instead. Given that my love for Hazel is well documented, I am happy to oblige. I created a folder and then created the following Hazel rule to […]

Comments ( 25 )

PDFPen OCR Applescript To Automatically Make PDFs Searchable

I don’t know if it is because I have been glued to a computer since I was six years old, but my handwriting and printing is terrible. Really terrible. I think my 5 year old son and I have pretty similar handwriting skills. Normally this is not a problem, except when I have to fill […]

Comments ( 17 )

ScanSnap and Hazel Is A Match Made In Paperless Heaven

There are a lot of tricks out there for keeping your documents organized based on their location or filename, but the holy grail is to be able to keep them organized based on the actual contents of the documents themselves. I have written before about how the Fujitsu ScanSnap S1500, the S1500M and the S1300 […]

Comments ( 32 )

Lifehacker OCR Call For Votes

The folks over at Lifehacker are running one of their famous High Five calls for submissions, this time about readers’ favorite OCR tools. OCR tools have been around for decades, but only recently have they been affordable (in many instances free) and accessible to people outside of government and corporate offices. This week we want […]

Comments ( 1 )

Using Microsoft Office Document Imaging To OCR For Free

If you are a Windows user and already have Microsoft Office XP through 2007, chances are you already have the ability to OCR documents to get the text out of them. It’s called Microsoft Office Document Imaging (MODI). I’m not going to lie, what I am about to show you is not exactly the best […]

Comments ( 9 )