Batch OCR help. | General Going Paperless | Forum

 

Batch OCR help.


1:33 pm
July 27, 2011


drew

Member

posts 5

My workflow thoughts so far of what I would like to do:

I save PDFs via the print dialog of OS X into a folder in my Dropbox account, so I can send stuff there from anywhere. I then have Hazel watching that folder to move each PDF, OCR it, and file it based on the text the OCR reads. The problem is I can't figure out how to OCR a document with NO user input. I don't want to spend any time answering dialog boxes while running a manual batch process.

 

Any ideas?
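For the "file based on what the OCR reads" step, here is a minimal Python sketch of the idea, not anything Hazel or a specific OCR tool provides. It assumes the text has already been extracted from the PDF by whatever OCR tool you end up using. The keywords, folder names, and the `pick_folder` and `file_pdf` helpers are all hypothetical; swap in your own filing scheme.

```python
from pathlib import Path
import shutil

# Hypothetical keyword -> folder rules; adjust to your own filing scheme.
# The first matching rule wins, so put the most specific keywords first.
RULES = [
    ("invoice", "Invoices"),
    ("statement", "Bank"),
    ("receipt", "Receipts"),
]


def pick_folder(text, rules=RULES, default="Unsorted"):
    """Return the folder for the first keyword found in the OCR text."""
    lowered = text.lower()
    for keyword, folder in rules:
        if keyword in lowered:
            return folder
    return default


def file_pdf(pdf_path, text, archive_root):
    """Move pdf_path into archive_root/<folder>, chosen from its text."""
    dest_dir = Path(archive_root) / pick_folder(text)
    dest_dir.mkdir(parents=True, exist_ok=True)
    dest = dest_dir / Path(pdf_path).name
    shutil.move(str(pdf_path), str(dest))
    return dest
```

A script like this could be run from a folder-watching rule once the OCR step has finished, for example `file_pdf("scan.pdf", ocr_text, "Archive")`, with no dialogs involved.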

2:21 pm
July 27, 2011


drew

Member

posts 5

I keep getting these errors when trying to use your OCRit, and they are the kind of user intervention I am talking about.

2:47 pm
July 27, 2011


Brooks

Vancouver, BC

Admin

posts 203

Hm, that is a weird message. I guess Acrobat doesn't like the PDFs created by OS X's Print dialog. What are you printing, and what are you printing from?

 

In my experience, OSX is pretty good about making PDFs created via Printing already searchable, so it may be that your PDF is already searchable and you don't need to OCR?
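One way to test that before OCRing is to see whether the PDF already yields text. The sketch below is only a rough heuristic in Python. It assumes the `pdftotext` command-line tool from Poppler is installed, which is not part of OS X by default. The `extract_text` and `looks_searchable` helpers and the 20-character threshold are made up for illustration.

```python
import shutil
import subprocess


def extract_text(pdf_path):
    """Return the text embedded in a PDF via the pdftotext CLI (assumed installed)."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext not found on PATH")
    # "-" as the output file tells pdftotext to write to standard output.
    result = subprocess.run(
        ["pdftotext", str(pdf_path), "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def looks_searchable(text, min_chars=20):
    """Heuristic: searchable if there are at least min_chars non-whitespace characters."""
    return sum(1 for ch in text if not ch.isspace()) >= min_chars
```

If `looks_searchable(extract_text("printed.pdf"))` comes back true, the file probably has a text layer already and the OCR step can be skipped for it.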

8:18 am
July 28, 2011


drew

Member

posts 5

So I decided to give PDF OCR X a try. That thing works the way I want. No intervention needed. Do documents coming out of the ScanSnap need to be batch processed for OCR, or does it do it automatically? My S1500M arrives next week.

10:44 am
July 28, 2011


Brooks

Vancouver, BC

Admin

posts 203

Thanks for the tip about PDF OCR X. With the ScanSnap, if you check the "Create searchable PDF" checkbox on your Profile, it will automatically OCR them for you. No intervention needed.

10:47 am
July 28, 2011


drew

Member

posts 5

I learned about PDF OCR X from a local developer @ http://stevelosh.com/blog/2011/05/paper-free/

10:49 am
July 28, 2011


Brooks

Vancouver, BC

Admin

posts 203

Thanks for that. I actually have that post in my Instapaper to read/link to someday but you've motivated me to bump up the timeframe a bit!

10:54 am
July 28, 2011


drew

Member

posts 5

Interesting: he is local to me, and the people behind PDF OCR X are local to you.

10:57 am
July 28, 2011


Brooks

Vancouver, BC

Admin

posts 203

Not only local but about a 10 minute walk away. I had no idea!  I should reach out to them.  Thanks for the pointer!
