Letter to a young programmer
From Guido van Rossum - inventor of Python.
I heard you enjoy a certain programming language named Python. Programming is a wonderful activity. I am a little jealous that you have access to computers at your age; when I grew up I didn’t even know what a computer was! I was an electronics hobbyist though, and my big dream was to build my own electronic calculator from discrete components. I never did do that, but I did build several digital clocks, and it was amazing to build something that complex and see it work.
Free OCR solution on OS X
Tesseract OCR is probably the best open source OCR engine available. It allows you to convert text from an image.
Install tesseract # I suppose you already have homebrew installed. If not, copy and paste this into Terminal.
ruby -e "$(curl -fsSL https://raw.github.com/mxcl/homebrew/go)" Once you got homebrew, installing tesseract is as simple as brew install tesseract
Automate the process # tesseract accepts input as image. If you input is PDF, you will have to convert it to image first, perferably TIFF format.