From Guido van Rossum - inventor of Python. I heard you enjoy a certain programming language named Python. Programming is a wonderful activity. I am a little jealous that you have access to computers at your age; when I grew up I didn’t even know what a computer was! I was an electronics hobbyist though, and my big dream was to build my own electronic calculator from discrete components. I never did do that, but I did build several digital clocks, and it was amazing to build something that complex and see it work.
Tesseract OCR is probably the best open source OCR engine available. It allows you to convert text from an image. Install tesseract # I suppose you already have homebrew installed. If not, copy and paste this into Terminal. ruby -e "$(curl -fsSL https://raw.github.com/mxcl/homebrew/go)" Once you got homebrew, installing tesseract is as simple as brew install tesseract Automate the process # tesseract accepts input as image. If you input is PDF, you will have to convert it to image first, perferably TIFF format.