December 2018
Intermediate to advanced
182 pages
4h 21m
English
Tesseract is Google's optical character recognition library, and is not natively a Python package. Because of this, there's a Python binding for it that calls the executable, which can then be installed manually.
Go to the GitHub repository for Tesseract, which is found at the following link: https://github.com/tesseract-ocr/tesseract.
Scroll down to the Installing Tesseract section in the GitHub readme. Here, we are presented with two options:
We want to install it via the pre-built binary package, so click on that link. We can also build it from source if we want to, but that doesn't really offer any advantages. The Tesseract Wiki explains the ...
Read now
Unlock full access