Deep Learning Techniques for Automation and Industrial Applications
by Pramod Singh Rathore, Sachin Ahuja, Srinivasa Rao Burri, Ajay Khunteta, Anupam Baliyan, Abhishek Kumar
1Text Extraction from Images Using Tesseract
Santosh Kumar*, Nilesh Kumar Sharma, Mridul Sharma and Nikita Agrawal
Department of Computer Science, Global Institute of Technology, Jaipur, India
Abstract
Images play a crucial role in describing, representing, and conveying information, which are essential in many businesses and organizations. Text extraction from images is the method used to convert text to plain text. Text extraction refers to the systematic procedure used for converting textual material into a simplified plain text format. The task at hand presents a significant difficulty as a result of the many changes in variables such as the size, orientation, and alignment of the text. Furthermore, the inclusion of low-resolution, pixelated pictures, coupled with the existence of noisy backgrounds, exacerbates the complexity associated with the process of text extraction. Using the Tesseract OCR engine, we aim to reduce these issues in this project.
Tesseract is developed by Google and is an open-source optical character recognition (OCR) engine. OCR technology allows computers to recognize text in images, making it possible to convert images of text into machine-readable text. Tesseract has been trained in a wide variety of languages and scripts, including English, Chinese, and Arabic.
Tesseract can process images that are rotated, tilted, or skewed and can recognize text written in different scripts, such as English and Arabic. It uses machine learning algorithms to ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access