Machine Learning in Linux: EasyOCR – OCR that reads natural scene text and dense text in documents
Our Machine Learning in Linux series focuses on apps that make it easy to experiment with machine learning.
EasyOCR is a Python module for extracting text from images. It is a general OCR that can read both natural scene text and dense text in documents. The software supports more than 80 languages. It’s free and open source software.
We tested EasyOCR with PCs running Ubuntu and Manjaro including a machine with an NVIDIA GeForce RTX 3060 Ti dedicated graphics card, and an Intel NUC 13 Pro which only has onboard Intel Iris Xe. We’ll go through installing the GPU and CPU versions of PyTorch.
There are a variety of ways of installing EasyOCR without polluting our machines. We’ll install EasyOCR in a isolated Python environment.