Current Position:fig. beginning " AI Answers

How to install OCRmyPDF on Linux system?

2025-08-14

156

The following steps are required to install OCRmyPDF on a Linux system (Ubuntu 22.04 for example):

First make sure that Python 3 and pip are installed on your system:
python3 --version
pip3 --version
Install dependencies:
sudo apt update
sudo apt install tesseract-ocr ghostscript python3-pip pngquant
Install OCRmyPDF using pip:
pip3 install ocrmypdf
Verify the installation:
ocrmypdf --version
If the version number is displayed, the installation was successful

Note: Processing non-English documents requires additional installation of the corresponding Tesseract language packs, such as Chinese need to install thetesseract-ocr-chi-simThe

This answer comes from the articleOCRmyPDF: scanned PDF into searchable text of the open source toolThe