OCRmyPDF supports text recognition in up to 39 languages, including English, German, Chinese, and other widely used languages. Users specify the language with the -l parameter; for example, -l chi_sim recognizes Simplified Chinese. Note that recognizing any language other than English requires installing the corresponding Tesseract language pack separately. On Ubuntu, for example, the Simplified Chinese language pack can be installed with the command 'sudo apt install tesseract-ocr-chi-sim'.
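As a rough illustration of the -l parameter, the sketch below assembles an OCRmyPDF command line from a list of Tesseract language codes. The helper name build_ocrmypdf_command and the file names scan.pdf and scan_ocr.pdf are hypothetical and chosen only for this example. Joining several codes with '+' follows Tesseract's convention for multiple languages. The sketch only builds and prints the command, and it assumes the matching language packs are already installed if the command is later run.

```python
import shlex


def build_ocrmypdf_command(input_pdf, output_pdf, languages):
    """Return the argument list for an OCRmyPDF run (hypothetical helper)."""
    if not languages:
        raise ValueError("at least one Tesseract language code is required")
    # Multiple language codes are joined with '+', e.g. 'eng+chi_sim'.
    language_arg = "+".join(languages)
    return ["ocrmypdf", "-l", language_arg, input_pdf, output_pdf]


if __name__ == "__main__":
    # File names are placeholders; replace them with real paths.
    command = build_ocrmypdf_command("scan.pdf", "scan_ocr.pdf", ["chi_sim"])
    # Print a shell-safe version of the command instead of executing it.
    print(shlex.join(command))
```

The resulting list can be passed to subprocess.run once OCRmyPDF and the required language pack are installed. Keeping the command as a list rather than a single string avoids quoting problems with file names that contain spaces.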
This answer comes from the article "OCRmyPDF: scanned PDF into searchable text of the open source tool".