Designed for long-term archiving of documents, OCRmyPDF's solutions for achieving compliant storage include:
- Default generation of PDF/A format (ISO 19005 standard), which is a subset of PDF specifically designed for long-term archiving
- pass (a bill or inspection etc)
--output-type pdfa
Ensure output is PDF/A compliant - Automatic handling of non-standard elements in documents, such as image format conversion and font embedding
- Supports metadata retention, so important document information will not be left behind
- furnish
--clean-final
Option to further remove temporary data and redundant information
These features make OCRmyPDF ideal for legal documents, financial records and other scenarios that require compliant archiving, generating documents that remain readable for decades.
This answer comes from the articleOCRmyPDF: scanned PDF into searchable text of the open source toolThe