The steps to install and use Docstrange are listed below:
- Ensure that Python 3.8 or higher is installed
- Install the Docstrange library via pip:
pip install docstrange
- Select the processing mode:
- Cloud Mode (default): no additional configuration required, but you can register NanoNets to get API keys to increase limits
- Native mode: OCR dependencies (e.g. Ollama) need to be installed, use the
--cpu-mode
maybe--gpu-mode
parameters
- Start working with documents via the Python API or command line
For example, the basic PDF to Markdown command:docstrange document.pdf --output markdown
More detailed usage can be found in the GitHub documentation.
This answer comes from the articleDocstrange: a tool for extracting data from documents and images and converting them to multiple formatsThe