Zero-config CLI for PDF and image OCR via local LLM
Self-contained command-line tool that converts PDFs and images to structured Markdown using a local GLM-OCR model through Ollama. Manages the full pipeline automatically — Ollama lifecycle, model pulling, PDF rendering, layout detection via PP-DocLayoutV3, and result merging. Dual distribution through npm and pip with 50+ configurable options.