glmmedia-ocr

Zero-config CLI for PDF and image OCR via local LLM

RoleDevelopment

Year2026

pythoncliollamaocrpytorch

Self-contained command-line tool that converts PDFs and images to structured Markdown using a local GLM-OCR model through Ollama. Manages the full pipeline automatically — Ollama lifecycle, model pulling, PDF rendering, layout detection via PP-DocLayoutV3, and result merging. Dual distribution through npm and pip with 50+ configurable options.

Related ProjectsAll Projects →

♠

Application

Crimson Royale

♠

Application

PingBox

♠

Application

YAR: Navala Siege

All Projects Back to Index