Local Document Intelligence Pipeline That Unifies OCR, AI Classification, and Search
Getting a complete document intelligence workflow running locally requires stitching together Paperless-ngx for storage, Stirling PDF for manipulation, paperless-gpt for AI tagging, and custom scripts for the gaps. Built-in OCR still fails on tables and photographs. Users want one self-hosted pipeline that handles scan-to-searchable-archive with AI categorization without uploading anything to the cloud.
Don't rebuild Paperless-ngx. Build the missing middle layer: a local OCR+AI service that accepts documents via API, runs vision-model OCR (not Tesseract), classifies, extracts structured data, and pushes results back to Paperless-ngx or any document store. Ship it as a single Docker container with Qwen-VL or similar baked in.
landscape (3 existing solutions)
The pieces exist but the pipeline is fragmented across 3-4 separate tools requiring Docker expertise to glue together. The approaching native AI in Paperless-ngx may close part of this gap, but the OCR quality problem (tables, photos, handwriting) persists because Tesseract is the bottleneck. Vision-capable local LLMs are the solution but integration is DIY.