YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
-
Updated
Feb 28, 2026 - Python
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
A hybrid Computer Vision pipeline designed to transform high-variance smartphone document photos into structured, digital PDFs. It bridges the gap between raw pixel data and semantic document understanding using a combination of classical geometric processing and Deep Learning.
Add a description, image, and links to the doclaynet topic page so that developers can more easily learn about it.
To associate your repository with the doclaynet topic, visit your repo's landing page and select "manage topics."