A FastAPI-based web application that uses OCR (Optical Character Recognition) to automatically extract totals from receipt and invoice images. The system is containerized with Docker for simple and fast deployment.
Features:
- Intelligent OCR: Extracts totals from receipt and invoice images using pattern recognition
- Multi-format Support: Accepts common image formats (JPEG, PNG, WEBP, BMP, TIFF)
- Web Interface: Browser page for uploading images
- REST API: Endpoints for programmatic integration
- Multi-language: Recognizes totals in Italian, English, French, German, and Spanish
- Data Persistence: Results are saved automatically in JSON format
- Containerized: Runs with Docker and Docker Compose
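
To make the multi-language total extraction more concrete, here is a minimal sketch of one way it could work on text that has already been returned by the OCR step. It is not taken from this project's `main.py`: the keyword pattern, the amount pattern, and the names `TOTAL_KEYWORD`, `AMOUNT`, `parse_amount`, and `extract_total` are all assumptions made for illustration.

```python
import re
from typing import Optional

# Hypothetical keyword pattern covering "total"/"totale" (EN, FR, ES, IT)
# and "Gesamt..."/"Summe" (DE). The leading \b keeps "Subtotal" from matching.
TOTAL_KEYWORD = re.compile(r"\b(?:totale?|gesamt\w*|summe)\b", re.IGNORECASE)

# An amount with exactly two decimals, e.g. 12,50 / 8.99 / 1.234,56
AMOUNT = re.compile(r"(?<!\d)\d+(?:[.,]\d{3})*[.,]\d{2}(?!\d)")


def parse_amount(raw: str) -> float:
    """Convert '1.234,56' or '1,234.56' to 1234.56."""
    # The pattern guarantees two decimal digits, so every separator can be
    # dropped and the result read as a number of cents.
    return int(re.sub(r"[.,]", "", raw)) / 100


def extract_total(ocr_text: str) -> Optional[float]:
    """Return the amount on the last line that mentions a total keyword."""
    total = None
    for line in ocr_text.splitlines():
        if TOTAL_KEYWORD.search(line):
            amounts = AMOUNT.findall(line)
            if amounts:
                # Take the right-most amount on the line.
                total = parse_amount(amounts[-1])
    return total
```

With this sketch, `extract_total("Subtotale 10,00\nTOTALE EUR 12,50")` returns `12.5`, and text without a total keyword returns `None`. Preferring the last matching line reflects the assumption that the grand total is printed near the bottom of a receipt.
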
Prerequisites:
- Docker and Docker Compose installed on your system
- At least 2 GB of free RAM (for the OCR models)
- Port 8000 free on the host
- A modern web browser (Chrome, Firefox, Safari, Edge)
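Installation with Docker:
- Clone the repository and enter its directory: git clone <repository-url> && cd ocr-receipt-extractor
- Start the application in the background: docker compose up -d
- Check that the container is running: docker compose ps

Running without Docker:
- Install the dependencies: pip install -r requirements.txt
- Start the server: uvicorn main:app --host 0.0.0.0 --port 8000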
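
Once the server is listening on port 8000, one way to see which endpoints it offers is to read the OpenAPI schema that FastAPI serves at `/openapi.json` by default (interactive documentation is served at `/docs`). The sketch below assumes those defaults have not been disabled in `main.py`; the helper names `list_routes` and `fetch_openapi` are illustrative only.

```python
import json
from urllib.request import urlopen

HTTP_METHODS = {"get", "post", "put", "patch", "delete"}


def list_routes(openapi: dict) -> list[str]:
    """Turn an OpenAPI document into lines such as 'POST /some/path'."""
    routes = []
    for path, operations in sorted(openapi.get("paths", {}).items()):
        for method in sorted(operations):
            # Skip non-method keys such as "parameters" or "summary".
            if method in HTTP_METHODS:
                routes.append(f"{method.upper()} {path}")
    return routes


def fetch_openapi(base_url: str = "http://localhost:8000") -> dict:
    """Download the schema from a running instance (assumes default URL)."""
    with urlopen(f"{base_url}/openapi.json", timeout=5) as response:
        return json.load(response)


# Usage, with the application running locally:
#   for route in list_routes(fetch_openapi()):
#       print(route)
```

The printed list shows the actual upload and result endpoints, so client code does not have to guess their paths.
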
Project structure:
ocr-receipt-extractor/
├── main.py
├── requirements.txt
├── Dockerfile
├── docker-compose.yml
├── templates/
│ └── index.html
├── static/
├── data/
│ ├── media/
│ └── data.json
└── README.md
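
The `data/data.json` file in the layout above is where results are persisted. As a rough sketch of how such persistence might look, the snippet below appends one record to a JSON file. It assumes the file holds a single JSON list; the record fields and the function name `append_result` are hypothetical and may not match what `main.py` actually writes.

```python
import json
from pathlib import Path


def append_result(record: dict, path: Path = Path("data/data.json")) -> list:
    """Append one extraction result to a JSON list stored at `path`."""
    path.parent.mkdir(parents=True, exist_ok=True)
    # Assumption: the file, if present, contains a valid JSON list.
    records = json.loads(path.read_text(encoding="utf-8")) if path.exists() else []
    records.append(record)
    # Write to a temporary file first, then swap it in, so a crash
    # mid-write does not leave a truncated data.json behind.
    tmp = path.with_suffix(".tmp")
    tmp.write_text(json.dumps(records, indent=2, ensure_ascii=False), encoding="utf-8")
    tmp.replace(path)
    return records
```

For example, `append_result({"file": "receipt.jpg", "total": 12.5})` would create `data/data.json` on first use and add to it afterwards. Rewriting the whole file on each call is simple but only suitable for small result sets.
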