Skip to content

Geethanjaliii/DocumindAI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

54 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

DocuMind AI โ€“ Enterprise Document Intelligence & Extraction Platform

DocuMind AI is an AI-powered enterprise document intelligence platform that automates document ingestion, OCR extraction, document classification, duplicate detection, and analytics. The platform transforms unstructured business documents into searchable, actionable intelligence through an interactive dashboard and real-time insights.


๐Ÿš€ Features

๐Ÿ“„ AI-Powered OCR Extraction

  • Extract text and structured fields from invoices, receipts, purchase orders, and business documents
  • High-accuracy OCR pipeline for scanned and digital documents
  • Automated field extraction and metadata generation

๐Ÿง  Intelligent Document Classification

  • Automatically classify uploaded documents into categories
  • AI-assisted extraction and categorization pipeline
  • Supports invoices, receipts, purchase orders, and custom document types

๐Ÿ” Duplicate Detection Engine

  • Detect duplicate invoices and receipts
  • Invoice number matching
  • Vendor similarity detection
  • Duplicate confidence scoring
  • Fraud prevention workflows

๐Ÿ“‘ Document Management Console

  • View processed documents
  • Search and filter documents
  • Metadata management
  • Processing status tracking
  • Document lifecycle management

๐Ÿ“Š Real-Time Analytics Dashboard

  • Field extraction accuracy metrics
  • OCR confidence scores
  • Processing latency monitoring
  • Success ratio tracking
  • Operational insights and reporting

๐Ÿ“ฑ Responsive Enterprise Dashboard

  • Fully responsive UI
  • Mobile, tablet, and desktop support
  • Modern dashboard interface
  • Real-time data visualization

๐Ÿ› ๏ธ Tech Stack

Frontend

  • Framework: React.js + TypeScript + Vite
  • Styling: Tailwind CSS
  • Charts: Recharts
  • State Management: React Hooks & Context API
  • Icons: Lucide React

Backend

  • Framework: FastAPI (Python 3.11)
  • Database: PostgreSQL
  • ORM: SQLAlchemy
  • Authentication: JWT Authentication
  • AI Services: Gemini AI APIs
  • OCR Engine: Document OCR & Information Extraction Pipeline

๐Ÿ“ Architecture Diagram

graph TD
    User[๐Ÿ‘ค User Browser]

    FE[โš›๏ธ React + TypeScript Frontend]
    API[โšก FastAPI Backend]

    Auth[๐Ÿ” JWT Authentication]
    OCR[๐Ÿ“„ OCR Extraction Engine]
    AI[๐Ÿง  Gemini AI Services]
    Dup[๐Ÿ” Duplicate Detection Engine]
    Analytics[๐Ÿ“Š Analytics Service]

    DB[(๐Ÿ˜ PostgreSQL Database)]

    User --> FE
    FE -->|REST API Requests| API

    API --> Auth
    API --> OCR
    API --> AI
    API --> Dup
    API --> Analytics

    OCR --> DB
    AI --> DB
    Dup --> DB
    Analytics --> DB

    API --> DB
    API --> FE
Loading

๐Ÿ”„ Document Processing Flow

sequenceDiagram
    participant U as User
    participant F as React Frontend
    participant A as FastAPI Backend
    participant O as OCR Engine
    participant G as Gemini AI
    participant D as PostgreSQL

    U->>F: Upload Document
    F->>A: POST /documents/upload
    A->>O: Extract Text & Fields
    O-->>A: Structured Data
    A->>G: Classify Document
    G-->>A: Category + Metadata
    A->>D: Store Results
    D-->>A: Persisted Records
    A-->>F: Extraction Results
    F-->>U: Dashboard & Analytics
Loading

๐Ÿš€ Core Modules

๐Ÿ“„ Document Upload

  • Drag-and-drop uploads
  • PDF and image support
  • Batch upload capabilities
  • Upload progress tracking

๐Ÿ“‘ Document Management

  • View processed documents
  • Search and filtering
  • Metadata management
  • Processing status tracking

๐Ÿ” Duplicate Detection

  • Invoice number matching
  • Vendor similarity detection
  • Duplicate confidence scoring
  • Fraud prevention workflows

๐Ÿ“Š Analytics Dashboard

  • Extraction accuracy monitoring
  • OCR confidence tracking
  • Processing latency analysis
  • Success ratio monitoring
  • Operational reporting

๐Ÿ“ธ Application Screenshots

Authentication

Authentication

Dashboard Overview

Dashboard

Document Upload

Document Upload

Documents Management

Documents Management

Duplicate Detection

Duplicate Detection

Analytics Dashboard

Analytics Dashboard


๐Ÿ“ฆ Installation & Configuration

Prerequisites

  • Python 3.11+
  • Node.js 18+
  • PostgreSQL
  • Gemini API Key

1๏ธโƒฃ Backend Setup

cd backend

python -m venv .venv

# Windows
.venv\Scripts\activate

# Linux/Mac
source .venv/bin/activate

pip install -r requirements.txt

Create a .env file:

DATABASE_URL=your_postgresql_connection_string
SECRET_KEY=your_secret_key
GEMINI_API_KEY=your_gemini_api_key
ALGORITHM=HS256
ACCESS_TOKEN_EXPIRE_MINUTES=60

Run the server:

uvicorn app.main:app --reload

Backend URL:

http://127.0.0.1:8000

Swagger Documentation:

http://127.0.0.1:8000/docs

2๏ธโƒฃ Frontend Setup

cd frontend

npm install
npm run dev

Frontend URL:

http://localhost:3000

๐Ÿ“ก API Endpoints

Authentication

  • POST /auth/register
  • POST /auth/login
  • GET /users/me

Document Operations

  • POST /api/v1/documents/upload
  • GET /api/v1/documents
  • GET /api/v1/documents/{id}
  • DELETE /api/v1/documents/{id}

Duplicate Detection

  • GET /api/v1/duplicates
  • POST /api/v1/duplicates/analyze

Analytics

  • GET /api/v1/stats
  • GET /api/v1/search

๐Ÿ›ก๏ธ Environment Variables

Backend (backend/.env)

DATABASE_URL=
SECRET_KEY=
GEMINI_API_KEY=
ALGORITHM=HS256
ACCESS_TOKEN_EXPIRE_MINUTES=60

Frontend (frontend/.env)

VITE_API_URL=http://127.0.0.1:8000

๐ŸŒ Deployment Guide

Backend

docker build -t documind-backend .
docker run -p 8000:8000 documind-backend

Deployment Platforms:

  • Render
  • Railway
  • AWS ECS
  • Google Cloud Run

Frontend

npm run build

Deployment Platforms:

  • Vercel
  • Netlify
  • AWS Amplify

๐Ÿ”ฎ Future Enhancements

  • Role-Based Access Control (RBAC)
  • Vector Search for Semantic Document Retrieval
  • RAG-Powered Document Question Answering
  • Real-Time Processing Queues using Celery and Redis
  • Multi-Tenant Enterprise Workspaces
  • Cloud Storage Integration (AWS S3 / GCS)
  • CI/CD Pipeline with GitHub Actions
  • Docker Compose and Kubernetes Deployment
  • AI-Powered Document Summarization
  • Document Chat Assistant

๐Ÿ“ˆ Project Highlights

  • AI-Powered Enterprise Document Intelligence Platform
  • End-to-End OCR Extraction Pipeline
  • Duplicate Invoice Detection System
  • Real-Time Analytics Dashboard
  • FastAPI + React Full-Stack Architecture
  • Gemini AI Integration
  • Responsive Enterprise UI
  • Production-Ready Modular Architecture

โœ๏ธ Author

Geethanjali V N

GitHub: https://github.com/Geethanjaliii

Project Repository: https://github.com/Geethanjaliii/DocumindAI


๐Ÿ“„ License

This project is licensed under the MIT License.

About

AI-powered document intelligence platform with OCR, document classification, structured extraction, duplicate detection, search, and analytics.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors