This project is a Retrieval-Augmented Generation (RAG) based agent for GST (Goods and Services Tax) analysis. The system is designed to extract, process, and analyze GST-related data from various sources, including PDFs, images, and text inputs. It leverages large language models and advanced document parsing techniques to provide meaningful insights.
- RAG-Based Architecture: Combines retrieval and generation for accurate GST analysis.
- Multi-Modal Input: Supports text, PDFs, and images.
- Advanced OCR & Table Extraction: Extracts structured data from scanned invoices and GST documents.
- Integration with O1-mini Model: Utilizes the O1-mini model for enhanced response generation.
Ensure you have the following dependencies installed:
- Python 3.12.3
- Pip
- Virtual Environment (recommended)
- python -m venv botenv
- source botenv/bin/activate # On Windows, use: botenv\Scripts\activate
- pip install -r requirements.txt
- setup api key in .env (open api key)
- python app.py
- Upload Data:
- Endpoint:
<span>/UploadData</span> - Method:
<span>POST</span> - Supports: Text, PDFs, Images
- Endpoint:
- PDF Processing:
- Uses
<span>unstructured.partition.pdf</span>for structured data extraction. - Extracts tables and text separately.
- Uses
- Image Processing:
- Uses
<span>pytesseract</span>for OCR-based text extraction. - Object detection using
<span>torchvision</span>.
- Uses
- Integration with O1-mini:
- The system leverages the
<span>O1-mini</span>model for intelligent text generation and reasoning over retrieved GST data.
- The system leverages the