📄 DocQuery AI

DocQuery AI is an intelligent, session-isolated RAG (Retrieval-Augmented Generation) chatbot application designed to transform how you interact with PDF documents. Instead of searching by keywords or scrolling through massive files, you can converse directly with your documents in real time.

Built with Streamlit, LangChain, Chroma DB, HuggingFace, and Mistral AI, it processes documents locally, indexes content into a fresh vector store per session, and enables instant semantic retrieval and answering.

📸 Interface Preview

🚀 Key Features

📂 Instant Document Processing: Upload any PDF through a clean sidebar layout to process and chunk it on the fly.
🔄 Session-Isolated Database: Uses an in-memory instance of Chroma DB. Every session is fresh, isolated, and completely private—no documents are persisted on disk unless configured.
🧠 Advanced Semantic Retrieval: Implements HuggingFace embeddings combined with Maximal Marginal Relevance (MMR) retrieval to extract the most relevant contexts while keeping content diverse.
💬 Intelligent QA Chatbot: Leverages Mistral AI's mistral-small-latest language model to summarize, synthesize, and answer questions accurately.
🧹 Single-Click Reset: Clean up your chat history and memory instantly using the "Clear Session" option.

🛠️ Tech Stack

Frontend & UI: Streamlit
LLM Engine: Mistral AI API
RAG Framework: LangChain
Vector Database: Chroma DB (In-memory)
Embeddings: HuggingFace Embeddings (sentence-transformers)

💻 Setup & Installation

Prerequisites

Python 3.10 or higher
Mistral AI API Key (Get one from Mistral Console)

1. Clone the Repository

git clone https://github.com/coderashhar/Document.AI.git
cd Document.AI

2. Configure Environment Variables

Create a .env file in the root directory and add your Mistral API key:

MISTRAL_API_KEY=your_mistral_api_key_here

3. Setup Virtual Environment

Create and activate your python virtual environment:

python3 -m venv .venv
source .venv/bin/activate  # On macOS/Linux
# .venv\Scripts\activate   # On Windows

4. Install Dependencies

pip install -r requirements.txt

5. Launch the Application

streamlit run app.py

📖 Usage Guide

Open your browser and navigate to the local address provided by Streamlit (typically http://localhost:8502).
Upload a PDF document in the left sidebar.
Once the green success banner appears ("Document processed successfully!"), type your question into the chat input box at the bottom.
Get concise, context-aware answers directly extracted from your document.
Hit Clear Session if you want to upload a different PDF and start a new conversation.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
document loaders		document loaders
retrievers		retrievers
vector store		vector store
.gitignore		.gitignore
README.md		README.md
Screenshot.png		Screenshot.png
app.py		app.py
create_database.py		create_database.py
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📄 DocQuery AI

📸 Interface Preview

🚀 Key Features

🛠️ Tech Stack

💻 Setup & Installation

Prerequisites

1. Clone the Repository

2. Configure Environment Variables

3. Setup Virtual Environment

4. Install Dependencies

5. Launch the Application

📖 Usage Guide

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📄 DocQuery AI

📸 Interface Preview

🚀 Key Features

🛠️ Tech Stack

💻 Setup & Installation

Prerequisites

1. Clone the Repository

2. Configure Environment Variables

3. Setup Virtual Environment

4. Install Dependencies

5. Launch the Application

📖 Usage Guide

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages