🔬 ResearchMate — AI Research Assistant with RAG & LLMs

An AI-powered research assistant that revolutionizes how researchers discover, analyze, and manage academic literature using advanced Retrieval-Augmented Generation (RAG) and large language models.

Tech Stack: Python • FastAPI • Transformers • Groq (LLaMA 3.3 70B) • ChromaDB • RAG • Docker

📖 Built as a solo effort to deepen my understanding of modern NLP stacks — including RAG pipelines, citation graph analysis, and LLM-powered literature review generation.

🎯 Project Overview

ResearchMate is a full-stack research assistant system designed to explore the integration of Retrieval-Augmented Generation (RAG) pipelines, citation graph analysis, and task-specific prompting for academic research support. Built entirely as a solo project, it serves as a practical study in applying LLMs (specifically Groq-hosted LLaMA 3.3 70B) to literature review automation, scientific Q&A, and research trend analysis.

🔍 Motivation

The academic landscape is increasingly characterized by:

Rapid publication velocity, making it difficult to track developments in a domain
Shallow context understanding in traditional keyword-based retrieval tools (e.g., Google Scholar, Semantic Scholar)
Disjointed workflows, where search, summarization, and citation management are siloed

This project aims to build an integrated system where these capabilities are unified using RAG + vector search + LLM-based synthesis, offering a more coherent and semantically rich research workflow.

⚙️ Technical Goals

Implement a custom RAG pipeline for semantic search and summarization over paper corpora
Develop project-based research management, enabling storage and recall of paper sets by topic
Use ChromaDB for document vector storage, embedding papers with Sentence Transformers
Perform citation network analysis, extracting structured citation graphs from uploaded PDFs
Run LLM inference using Groq Cloud (LLaMA 3.3 70B) for high-throughput, low-latency generation
Support multi-turn question-answering, trend detection, and review generation over paper clusters
Enable upload and PDF parsing, converting documents into extractive + abstractive summaries

🔬 Learning Focus

This project was built for the purpose of:

Deepening my understanding of LLM application architectures, especially RAG
Experimenting with embedding-based search, hybrid pipelines, and LLM prompting
Building robust backends using FastAPI, integrating with frontend templates and API routes
Handling real-world constraints like low-resource deployment, cold-start initialization, and secure multi-user access

🧠 Core Technologies & Architecture

Retrieval-Augmented Generation (RAG) System

ResearchMate implements a sophisticated RAG pipeline that combines retrieval mechanisms with generative AI:

Query → Embedding → Vector Search → Context Retrieval → LLM Generation → Response

Key Components:

Document Processing Pipeline
- PDF text extraction with advanced cleaning
- Chunking strategies for optimal retrieval
- Metadata extraction (authors, titles, citations)
Vector Database (ChromaDB)
- Semantic embeddings for research papers
- Efficient similarity search
- Persistent storage for knowledge bases
Language Model Integration (Groq Llama 3.3 70B)
- High-performance inference
- Context-aware response generation
- Multi-turn conversation support
Retrieval Engine
- Semantic search capabilities
- Contextual ranking algorithms
- Multi-modal retrieval (text, metadata, citations)

Technical Stack

AI & Machine Learning

LLM: Groq Llama 3.3 70B (ultra-fast inference)
Embeddings: Sentence Transformers for semantic search
Vector Database: ChromaDB for efficient similarity search
RAG Framework: Custom implementation with advanced retrieval strategies

Backend

Framework: FastAPI (high-performance async Python web framework)
Authentication: JWT-based secure user management
PDF Processing: PyMuPDF, pdfplumber, pypdf for robust text extraction
Data Storage: JSON files for user data, ChromaDB for vector storage

Frontend

Framework: Vanilla JavaScript with modern ES6+ features
UI Library: Bootstrap 5 for responsive design
Visualization: Chart.js for research analytics and trends
Icons: Font Awesome for consistent iconography

Development & Infrastructure

Development Server: Custom server with hot-reload capabilities
Containerization: Docker and Docker Compose
Deployment: Azure App Service with environment-based configuration
Monitoring: Health checks and comprehensive logging

✨ Features & Capabilities

🔍 Intelligent Paper Search

Natural Language Queries: Search using conversational language
Semantic Understanding: Goes beyond keyword matching
Multi-source Integration: Searches across multiple academic databases
Real-time Results: Fast, responsive search experience

🧠 AI-Powered Analysis

Document Summarization: Generate concise summaries of research papers
Key Insight Extraction: Identify main contributions and findings
Comparative Analysis: Compare multiple papers and methodologies
Question Answering: Get specific answers from research content

📚 Project Management

Research Projects: Organize papers into themed collections
Literature Reviews: Automatically generate comprehensive reviews
Knowledge Graphs: Visualize connections between research areas
Progress Tracking: Monitor research milestones and discoveries

📊 Citation Network Analysis

Reference Mapping: Visualize citation relationships
Impact Analysis: Assess paper influence and importance
Research Lineage: Track the evolution of research ideas
Collaboration Networks: Identify key researchers and institutions

📈 Research Trend Monitoring

Trend Detection: Identify emerging research areas
Temporal Analysis: Track research evolution over time
Predictive Insights: Forecast future research directions
Comparative Studies: Analyze trends across different fields

📄 Advanced PDF Processing

Text Extraction: High-quality text extraction from academic PDFs
Structure Recognition: Identify sections, abstracts, references
Metadata Extraction: Extract author information, publication details
Content Cleaning: Remove formatting artifacts and noise

🔧 RAG System Deep Dive

Architecture Overview

┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   User Query    │───▶│   Query Engine  │───▶│   Embeddings    │
└─────────────────┘    └─────────────────┘    └─────────────────┘
                                                        │
┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   Generated     │◀───│   LLM Engine    │◀───│  Vector Search  │
│   Response      │    │  (Groq Llama)   │    │   (ChromaDB)    │
└─────────────────┘    └─────────────────┘    └─────────────────┘
                                │                        │
                       ┌─────────────────┐    ┌─────────────────┐
                       │   Context       │◀───│   Document      │
                       │   Builder       │    │   Retriever     │
                       └─────────────────┘    └─────────────────┘

Key Components

1. Document Processing (`pdf_processor.py`)

Multi-library PDF extraction for robust text extraction
Content cleaning to remove formatting artifacts
Intelligent chunking with overlap for context preservation
Metadata extraction for enhanced search capabilities

2. Vector Storage (`rag_system.py`)

ChromaDB integration for efficient vector operations
Semantic embeddings using Sentence Transformers
Persistent storage for knowledge base persistence
Similarity search with configurable parameters

3. Query Processing (`groq_processor.py`)

Query understanding and intent recognition
Context retrieval from vector database
Prompt engineering for optimal LLM performance
Response generation with citation tracking

4. Research Assistant (`research_assistant.py`)

Multi-turn conversations with context awareness
Project-based knowledge management
Literature review generation
Trend analysis and insights

RAG Implementation Details

Chunking Strategy

# Configurable chunking parameters
CHUNK_SIZE = 1000        # Characters per chunk
CHUNK_OVERLAP = 200      # Overlap between chunks

Retrieval Process

Query Embedding: Convert user query to vector representation
Similarity Search: Find most relevant document chunks
Context Assembly: Combine retrieved chunks with metadata
Response Generation: Generate answer using retrieved context

Context Management

Conversation history for multi-turn interactions
Project context for domain-specific responses
Citation tracking for source attribution
Relevance scoring for answer quality

🚀 Quick Start

Prerequisites

Python 3.11+
Groq API key (Get one here)
4GB+ RAM recommended for optimal performance

Installation

Clone the repository

git clone https://github.com/ananthakr1shnan/ResearchMate.git
cd ResearchMate

Set up virtual environment

python -m venv venv
# Windows
venv\Scripts\activate
# macOS/Linux
source venv/bin/activate

Install dependencies

pip install -r requirements.txt

Configure environment variables Create a .env file in the root directory:

GROQ_API_KEY=your_groq_api_key_here

Run the application

# Development server (recommended)
python src/scripts/dev_server.py

# Or basic server
python main.py

Access the application

Web Interface: http://127.0.0.1:8000
API Documentation: http://127.0.0.1:8000/docs
Interactive API: http://127.0.0.1:8000/redoc

🛠️ Development Workflow

Development Server

The development server provides a rich development experience:

# Start development server
python src/scripts/dev_server.py

# Custom configuration
python src/scripts/dev_server.py --host 0.0.0.0 --port 8080 --no-browser

# Run with code quality checks
python src/scripts/dev_server.py --lint

# Run tests
python src/scripts/dev_server.py --test

Development server features:

✅ Automatic port management
✅ File change detection
✅ Browser auto-launch
✅ Development-friendly logging
✅ Same codebase as production

Management System

Use the comprehensive management system:

# System status
python src/scripts/manager.py status

# Dependency management
python src/scripts/manager.py install

# Server management
python src/scripts/manager.py dev      # Development
python src/scripts/manager.py start    # Production

# Data management
python src/scripts/manager.py backup
python src/scripts/manager.py restore --backup-name backup_20250713_120000
python src/scripts/manager.py list-backups

# Maintenance
python src/scripts/manager.py clean-logs
python src/scripts/manager.py reset-db

🚀 Deployment

🔗 Live Demo

Access the deployed ResearchMate app here:
🌐 ResearchMate on Azure
Hosted via Azure App Service (Canada Central region)

Docker Deployment (Recommended)

# Build and run
docker-compose up --build

# Production deployment
docker-compose -f docker-compose.prod.yml up -d

Traditional Server Deployment

# Install dependencies
pip install -r requirements.txt

# Set environment variables
export GROQ_API_KEY=your_key_here

# Run production server
python main.py

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
src		src
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

🔬 ResearchMate — AI Research Assistant with RAG & LLMs

🎯 Project Overview

🔍 Motivation

⚙️ Technical Goals

🔬 Learning Focus

🧠 Core Technologies & Architecture

Retrieval-Augmented Generation (RAG) System

Technical Stack

AI & Machine Learning

Backend

Frontend

Development & Infrastructure

✨ Features & Capabilities

🔍 Intelligent Paper Search

🧠 AI-Powered Analysis

📚 Project Management

📊 Citation Network Analysis

📈 Research Trend Monitoring

📄 Advanced PDF Processing

🔧 RAG System Deep Dive

Architecture Overview

Key Components

1. Document Processing (pdf_processor.py)

2. Vector Storage (rag_system.py)

3. Query Processing (groq_processor.py)

4. Research Assistant (research_assistant.py)

RAG Implementation Details

Chunking Strategy

Retrieval Process

Context Management

🚀 Quick Start

Prerequisites

Installation

🛠️ Development Workflow

Development Server

Management System

🚀 Deployment

🔗 Live Demo

Docker Deployment (Recommended)

Traditional Server Deployment

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. Document Processing (`pdf_processor.py`)

2. Vector Storage (`rag_system.py`)

3. Query Processing (`groq_processor.py`)

4. Research Assistant (`research_assistant.py`)

Packages