Skip to content

Sanmeet007/transcript-chatbot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🎬 Transcript Chatbot

Transcript Chatbot is a lightweight Retrieval-Augmented Generation (RAG) web application that enables users to interact conversationally with YouTube video transcripts.
By fetching, embedding, and retrieving transcript data from a Chroma Vector Store, the system allows users to ask natural questions and receive contextually relevant, human-like responses.


🚀 Key Features

  • 🎥 Transcript Integration: Load and process YouTube video transcripts seamlessly.
  • 💬 Conversational AI: Generates natural, context-aware answers without rigid phrasing.
  • 🧠 RAG Architecture: Combines retrieval and generation for accurate, grounded responses.
  • 💾 Chroma Vector Store: Efficiently stores and retrieves transcript embeddings for fast similarity search.
  • 🌙 Modern UI: Clean, responsive, dark-themed interface with typing indicators.
  • ⚡ Real-Time Interaction: Smooth and fast chat experience powered by FastAPI and Uvicorn.

📸 Screenshots

Screenshot-1 Screenshot-2 Screenshot-3 Screenshot-4

🛠️ Tech Stack

Layer Technology
Frontend HTML, CSS, JavaScript
Backend Python (FastAPI)
AI Layer Gemini / LangChain with text embeddings
Vector Database ChromaDB
Storage In-memory session management
Deployment Uvicorn web server

🧩 System Overview

  1. Session Initialization: Creates a temporary session to manage chat state.
  2. Transcript Loading: Retrieves and processes YouTube transcripts, splitting them into chunks.
  3. Embedding & Storage: Generates embeddings and stores them in Chroma Vector Store.
  4. Question Answering (RAG): On each query, relevant transcript chunks are retrieved from Chroma and passed to the model for answer generation.
  5. Session Termination: Sessions and embeddings are automatically cleared upon exit.

📦 Setup Instructions

1️⃣ Clone the Repository

git clone https://github.com/your-username/transcript-chatbot.git
cd transcript-chatbot

2️⃣ Install Dependencies

pip install -r requirements.txt

3️⃣ Run the Application

uvicorn app:app --reload

4️⃣ Access the Web Interface

Open your browser and navigate to:

http://localhost:8000

🧾 Environment Variables

Create a .env file with the following configuration:

GOOGLE_API_KEY=your_gemini_api_key

About

RAG-based conversational chatbot that lets you chat with YouTube video transcripts. Powered by Gemini, LangChain, and Chroma Vector Store for fast, context-aware, human-like responses.

Topics

Resources

License

Stars

Watchers

Forks

Contributors