Engineering intelligent systems where ideas connect like synapses to create powerful AI-driven solutions.
Scrybe is a breakthrough AI-driven application designed to evaluate video and audio responses against reference answers intelligently. By extracting media, performing high-fidelity transcription, generating frame-by-frame visual insights, and verifying semantic similarity, Scrybe's advanced ML pipeline gives instant, actionable feedback and highly predictable evaluation scores.
Whether you're practicing interview answers, pitching presentations, or assessing spoken assignments, Scrybe acts as your top-tier AI-powered evaluator.
We utilized powerful AI tools in the development and creation of this website:
- Antigravity (Google): Utilized for deep system integration, rapid feature implementation, and extensive debugging across the full stack.
- ChatGPT: Leveraged for brainstorming architectural ideas, system design, and establishing the optimal application workflow.
- Upload & Input: The user uploads a video/audio file and provides a reference/expected answer through the React frontend interface.
- API Communication: The file and text are securely transmitted to the FastAPI backend via the
/evaluateendpoint. - Audio & Frame Extraction:
- Uses
FFmpegto extract audio for transcription. - Uses
ffmpeg-pythonto perform backend frame generation, extracting a representative video frame.
- Uses
- Speech-to-Text Transcription: The
OpenAI Whispermodel converts the captured audio into highly accurate text. - Frame Analysis & Visual Comparison:The captured frame is compared with the reference answer using
Google Gemini Vision, providing constructive feedback on visual presence and body language relative to the topic. - Advanced Similarity Refinement (
similarity_model.py):
-Upgraded to train/calculate at higher score predictability usingall-mpnet-base-v2and a hybrid metric approach. It measures both abstract semantic similarity and exact keyword overlap, making scoring extremely reliable. - Feedback Generation: LLMs (
Google Generative AI/OpenAI) evaluate the transcript to generate descriptive structural summaries. - Results Delivery: The comprehensive evaluation, transcript, visual feedback, similarity scores, and grade are intuitively displayed to the user via the Vite/React frontend.
- Core Framework: React 19
- Build Tool: Vite
- Styling: Standard CSS (Glassmorphism & Gradients)
- Other Tools: jsPDF (for exporting Evaluation Reports)
- Core Framework: FastAPI (Python)
- Server: Uvicorn
- Speech-to-Text: OpenAI Whisper
- Semantic Evaluation: Sentence-Transformers (
all-mpnet-base-v2) - Generative & Vision Feedback:
google-generativeai(Gemini),openai - Core ML Libraries:
torch,numpy - Media Processing:
ffmpeg-pythonfor Audio & Frame extraction
The-Synapse-Squad/
│
├── Backend/ # Python FastAPI server and ML Pipeline
│ ├── app.py # Main application entry point & API routes
│ ├── requirements.txt # Python dependencies definitions
│ ├── models/
│ │ └── similarity_model.py # Refined hybrid similarity evaluator
│ ├── pipeline/ # Core processing pipeline modules
│ │ ├── extract_audio.py # Audio separation logic
│ │ ├── extract_frames.py# Video frame generation logic
│ │ ├── speech_to_text.py# Whisper transcription
│ │ ├── text_cleaning.py # Formatting and cleaning
│ │ └── evaluator.py # Main orchestration pipeline
│ └── utils/ # Helper ML utilities
│ ├── frame_analyzer.py# Visual comparison using Gemini
│ ├── feedback_generator.py # LLM integrations for descriptive feedback
│ ├── score_calculator.py # Statistical/Math calculation helpers
│ └── summarizer.py # Response digestion and summarization
│
└── Frontend/ # React Application User Interface
├── package.json # Node dependencies and project scripts
├── vite.config.js # Vite specifications
├── index.html # Main entry HTML file
└── src/ # Source code (Components, Pages, Styles)
- Node.js: v18+ for the Vite frontend environment.
- Python: v3.8+ for the backend configuration.
- FFmpeg: Must be installed on your system to extract media/frames.
- Navigate to the
Backenddirectory:cd Backend - Create and activate a Virtual Environment (Optional but recommended).
- Install dependencies:
pip install -r requirements.txt
- Define your environment variables in a
.envfile (OPENAI_API_KEY,GEMINI_API_KEY). - Run the server:
The server runs locally at
uvicorn app:app --reload
http://localhost:8000.
- Navigate to the
Frontenddirectory:cd Frontend - Install dependencies:
npm install
- Boot up the Vite dev server:
npm run dev
Built with ❤️ by The-Synapse-Squad