Language Analysis of Text Retell Networks
A desktop application for audio transcription, document comparison, and linguistic network analysis using local Whisper AI models.
- Transcribe audio files using OpenAI's Whisper model
- 100% local processing - no cloud APIs required
- Multiple model options (tiny, base, small, medium, large)
- Support for various audio formats (MP3, WAV, M4A, FLAC, OGG, etc.)
- Generate co-occurrence matrices from transcriptions
- Customizable window sizes and minimum frequency thresholds
- Visual heatmaps and relationship graphs
- Compute graph-theoretic metrics:
- Modularity (community structure)
- Entropy (information content)
- Global efficiency (connectivity)
- Diameter (network size)
- Average shortest path length
- Analyze linguistic network properties
- Compare multiple transcription documents
- Side-by-side visualization
- Highlight differences and similarities
- Word relationship networks
- Co-occurrence heatmaps
- Network graph visualizations
- Download the installer for your platform from Releases
- Install following platform-specific instructions
- Install dependencies (Python, ffmpeg, Python packages)
- Launch the application
See INSTALLATION.md for detailed installation instructions.
- OS: macOS 10.15+, Windows 10+, or Linux (Ubuntu 20.04+)
- RAM: 4GB minimum, 8GB recommended
- Disk: 2GB free space
- Python: 3.8 or higher
- ffmpeg: Required for audio processing
# Clone the repository
git clone <repository-url>
cd ebrltranscriptionUI
# Install dependencies
npm install
cd backend && pip3 install -r requirements.txt && cd ..
# Start development servers
./start.shThe application will be available at:
- Frontend: http://localhost:5173
- Backend API: http://127.0.0.1:5000
ebrltranscriptionUI/
βββ src/ # React frontend source
β βββ components/ # UI components
β βββ pages/ # Page components
β βββ services/ # API services
β βββ utils/ # Utility functions
βββ backend/ # Python Flask backend
β βββ server.py # Main server file
β βββ requirements.txt # Python dependencies
β βββ build_backend.py # Backend build script
βββ electron/ # Electron main process
β βββ main.cjs # Main process entry
β βββ preload.cjs # Preload script
βββ public/ # Static assets
βββ dist/ # Built frontend (generated)
See PACKAGING.md for detailed packaging instructions.
Quick build:
# Build for current platform
npm run package:mac # macOS
npm run package:win # Windows
npm run package:linux # Linux
# Build for all platforms
npm run package:allnpm run dev- Start Vite development servernpm run build- Build frontend for productionnpm run backend:build- Build backend executablenpm run electron:dev- Run Electron in development modenpm run electron:build:mac- Build macOS appnpm run electron:build:win- Build Windows installernpm run electron:build:linux- Build Linux AppImage./start.sh- Start both frontend and backend for development
- React 18 - UI framework
- TypeScript - Type safety
- Vite - Build tool
- TailwindCSS - Styling
- Radix UI - Component primitives
- React Query - Data fetching
- React Router - Navigation
- Recharts - Data visualization
- Python 3 - Runtime
- Flask - Web framework
- OpenAI Whisper - Speech recognition
- Pandas - Data processing
- NumPy - Numerical computing
- NetworkX - Graph analysis
- Electron 28 - Desktop framework
- electron-builder - Packaging
LANTERN is designed with privacy in mind:
- β 100% Local Processing: All transcription happens on your machine
- β No Cloud APIs: No data sent to external servers
- β No Internet Required: Works completely offline (after initial install)
- β Your Data Stays Yours: Files never leave your computer
- Research: Analyze interview transcriptions, focus groups, oral histories
- Education: Study language patterns, vocabulary usage, speech analysis
- Accessibility: Generate transcripts for audio/video content
- Content Creation: Transcribe podcasts, lectures, meetings
- Linguistics: Study word relationships, network structures, language patterns
Copyright Β© 2025 Education and Brain Sciences Research Lab, Vanderbilt University
[License details here]
Developed by: Armaan Parikh Organization: Education and Brain Sciences Research Lab (EBRL), Vanderbilt University Contact: armaan.c.parikh@vanderbilt.edu
- OpenAI Whisper - Speech recognition
- Electron - Desktop framework
- React - UI framework
- Flask - Backend framework
- NetworkX - Graph analysis
For issues, questions, or feature requests:
- Email: armaan.c.parikh@vanderbilt.edu
- Issues: GitHub Issues
- Documentation: See INSTALLATION.md and PACKAGING.md
- Additional Whisper model options
- Batch processing for multiple files
- Export results to various formats (PDF, CSV, JSON)
- Custom vocabulary/language models
- Multi-language support
- Advanced network visualization options
- Plugin system for custom analysis
Contributions are welcome! Please:
- Fork the repository
- Create a feature branch
- Make your changes
- Submit a pull request
- Initial release
- Local Whisper transcription
- Co-occurrence matrix generation
- Network metrics computation
- Document comparison
- Electron desktop application
- macOS, Windows, Linux support
LANTERN - Illuminating language networks through local AI analysis