Virtual Try-On System

An AI-powered virtual try-on application that allows users to visualize how clothing items would look on them using advanced diffusion models.

🌟 Features

AI-Powered Try-On: Uses Stable Diffusion and Dreambooth models to generate realistic clothing overlays
Web Application: User-friendly Next.js interface for uploading selfies and outfit images
Chrome Extension: Browser extension to capture clothing images from e-commerce websites
Email Notifications: Receive results via email after processing
Image Processing Pipeline: Automated image captioning, cropping, and augmentation
RESTful API: Django-based backend with API endpoints for seamless integration

🏗️ Architecture

The project consists of four main components:

virtual-try-on-team/
├── frontend/           # Next.js web application
├── backend/            # Django REST API
├── chrome-plugin/      # Chrome extension
├── diffusion/          # Diffusion model training scripts
├── diffusion_improved/ # Enhanced diffusion implementation
└── diffusion_optimization/ # Performance optimization techniques

🛠️ Technology Stack

Frontend

Framework: Next.js 14
Language: TypeScript
Styling: Tailwind CSS
HTTP Client: Axios
UI: React 18

Backend

Framework: Django 4.1
API: Django Ninja
Database: SQLite (development)
Storage: Local file storage (configurable for AWS S3)
CORS: django-cors-headers

AI/ML

Model: Stable Diffusion with Dreambooth fine-tuning
Image Processing: BLIP captioning
Framework: PyTorch
Optimization: Mixed precision training, gradient checkpointing

Chrome Extension

Manifest Version: 3
Permissions: Storage, Active Tab, Scripting

📋 Prerequisites

Node.js (v18 or higher)
Python (v3.8 or higher)
pip and virtualenv
Chrome Browser (for extension testing)
CUDA-capable GPU (recommended for diffusion model training)

🚀 Installation

1. Clone the Repository

git clone https://github.com/jeesunikim/virtual-try-on-team.git
cd virtual-try-on-team

2. Frontend Setup

cd frontend
npm install
cp .env.example .env.local  # Configure environment variables
npm run dev

The frontend will be available at http://localhost:3000

3. Backend Setup

cd backend
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt

# Create .env file with required variables
cat > .env << EOF
SECRET_KEY=your-secret-key-here
DEBUG=True
EOF

# Run migrations
python manage.py makemigrations
python manage.py migrate

# Create superuser (optional)
python manage.py createsuperuser

# Start the development server
python manage.py runserver

The backend API will be available at http://127.0.0.1:8000

4. Chrome Extension Setup

Open Chrome and navigate to chrome://extensions/
Enable "Developer mode" in the top right
Click "Load unpacked"
Select the chrome-plugin directory from the project
The GetDressed extension will now appear in your browser

5. Diffusion Model Setup

cd diffusion

# Install dependencies
pip install diffusers transformers accelerate torch torchvision

# Run training scripts (adjust paths as needed)
bash identity_run.sh    # Train identity model
bash clothes_run.sh     # Train clothing overlay model
bash transforms.sh      # Process and augment images

📖 Usage

Web Application

Navigate to http://localhost:3000
Upload a body image (clear photo of yourself)
Upload an outfit/clothing item image
Enter your email address
Submit and wait for processing
Receive results via email or in the interface

API Endpoints

Try-On Endpoint

POST /api/try_on_outfit/{email}
Content-Type: multipart/form-data

Parameters:
- email: User's email address
- selfie: Image file (user's photo)
- outfit: Image file (clothing item)

Response:
{
  "message": "This is what it looks like"
}

🔬 How It Works

Diffusion Model Pipeline

User Input: Upload 12 pictures of yourself for personalized model training
Dreambooth Training: Fine-tune a Stable Diffusion model on your images
Mask Generation: Create or generate masks defining where clothing appears
Inpainting: Apply clothing overlay using the trained diffusion model
Refinement: Iteratively improve output through the denoising process

Key Parameters

Training steps: 1000
Learning rate: 5e-6
Sampling: DDIM (fewer steps, faster inference)
Precision: Mixed FP16/FP32 for optimal performance

Image Processing

BLIP Captioning: Automatic image description generation
Crop & Augment: Preprocessing for consistent input dimensions
Mask Application: Guide model for realistic clothing placement

🎯 Performance Optimization

The project implements several optimization techniques:

Mixed Precision Training: Using PyTorch AMP for faster computation
Gradient Checkpointing: Reduced memory usage for larger batches
Efficient Sampling: DDIM and PFGM for fewer diffusion steps
Flash Attention: Memory-efficient attention mechanisms
Dynamic Batching: Minimized GPU idle time

📁 Project Structure

virtual-try-on-team/
├── frontend/
│   ├── app/                 # Next.js app directory
│   ├── src/
│   │   ├── components/      # React components
│   │   └── helpers/         # Utility functions
│   ├── public/              # Static assets
│   └── package.json
├── backend/
│   ├── django_project/      # Django settings
│   ├── try_on/              # Main app
│   │   ├── api.py          # API endpoints
│   │   ├── models.py       # Database models
│   │   └── schemas.py      # Request/response schemas
│   ├── emails/              # Email functionality
│   ├── media/               # Uploaded files
│   └── manage.py
├── chrome-plugin/
│   ├── manifest.json        # Extension configuration
│   ├── popup/               # Extension UI
│   ├── content-script.js    # Page interaction
│   └── background.js        # Background processes
├── diffusion/
│   ├── captioning/          # BLIP image captioning
│   ├── transforms/          # Image preprocessing
│   ├── clothes_run.sh       # Clothing model training
│   └── identity_run.sh      # Identity model training
├── diffusion_improved/      # Enhanced model implementation
└── diffusion_optimization/  # Performance improvements

🗄️ Database Models

User Model

Email (unique identifier)
Extended from Django's AbstractUser

Try Model

User (foreign key)
Selfie image
Outfit image
Timestamps

🔐 Environment Variables

Backend (.env)

SECRET_KEY=your-django-secret-key
DEBUG=True
# Optional AWS S3 configuration
# AWS_ACCESS_KEY_ID=your-aws-access-key
# AWS_SECRET_ACCESS_KEY=your-aws-secret-key
# AWS_STORAGE_BUCKET_NAME=your-bucket-name

Frontend (.env.local)

NEXT_PUBLIC_API_URL=http://127.0.0.1:8000

📚 References

🐛 Known Issues

ML model integration is partially implemented (placeholder functions)
Email notification system needs configuration
AWS S3 storage is commented out (local storage used by default)
Training requires significant computational resources

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
backend		backend
chrome-plugin		chrome-plugin
diffusion		diffusion
frontend		frontend
.gitignore		.gitignore
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Virtual Try-On System

🌟 Features

🏗️ Architecture

🛠️ Technology Stack

Frontend

Backend

AI/ML

Chrome Extension

📋 Prerequisites

🚀 Installation

1. Clone the Repository

2. Frontend Setup

3. Backend Setup

4. Chrome Extension Setup

5. Diffusion Model Setup

📖 Usage

Web Application

API Endpoints

Try-On Endpoint

🔬 How It Works

Diffusion Model Pipeline

Key Parameters

Image Processing

🎯 Performance Optimization

📁 Project Structure

🗄️ Database Models

User Model

Try Model

🔐 Environment Variables

Backend (.env)

Frontend (.env.local)

📚 References

🐛 Known Issues

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages