🎬 Transformer-Based Movie Recommendation System

A sequence-aware collaborative filtering system that uses a Transformer architecture to predict the next movie a user is likely to watch.

Instead of predicting ratings (traditional approach), this system models user behavior as a sequence prediction problem, similar to how GPT models generate text.

🚀 Why This Project Matters

Traditional recommender systems (KNN, SVD):

Ignore sequence order
Treat interactions as static

This system:

Learns watch patterns over time
Captures contextual relationships between movies
Applies LLM-style learning to recommendation systems

🧠 Core Idea

Reframing recommendation as next-token prediction:

Movies → Tokens
User history → Sequence
Next movie → Prediction

Example:

[Movie₁, Movie₂, Movie₃] → Movie₄

This is the same learning paradigm used in GPT models.

🏗️ System Pipeline

Data Ingestion
- Netflix Prize Dataset (~100M ratings, sampled)
Preprocessing
- Clean and parse raw data
- Merge ratings with movie metadata
Filtering
- Keep only positive interactions (rating ≥ 4)
Sequence Construction
- Convert user histories into sequential training samples
Tokenization
- Map movie IDs → integer tokens
Dataset Preparation
- Padding and batching for fixed-length input
Model
- Transformer Encoder with multi-head self-attention
Prediction
- Outputs probability distribution over all movies

🤖 Model Architecture

Embedding Layer: Converts movie IDs into dense vectors
Transformer Encoder:
- Multi-head self-attention
- Captures relationships across watched movies
Output Layer:
- Fully connected layer for next-movie prediction

🧪 Training Details

Loss Function: Cross-Entropy
Optimizer: Adam
Objective: Maximize likelihood of correct next movie

📊 Example Recommendations

Input sequence:

Reservoir Dogs
Dogma
Lilo & Stitch

Predicted next movies:

North by Northwest
The Deer Hunter
Chasing Amy

⚖️ Comparison with Traditional Methods

Method	Sequence Awareness	Context Understanding
KNN	❌ No	❌ Limited
SVD	❌ No	❌ Limited
Transformer (This Work)	✅ Yes	✅ Strong

🧠 Key Learnings

How Transformers generalize beyond NLP
Importance of sequence modeling in recommendations
Role of attention in capturing user behavior
Data pipeline design for large-scale sequential systems

⚠️ Limitations

Trained on sampled dataset (not full scale)
No hyperparameter optimization
Cold-start problem not addressed

🚧 Future Improvements

Incorporate temporal embeddings
Hybrid model (content + collaborative)
Fine-tune with larger dataset
Deploy as real-time recommendation API

🎥 Demo

https://drive.google.com/file/d/1zblcSgEyVbHYxe5F_LK7qHxMqzkp1MwW/view?usp=sharing

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Collaborative_Movie_Recommendation_System.ipynb		Collaborative_Movie_Recommendation_System.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎬 Transformer-Based Movie Recommendation System

🚀 Why This Project Matters

🧠 Core Idea

🏗️ System Pipeline

🤖 Model Architecture

🧪 Training Details

📊 Example Recommendations

⚖️ Comparison with Traditional Methods

🧠 Key Learnings

⚠️ Limitations

🚧 Future Improvements

🎥 Demo

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎬 Transformer-Based Movie Recommendation System

🚀 Why This Project Matters

🧠 Core Idea

🏗️ System Pipeline

🤖 Model Architecture

🧪 Training Details

📊 Example Recommendations

⚖️ Comparison with Traditional Methods

🧠 Key Learnings

⚠️ Limitations

🚧 Future Improvements

🎥 Demo

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages