MNIST Deep Learning Model Comparison

Overview

This project implements and evaluates multiple deep learning architectures on the MNIST handwritten digit dataset using PyTorch. The goal was to compare neural network architectures and optimizers while benchmarking against a classical machine learning baseline.

Models Implemented

Multi-Layer Perceptron (MLP)
- ReLU + Adam
- Sigmoid + Adam
- ReLU + SGD
- 5-layer MLP (Best MLP)
Convolutional Neural Networks (CNN)
- Custom 4-layer CNN (Best Overall Model)
- LeNet-5
Classical Baseline
- XGBoost (from previous assignment)

Dataset

MNIST (28x28 grayscale handwritten digits)
60,000 training samples
10,000 test samples

Performance Comparison

Model	Accuracy	Training Time
XGBoost	97.2%	30s
Best MLP	98.2%	65s
LeNet-5	98.3%	40s
4-layer CNN	98.6%	55s

Key Insights

CNNs significantly outperform classical ML models on image data.
ReLU activation consistently outperformed Sigmoid.
Adam optimizer converged faster and achieved higher accuracy than SGD.
Deeper MLP improved performance but CNN architectures were superior overall.

Technologies Used

Python
PyTorch
NumPy
Matplotlib
Scikit-learn

How to Run

Install dependencies: pip install torch torchvision numpy matplotlib scikit-learn
Run notebooks:

MLP.ipynb
CNN.ipynb

Conclusion

The 4-layer CNN achieved the best overall performance, demonstrating the effectiveness of convolutional architectures for image-based tasks.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
CNN.ipynb		CNN.ipynb
MLP.ipynb		MLP.ipynb
PCA.ipynb		PCA.ipynb
README.md		README.md
Report.pdf		Report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MNIST Deep Learning Model Comparison

Overview

Models Implemented

Dataset

Performance Comparison

Key Insights

Technologies Used

How to Run

Conclusion

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MNIST Deep Learning Model Comparison

Overview

Models Implemented

Dataset

Performance Comparison

Key Insights

Technologies Used

How to Run

Conclusion

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages