🛰️ Geospatial Land Classification: CNN & ViT Hybrid Study

Geospatial Land Classification Study

📋 Project Overview

The capstone project, completed as part of the IBM Deep Learning Professional Certificate, focuses on building an advanced land classification system for agricultural applications using satellite imagery.

The project simulates an AI Engineer role at a fertilizer company, where the core objective is to develop and rigorously compare state-of-the-art deep learning models for accurately classifying terrain (e.g., crops, forests, water bodies). The entire deep learning pipeline was implemented—from custom geospatial data handling to comparative model analysis—showcasing expertise across leading deep learning frameworks.

Final Results

Robust Deep Learning Model Development

CNN Implementation: Developed and trained independent CNN models using both Keras and PyTorch to solve the land classification problem.

Vision Transformer Integration: Designed and implemented a hybrid deep learning model by integrating features from pre-trained CNNs and Vision Transformers. The entire combined architecture was then fine-tuned to optimize performance for the agricultural land classification task.

Comparative Analysis: Conducted a comprehensive comparative study of CNNs and Hybrid CNN-Vision Transformer performance across the two major frameworks.
Full-Cycle Deep Learning Pipeline

Data Handling: Implemented efficient techniques for geospatial image dataset loading and applied custom data augmentation strategies in both Keras and PyTorch.

Model Evaluation: Rigorously evaluated all models using a suite of quantitative metrics, including F1-score and AU-ROC, to ensure robust and reliable performance for a real-world application.

📂 Repository Contents

The research is organized into three distinct phases, each contained within its own subfolder featuring a localized README for specific implementation details:

🚀 Research Pipeline & Notebooks

#	Phase	Technical Focus	Colab Access
01	Data Engineering	Memory-Based vs. Generator-Based Ingestion	Launch 🚀
02	Data Engineering	Scalable Augmentation Strategies (TensorFlow)	Launch 🚀
03	Data Engineering	Torchvision Pipeline & Tensor Transformations	Launch 🚀
04	CNN Development	Keras Convolutional Baseline & Optimization	Launch 🚀
05	CNN Development	PyTorch Implementation & State Dict Management	Launch 🚀
06	Analysis	Cross-Framework Performance Benchmarking	Launch 🚀
07	Hybrid Integration	Vision Transformers (ViT) in Keras	Launch 🚀
08	Hybrid Integration	Vision Transformers (ViT) in PyTorch	Launch 🚀
09	Final Study	Hybrid CNN-ViT Model Integration	Launch 🏆

📊 Dataset

The project utilizes the EuroSAT-style Geospatial Dataset (Land Use and Land Cover Classification). The raw data is fetched automatically within the notebooks from the public IBM Cloud Object Storage:

Source URL: images-dataSAT.tar

⚙️ Execution Guide

Option A: Colab Execution (Cloud)

Click on the link in the Colab Access tab in the table [🚀 Research Pipeline & Notebooks](### 🚀 Research Pipeline & Notebooks).

Option B: Local Execution (WSL2/GPU)

Recommended for leveraging local GPU acceleration.

1. Environment Setup

It is recommended to use an environment with Python 3.12.8:

Using Conda (Recommended):

conda env create -f environment.yml
conda activate vit-research

Using Pip:

pip install -r requirements.txt

2. Run the Research Study

Navigate to the notebooks/ directory and launch the modules via VS Code or Jupyter Lab.

💻 Tech Stack

Deep Learning Frameworks: Keras (TensorFlow) and PyTorch.
Model Architectures: Convolutional Neural Networks (CNNs), Vision Transformers (ViT) and Hybrid CNN-ViT model.
Data Handling: Geospatial Image Data Loading (memory-based vs. generator-based), Data Augmentation, Preprocessing.
Advanced Techniques: Transfer Learning (fine-tuning pre-trained models).
Performance Evaluation: Accuracy, Precision, Recall, F1-score, AU-ROC, Confusion Matrix.
| Deliverables: Jupyter Notebooks (technical rigor)

📜 Attributions & License This project was developed as a Capstone for the IBM Deep Learning Professional Certificate. The core datasets and initial lab structures are provided by IBM Skills Network under their educational terms. All model implementations, hybrid architecture integration, and comparative analyses were performed by me as part of this study.

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
data		data
images		images
notebooks		notebooks
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛰️ Geospatial Land Classification: CNN & ViT Hybrid Study

Geospatial Land Classification Study

📋 Project Overview

Final Results

📂 Repository Contents

🚀 Research Pipeline & Notebooks

📊 Dataset

⚙️ Execution Guide

Option A: Colab Execution (Cloud)

Option B: Local Execution (WSL2/GPU)

1. Environment Setup

Using Conda (Recommended):

Using Pip:

2. Run the Research Study

💻 Tech Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🛰️ Geospatial Land Classification: CNN & ViT Hybrid Study

Geospatial Land Classification Study

📋 Project Overview

Final Results

📂 Repository Contents

🚀 Research Pipeline & Notebooks

📊 Dataset

⚙️ Execution Guide

Option A: Colab Execution (Cloud)

Option B: Local Execution (WSL2/GPU)

1. Environment Setup

Using Conda (Recommended):

Using Pip:

2. Run the Research Study

💻 Tech Stack

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages