Skip to content

fvalerii/geospatial-land-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

68 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ›ฐ๏ธ Geospatial Land Classification: CNN & ViT Hybrid Study

Geospatial Land Classification Study

Python TensorFlow PyTorch Keras IBM


๐Ÿ“‹ Project Overview

The capstone project, completed as part of the IBM Deep Learning Professional Certificate, focuses on building an advanced land classification system for agricultural applications using satellite imagery.

The project simulates an AI Engineer role at a fertilizer company, where the core objective is to develop and rigorously compare state-of-the-art deep learning models for accurately classifying terrain (e.g., crops, forests, water bodies). The entire deep learning pipeline was implementedโ€”from custom geospatial data handling to comparative model analysisโ€”showcasing expertise across leading deep learning frameworks.


Final Results

Hybrid Model Comparison

  1. Robust Deep Learning Model Development

    CNN Implementation: Developed and trained independent CNN models using both Keras and PyTorch to solve the land classification problem.

    Vision Transformer Integration: Designed and implemented a hybrid deep learning model by integrating features from pre-trained CNNs and Vision Transformers. The entire combined architecture was then fine-tuned to optimize performance for the agricultural land classification task.

    Comparative Analysis: Conducted a comprehensive comparative study of CNNs and Hybrid CNN-Vision Transformer performance across the two major frameworks.

  2. Full-Cycle Deep Learning Pipeline

    Data Handling: Implemented efficient techniques for geospatial image dataset loading and applied custom data augmentation strategies in both Keras and PyTorch.

    Model Evaluation: Rigorously evaluated all models using a suite of quantitative metrics, including F1โ€‹-score and AU-ROC, to ensure robust and reliable performance for a real-world application.


๐Ÿ“‚ Repository Contents

The research is organized into three distinct phases, each contained within its own subfolder featuring a localized README for specific implementation details:

๐Ÿš€ Research Pipeline & Notebooks

# Phase Technical Focus Colab Access
01 Data Engineering Memory-Based vs. Generator-Based Ingestion Launch ๐Ÿš€
02 Data Engineering Scalable Augmentation Strategies (TensorFlow) Launch ๐Ÿš€
03 Data Engineering Torchvision Pipeline & Tensor Transformations Launch ๐Ÿš€
04 CNN Development Keras Convolutional Baseline & Optimization Launch ๐Ÿš€
05 CNN Development PyTorch Implementation & State Dict Management Launch ๐Ÿš€
06 Analysis Cross-Framework Performance Benchmarking Launch ๐Ÿš€
07 Hybrid Integration Vision Transformers (ViT) in Keras Launch ๐Ÿš€
08 Hybrid Integration Vision Transformers (ViT) in PyTorch Launch ๐Ÿš€
09 Final Study Hybrid CNN-ViT Model Integration Launch ๐Ÿ†

๐Ÿ“Š Dataset

The project utilizes the EuroSAT-style Geospatial Dataset (Land Use and Land Cover Classification). The raw data is fetched automatically within the notebooks from the public IBM Cloud Object Storage:


โš™๏ธ Execution Guide

Option A: Colab Execution (Cloud)

Click on the link in the Colab Access tab in the table [๐Ÿš€ Research Pipeline & Notebooks](### ๐Ÿš€ Research Pipeline & Notebooks).

Option B: Local Execution (WSL2/GPU)

Recommended for leveraging local GPU acceleration.

1. Environment Setup

It is recommended to use an environment with Python 3.12.8:

Using Conda (Recommended):
conda env create -f environment.yml
conda activate vit-research
Using Pip:
pip install -r requirements.txt

2. Run the Research Study

Navigate to the notebooks/ directory and launch the modules via VS Code or Jupyter Lab.


๐Ÿ’ป Tech Stack

  • Deep Learning Frameworks: Keras (TensorFlow) and PyTorch.
  • Model Architectures: Convolutional Neural Networks (CNNs), Vision Transformers (ViT) and Hybrid CNN-ViT model.
  • Data Handling: Geospatial Image Data Loading (memory-based vs. generator-based), Data Augmentation, Preprocessing.
  • Advanced Techniques: Transfer Learning (fine-tuning pre-trained models).
  • Performance Evaluation: Accuracy, Precision, Recall, F1โ€‹-score, AU-ROC, Confusion Matrix.
  • | Deliverables: Jupyter Notebooks (technical rigor)

๐Ÿ“œ Attributions & License This project was developed as a Capstone for the IBM Deep Learning Professional Certificate. The core datasets and initial lab structures are provided by IBM Skills Network under their educational terms. All model implementations, hybrid architecture integration, and comparative analyses were performed by me as part of this study.

About

๐Ÿ›ฐ๏ธ Geospatial Land Classification study using Hybrid CNN-ViT architectures. Implemented across TensorFlow & PyTorch, featuring custom satellite imagery ETL and comparative analysis of Attention vs. Convolutions. Developed for the IBM Professional Capstone.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors