# Feather Scan

A full-stack image classification web application that identifies birds, plants, and animals using pretrained HuggingFace & BioCLIP machine learning models, with additional species information retrieved from internal datasets and an external LLM API when applicable.
The project focuses on backend architecture, service separation, and reliable API design rather than training custom ML models.

- Overview
- Features
- Architecture
- Tech Stack
- Prerequisites
- Installation
- Configuration
- Running the Application
- API Documentation
- Project Structure
- Deployment
- License
## Overview

Feather Scan is a three-service web application that allows users to upload images and classify them as birds, plants, or animals.
The system is split into a React frontend, an Express + TypeScript backend, and a FastAPI-based ML inference service.
The backend acts as a mediator between the client and the ML service.
It handles authentication, user history persistence, request validation, file uploads, and response shaping.
Pretrained models from Hugging Face and BioCLIP are used for inference, while the Gemini API is used to enrich bird predictions with additional contextual information.
## Features

- Upload an image (up to 5 MB) and classify it as a bird, plant, or animal
- Bird species identification using a pretrained classifier
- Plant identification using a medicinal plant model
- Animal classification using BioCLIP (zero-shot)
- Species information enrichment using Gemini API (birds only)
- Confidence score returned with each prediction
- Clear separation between frontend, backend, and ML services
- Centralized request validation and file handling in the backend
- Backend-to-ML communication via HTTP APIs
- FastAPI-based ML inference service
- Automated backend tests using Jest and Supertest
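The upload constraints above (5 MB limit, jpg/png files, three model types) can be sketched as a small validation helper. This is an illustrative Python sketch, not the project's actual Express middleware; names like `validate_upload` are assumptions:

```python
# Hypothetical sketch of the backend's upload validation rules:
# a 5 MB size cap, jpg/png only, and a known model type.

MAX_SIZE = 5 * 1024 * 1024          # 5 MB upload limit
ALLOWED_EXTENSIONS = {"jpg", "jpeg", "png"}
ALLOWED_MODEL_TYPES = {"bird", "plant", "animal"}

def validate_upload(filename: str, size_bytes: int, model_type: str) -> list:
    """Return a list of validation errors (empty list means the upload is OK)."""
    errors = []
    ext = filename.rsplit(".", 1)[-1].lower() if "." in filename else ""
    if ext not in ALLOWED_EXTENSIONS:
        errors.append(f"unsupported file type: {ext or 'none'}")
    if size_bytes > MAX_SIZE:
        errors.append("file exceeds 5 MB limit")
    if model_type not in ALLOWED_MODEL_TYPES:
        errors.append(f"unknown modelType: {model_type}")
    return errors
```

Centralizing checks like these in the backend keeps the ML service from ever seeing malformed input.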
## Architecture

Feather Scan follows a service-separated architecture to keep concerns isolated and the system easy to reason about.

```
┌────────────────┐
│    Frontend    │  React + Vite (5173)
└───────┬────────┘
        │ POST /upload
┌───────▼────────┐
│  Backend API   │  Express + TypeScript (3000)
└───────┬────────┘
        │ POST /predict
┌───────▼────────┐
│   ML Service   │  FastAPI (5000)
└────────────────┘
```
The backend acts as a boundary layer, preventing the client from directly interacting with the ML service and allowing validation, authentication, and future extensibility without changing the client.
- The client uploads an image along with the selected model type.
- The backend validates the request and handles file processing.
- The image is forwarded to the ML service for inference:
  - Birds and plants use pretrained Hugging Face models.
  - Animals use BioCLIP for zero-shot classification.
- For bird predictions, additional species information is generated using Gemini API.
- For animal and plant predictions, info is retrieved from a database based on the predicted label.
- The backend returns a structured JSON response (label, confidence, info) to the client for display.
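The final step of the flow above — merging raw inference output with enrichment data into one payload — can be sketched as follows. `shape_response` is a hypothetical helper for illustration, not the actual server code:

```python
from typing import Optional

# Illustrative sketch of the backend's response shaping: raw inference
# output is merged with enrichment data (Gemini for birds, the local
# dataset for plants/animals) into the JSON payload the client renders.

def shape_response(label: str, model_type: str,
                   confidence: float, info: Optional[dict]) -> dict:
    """Build the {label, type, confidence, info} response payload."""
    return {
        "label": label,
        "type": model_type,
        "confidence": round(confidence, 2),
        "info": info or {},  # enrichment may be unavailable
    }
```

Shaping the response in the backend (rather than passing ML output through) lets the client stay independent of the ML service's internal format.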
## Tech Stack

| Category | Model | Source |
|---|---|---|
| Birds | chriamue/bird-species-classifier | Hugging Face |
| Plants | dima806/medicinal_plants_image_detection | Hugging Face |
| Animals | imageomics/bioclip-2 | BioCLIP |
These pretrained models were chosen to prioritize system design and integration over custom model training.
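Since the animal path relies on zero-shot classification, a toy sketch may help: a BioCLIP-style model embeds the image and each candidate label, then picks the label whose embedding is most similar. The vectors below are fabricated for illustration; only the cosine-similarity mechanics mirror the real approach:

```python
import math

# Toy illustration of zero-shot classification as used for animals:
# the image embedding is compared against one embedding per candidate
# label, and the closest label wins. All vectors here are made up;
# the real model uses learned CLIP-style embeddings.

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))

def zero_shot(image_emb, label_embs):
    """Return (best_label, similarity) over candidate label embeddings."""
    best = max(label_embs, key=lambda lbl: cosine(image_emb, label_embs[lbl]))
    return best, cosine(image_emb, label_embs[best])
```

Because the label set is just a list of candidate embeddings, new animal classes can be added without retraining — the core reason a zero-shot model fits here.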
## Prerequisites

Before you begin, ensure you have the following installed:
- Node.js (v18 or higher)
- Python (v3.9 or higher)
- npm or yarn (comes with Node.js)
- pip (Python package manager)
- Git (version control)
Optional but recommended:
- Virtual environment (venv or conda) for Python dependencies
## Installation

```bash
# Clone the repository
git clone https://github.com/devansh436/feather-scan.git
cd feather-scan

# Root dependencies
npm install

# Client dependencies
cd client && npm install && cd ..

# Server dependencies
cd server && npm install && cd ..

# ML service dependencies
cd model && pip install -r requirements.txt && cd ..
```

## Configuration

Create .env files in the respective directories:
`client/.env`:

```env
VITE_API_BASE=http://localhost:3000/upload
VITE_FIREBASE_API_KEY=<your-key>
VITE_FIREBASE_AUTH_DOMAIN=example-project.firebaseapp.com
VITE_FIREBASE_PROJECT_ID=example-project-a1b2c3
VITE_FIREBASE_APP_ID=x:123456789:web:e1x1m1p1le
```

`server/.env`:

```env
PORT=3000
FAST_API_URL=http://localhost:5000/predict
MONGO_URI=<your-mongo-uri>
```

`model/.env`:

```env
GEMINI_API_KEY=your_gemini_api_key_here
GEMINI_MODEL=gemini-2.5-flash
```

## Running the Application

From the root directory:
```bash
npm run dev
```

This starts all three services simultaneously:

- Client: http://localhost:5173
- Server: http://localhost:3000
- Model: http://localhost:5000
Alternatively, in separate terminal windows:

```bash
# Terminal 1 - ML Service
npm run model

# Terminal 2 - Backend Server
npm run server

# Terminal 3 - Frontend Client
npm run client
```

Open your browser and navigate to http://localhost:5173.

## API Documentation

### POST /upload

Accepts an image and model type; returns the predicted label and confidence.
Request

`multipart/form-data`

- `file`: image (jpg/png)
- `modelType`: `bird` | `plant` | `animal`
Response

```json
{
  "label": "bald eagle",
  "type": "bird",
  "confidence": 0.95,
  "info": {
    "name": "Bald Eagle",
    "scientific_name": "Haliaeetus leucocephalus",
    "confidence": 97.3,
    "habitat": "Near large bodies of open water, forests",
    "origin": "North America",
    "description": "A large bird of prey known for its white head and tail."
  }
}
```

API details are intentionally kept minimal, as the backend is not intended for third-party consumption.
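As a hedged sketch of how a client might consume this response with Python's standard library (the JSON literal is an abridged copy of the sample above; `summary` is a hypothetical display string):

```python
import json

# Parse the documented response payload and pull out the fields a UI
# might display. The JSON literal mirrors the sample response above.
raw = """{
  "label": "bald eagle",
  "type": "bird",
  "confidence": 0.95,
  "info": {"name": "Bald Eagle", "scientific_name": "Haliaeetus leucocephalus"}
}"""

data = json.loads(raw)
summary = (f'{data["info"]["name"]} ({data["info"]["scientific_name"]}), '
           f'confidence {data["confidence"]:.0%}')
```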
## Project Structure

```
feather-scan/
│
├── client/                  # Frontend React application
│   ├── src/
│   │   ├── components/      # React components
│   │   ├── pages/           # Page components
│   │   ├── App.jsx          # Main app component
│   │   └── main.jsx         # Entry point
│   ├── public/              # Static assets
│   ├── package.json
│   └── vite.config.js       # Vite configuration
│
├── server/                  # Backend Express API
│   ├── src/
│   │   ├── routes/          # API routes
│   │   ├── middleware/      # Custom middleware
│   │   ├── utils/           # Helper functions
│   │   └── index.ts         # Server entry point
│   ├── package.json
│   └── tsconfig.json        # TypeScript config
│
├── model/                   # ML inference service
│   ├── data/                # Static species data
│   ├── main.py              # FastAPI application
│   ├── model.py             # Model loading & inference
│   └── requirements.txt     # Python dependencies
│
├── .github/
│   └── workflows/           # CI/CD workflows
│       └── test.yml         # Automated testing
│
├── package.json             # Root package (scripts)
└── README.md
```
Run backend tests:

```bash
cd server
npm test
```

Tests are automatically run on push via GitHub Actions.
- Used pretrained ML models to focus on backend architecture and system integration.
- Isolated ML inference into a separate FastAPI service to keep the Node.js backend non-blocking.
- Centralized validation and file handling in the backend to avoid exposing the ML service.
- Implemented backend testing early using Jest and Supertest to ensure API stability.
## License

This project is licensed under the MIT License - see the LICENSE file for details.
Built with ❤️ by devansh436

*If you found this project useful, consider giving it a ⭐!*