Autonomous Perception Ensemble

Multi-model sensor fusion for autonomous driving scene understanding.

Overview

This project combines three perception models into a unified scene understanding system:

┌─────────────────────────────────────────────────────────────┐
│                      CAMERA INPUT                           │
└─────────────────────────┬───────────────────────────────────┘
                          │
        ┌─────────────────┼─────────────────┐
        ▼                 ▼                 ▼
┌───────────────┐ ┌───────────────┐ ┌───────────────┐
│   YOLOv8      │ │    U-Net      │ │    MiDaS      │
│  Detection    │ │ Segmentation  │ │    Depth      │
└───────────────┘ └───────────────┘ └───────────────┘
        │                 │                 │
        └─────────────────┼─────────────────┘
                          ▼
                ┌─────────────────┐
                │     FUSION      │
                └─────────────────┘

Models

Model	Task	Dataset	Output
YOLOv8-small	Object Detection	BDD100K	Bounding boxes
U-Net	Drivable Area	BDD100K	Segmentation mask
MiDaS	Depth Estimation	Pretrained	Depth map

Results

Component	Metric	Value
Detection	mAP@50	~0.50
Segmentation	IoU	~0.85
Depth	Relative	Pretrained
Ensemble	FPS	~15-20

Project Structure

├── notebooks/
│   ├── 01_Detection.ipynb      # YOLOv8 fine-tuning
│   ├── 02_Segmentation.ipynb   # U-Net training
│   └── 03_Fusion_ONNX.ipynb    # Ensemble + export
├── models/                      # ONNX models (see HuggingFace)
├── src/
│   └── inference.py            # Inference pipeline
├── demo/
│   └── app.py                  # Gradio demo
└── assets/
    └── sample_output.png

Quick Start

Installation

git clone https://github.com/aryanp2107/Autonomous-Perception-Ensemble.git
cd Autonomous-Perception-Ensemble
pip install -r requirements.txt

Inference

from src.inference import PerceptionEnsemble

# Load ensemble
ensemble = PerceptionEnsemble(
    detection_model="models/yolov8n_bdd100k.onnx",
    segmentation_model="models/unet_drivable.onnx",
    depth_model="models/midas_small.onnx"
)

# Run on image
result = ensemble.predict("path/to/dashcam.jpg")

# Visualize
ensemble.visualize(result, save_path="output.png")

Demo

cd demo
python app.py
# Opens Gradio interface at localhost:7860

Notebooks

Notebook	Description	Colab
01_Detection	Fine-tune YOLOv8 on BDD100K
02_Segmentation	Train U-Net for drivable area
03_Fusion_ONNX	Combine models + ONNX export

Models Download

ONNX models hosted on HuggingFace:

# Download all models
huggingface-cli download aryanp2107/autonomous-perception-ensemble --local-dir models/

Or download individually:

Dataset

BDD100K — Berkeley DeepDrive 100K

We use a ~10K image subset via Roboflow for training.

Tech Stack

Detection: Ultralytics YOLOv8
Segmentation: PyTorch U-Net
Depth: Intel MiDaS
Export: ONNX Runtime
Demo: Gradio

Deployment

Deployed on Arxelos (coming soon)

License

MIT

Author

Aryan Patel

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Autonomous Perception Ensemble

Overview

Models

Results

Project Structure

Quick Start

Installation

Inference

Demo

Notebooks

Models Download

Dataset

Tech Stack

Deployment

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
demo		demo
models		models
notebooks		notebooks
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Autonomous Perception Ensemble

Overview

Models

Results

Project Structure

Quick Start

Installation

Inference

Demo

Notebooks

Models Download

Dataset

Tech Stack

Deployment

License

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages