🚗 UK Electric Vehicles Data Pipeline

An end-to-end ETL pipeline that extracts, cleans, and visualizes the latest electric vehicle (EV) registrations and charging infrastructure data from the UK Department for Transport (DfT).

💻View the live dashboard

📖 The Problem:

While 1 in 5 vehicles sold in the UK are now electric, infrastructure rollout risks stalling. Vital data for policymakers is often locked in dense, quarterly Excel/CSV reports, making it difficult to track regional disparities or real-time progress.

💡 The Solution:

This pipeline automates the ingestion of quarterly DfT statistics, feeding into a Power BI Dashboard. This enables local councillors and EV advocates to visualize charger density and vehicle adoption across all UK regions instantly.

Process

🛠 Tech Stack

Layer	Tools	Purpose
Extraction	`Python`, `Beautiful Soup`	Scraping GOV.UK for the latest release links.
Transformation	`Pandas`,`dbt`	Data cleaning, date formatting, and hierarchical modeling.
Loading	`SQLAlchemy`	Direct ingestion into the cloud warehouse.
Database	`Supabase (PostgreSQL)`	Scalable storage with a managed relational schema.
Orchestration	`GitHub Actions`	CRON-scheduled runs (Mon/Fri) to check for data updates.
Visualization	`PowerBI`	Interactive spatial and trend analysis.

🔧 Data Engineering Challenges

1. Solving the ragged hierarchy The UK's regional geography can be messy. To prevent "double counting" in Power BI:

The problem: If the dashboard sums "London" and "Westminster" a subset of London, the totals overinflate.

The fix: I introduced a closure table and recursive CTE. All fact tables are atomized to the smallest possible regional slice.

The result: Users can drill down from "National" to "Local" without double-counting data.

SELECT
    c.region_ons, c.quarter, c.all_chargers, c.fast_chargers
FROM raw_data c
WHERE NOT EXISTS (
    SELECT 1
    FROM closure rc
    JOIN raw_data c_child
        ON c_child.region_ons = rc.descendant_ons
        AND c_child.quarter = c.quarter
    WHERE rc.ancestor_ons = c.region_ons
    AND rc.descendant_ons <> c.region_ons
)

2. Synchronizing data release dates The problem: Charger data and EV registrations release on different schedule to each other. The fix: Pipeline logic uses vehichle registrations as the 'global date', preventing nulls from lagging data releases.

📊 Key Insights (As of Q3 2025)

Growth: Total public chargers increased 4x since 2019 (15k → 82k).

The "Gap": Windsor & Maidenhead show the largest imbalance with 1,020 vehicles per charger.

Efficiency: Coventry and Hackney lead the country with a 3:1 vehicle-to-charger ratio.

Infrastructure Mix: Rapid/Ultra-rapid chargers doubled in volume (8k → 16k) but their total market share only grew by 2%, indicating slower-speed chargers still dominate the rollout.

🚀 Getting started

Prerequisites

Python 3.9+
Supabase account
PowerBI
dbt Core (for running local transformations)

Setup 1. Clone and install

# Clone the repository
git clone https://github.com/TurnerHaa/ev_dashboard.git

cd your-project

# Install dependencies
pip install -r requirements.txt

2. Environment variables Create a .env file based on examples.env or setup your GitHub actions with the following secrets:

env:
  DB_HOST: ${{ secrets.DB_HOST }}
  DB_USER: ${{ secrets.DB_USER }}
  DB_PASSWORD: ${{ secrets.DB_PASSWORD }}
  DB_NAME: ${{ secrets.DB_NAME }}
  DB_PORT: "6543"

3. Database schema The dbt models are located in /models. Run dbt run to create hierarchy and closure tables.

View GitHub Actions time settings

on:
  schedule:
    - cron: '00 00 * * 1,5' # set to run the pipeline at midnight every Monday and Friday.

Future roadmap

[ ] Mobile Alerts: Integrate Twilio for SMS notifications when new DfT data is detected.

[ ] Optimization: Partition chargers and vehicles tables for faster recursive queries.

[ ] Advanced Viz: Add a "Comparison Mode" to benchmark two specific Local Authorities side-by-side.

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
.github/workflows		.github/workflows
ev_transform		ev_transform
.gitignore		.gitignore
README.md		README.md
ev_pipeline.py		ev_pipeline.py
examples.env		examples.env
profiles_example.yml		profiles_example.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚗 UK Electric Vehicles Data Pipeline

📖 The Problem:

💡 The Solution:

Process

🛠 Tech Stack

🔧 Data Engineering Challenges

📊 Key Insights (As of Q3 2025)

🚀 Getting started

Future roadmap

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🚗 UK Electric Vehicles Data Pipeline

📖 The Problem:

💡 The Solution:

Process

🛠 Tech Stack

🔧 Data Engineering Challenges

📊 Key Insights (As of Q3 2025)

🚀 Getting started

Future roadmap

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages