DUNCAN OTIENO Duncan610

Duncan Otieno

Data Engineer | Analytics Engineer • Data Pipeline Architect

Building reliable data infrastructure, one transformation at a time

LinkedIn • Email • Nairobi, Kenya 🇰🇪

👨🏾‍💻 What I Do

I transform raw data into reliable insights. Currently specializing in modern data stack engineering with a focus on:

Data Modeling → Dimensional modeling, slowly changing dimensions, data vault
Pipeline Orchestration → Airflow DAGs, dependency management, error handling
Analytics Engineering → dbt transformations, incremental materialization, data quality
Cloud Infrastructure → AWS data services, infrastructure-as-code, cost optimization

🎯 Current Mission

Transitioning into production analytics engineering after completing a 1-year intensive data science certification and earning my AWS Cloud Practitioner certification. Building portfolio projects that solve real business problems with clean code and thoughtful architecture.

🚀 Featured Work

🏗️ [Instacart Analytics pipeline]

Production-Grade E-Commerce Analytics Platform

Building an end-to-end analytics pipeline that processes customer transaction data using the modern data stack.

Stack: dbt • PostgreSQL • Airflow • Python • Docker • Snowflake
Highlights: Incremental ETL, data quality testing, CI/CD automation

Technical Deep Dive

Architecture:

Medallion architecture (Bronze → Silver → Gold layers)
Incremental materialization for performance
Great Expectations for data quality
GitHub Actions for continuous deployment

Business Impact:

Reduces data processing time by 80%
Automated data quality checks catch 99% of issues
Self-service analytics layer for stakeholders

💼 Technical Toolkit

Core Data Engineering Stack

Cloud & DevOps

Data Science & ML

Data Engineering

Languages: Python, SQL, Bash
Orchestration: Apache Airflow
Transformation: dbt (data build tool)
Databases: PostgreSQL, Snowflake
Data Quality: Great Expectations
Version Control: Git, GitHub Actions

Cloud & Infrastructure

Cloud Platform: AWS (EC2, S3, RDS, Lambda)
Containerization: Docker, Docker Compose
IaC: Terraform (learning)
BI Tools: Tableau, Power BI (basic)
ML Background: Scikit-learn, Pandas, NumPy
IDEs: VS Code, Jupyter, PyCharm

📈 What Makes Me Different

🔍 Detail-Oriented Engineering
I don't just make pipelines work—I make them maintainable, testable, and cost-efficient.

🧠 Business-First Mindset
Every technical decision traces back to business value. Data engineering isn't just moving data—it's enabling better decisions.

📚 Continuous Learner
From data science certification to cloud engineering to analytics engineering—I'm always expanding my technical horizons.

🌍 Global Perspective, Local Impact
Based in Nairobi, building skills that compete globally while looking to create impact locally.

💻 Philosophy in Code

def approach_to_data_engineering():
    principles = {
        "quality": "Test everything, twice",
        "efficiency": "Automate the boring stuff",
        "clarity": "Code is read more than written",
        "impact": "Focus on business value"
    }
    return principles

📊 2025 Focus Areas

graph LR
    A[Modern Data Stack] --> B[dbt Mastery]
    A --> C[Airflow Expertise]
    A --> D[Cloud Architecture]
    B --> E[Production Projects]
    C --> E
    D --> E
    E --> F[Analytics Engineering Role]

Currently Building:

✅ Production-grade data pipelines
✅ Data quality frameworks
✅ CI/CD for analytics code
🎯 Real-time streaming (next phase)

📍 Now

Last updated: January 2025

Currently:

🔨 Building: Data Engineering and analytics engineering projects
📚 Learning: Advanced data modeling patterns (Kimball methodology)
🎯 Seeking: Data Engineer / Analytics Engineer roles
🌱 Reading: "The Data Warehouse Toolkit" by Ralph Kimball

This Week:

Implementing incremental loads in dbt
Building Airflow DAGs for orchestration
Networking with data engineers on LinkedIn
Contributing to data engineering communities

💭 Philosophy

"The best data pipeline is the one you don't have to think about—it just works, scales, and alerts you when it doesn't."

I believe in:

Automation over manual work → If I do it twice, I automate it
Documentation as code → Good docs prevent 3 AM debugging sessions
Test-driven development → Catch bugs before they catch you
Incremental improvement → Small wins compound into excellence

🎓 Certifications & Education

AWS Certified Cloud Practitioner • 2024
ALX Data Science Tech Programs • 1-Year Program • 2023-2024

⚡ Fun Fact

When I'm not building data pipelines, I'm probably:

⚽ Training for a football tournament around Nairobi
☕ Experimenting with pour-over coffee (yes, I track the extraction ratios in a spreadsheet)
📖 Reading technical blogs and data engineering case studies
♟️ Playing chess online (data analysis extends to opening theory!)

I've written SQL queries that join 10+ tables without losing my sanity. My secret? CTEs, lots of CTEs.

🤝 Let's Connect

I'm actively seeking Analytics Engineer or Junior Data Engineer roles where I can:

Build scalable data infrastructure
Work with modern data stack (dbt, Airflow, Snowflake, Databricks)
Collaborate with data teams solving real problems
Learn from experienced engineers

Reach out if you're:

Hiring for analytics engineering roles
Want to discuss data architecture
Building something interesting in the data space
Looking for collaboration on open-source data tools

📧 Email: otienoduncan99@gmail.com
💼 LinkedIn: duncan-otieno
📍 Location: Nairobi, Kenya (Open to remote)
🕐 Timezone: EAT (UTC+3)

📊 GitHub Activity

💡 Current Status

+ Building production-grade projects
+ Networking with data engineering community
+ Actively seeking analytics engineering roles
! Available for opportunities - Let's build something great together

"Data is the new electricity, and Engineers are the power grid. Keep Building, Keep Automating, Keep Scaling."

Last Updated: January 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly