sarthakmahale123/README.md

👋 About Me

I'm a passionate Data Engineer & Full-Stack Developer who loves building end-to-end systems, from raw data pipelines all the way to polished user interfaces. Whether it's orchestrating Airflow DAGs, tuning PySpark jobs on Databricks, or crafting a React frontend that speaks to a Golang API, I enjoy the full journey.

I'm actively looking for roles where I can make an impact at scale: teams that care about clean architecture, good data, and shipping things that matter.

"Data is the new oil, and I'm here to refine it."


🎯 Open To Roles

Role | What I Bring
Data Engineer | End-to-end pipeline design, orchestration, warehouse optimization
Full-Stack Developer | React + Golang/Node backends, REST APIs, cloud-native apps
ML / Analytics Engineer | EDA, feature engineering, Spark ML, dashboards
Backend Developer | Go (Gin), Java, Python: building scalable, production-grade services

📍 Based in Thane, Maharashtra, India · Open to remote / hybrid / relocation


🛠️ Tech Stack

Languages

Python JavaScript Go Java SQL

Data Engineering & Analytics

Apache Airflow Apache Spark Databricks dbt DuckDB Delta Lake MLflow

Frontend & Backend

React Chakra UI Gin Node.js

Databases & Storage

PostgreSQL MySQL MongoDB

ML & Data Science

Pandas NumPy Scikit-Learn Seaborn Jupyter

DevOps & Tools

Docker Git GitHub Metabase


🚀 Featured Projects

🏗️ ecommerce-pipeline

End-to-end Data Engineering Pipeline

Built a production-grade pipeline ingesting, transforming, and visualizing e-commerce data using:

  • Python · dbt · DuckDB · Airflow · Metabase · Docker
  • Automated orchestration with Airflow DAGs
  • Analytical transformations with dbt models
  • Containerized via Docker for reproducibility
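
As a rough illustration of what the orchestration step guarantees, the sketch below orders three pipeline stages by their dependencies using only the Python standard library. It is not the repository's Airflow DAG: the stage names (`ingest_raw`, `dbt_transform`, `refresh_dashboard`) are hypothetical, and `graphlib` stands in for the scheduler.

```python
from graphlib import TopologicalSorter

# Hypothetical stage names (assumption, not taken from the repository).
# Each key maps to the set of stages that must finish before it can run.
dependencies = {
    "ingest_raw": set(),
    "dbt_transform": {"ingest_raw"},
    "refresh_dashboard": {"dbt_transform"},
}


def run_order(deps):
    """Return the stages in an order that respects every dependency."""
    return list(TopologicalSorter(deps).static_order())


order = run_order(dependencies)
print(order)
```

An orchestrator such as Airflow adds scheduling, retries, and monitoring on top of this ordering.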

Production-Grade Data Lakehouse

Processed 19.4M real NYC taxi trips on Databricks using:

  • PySpark · Delta Lake · Spark Streaming · MLflow
  • Lakehouse architecture with Bronze / Silver / Gold layers
  • ML experiments tracked with MLflow
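
The Bronze / Silver / Gold layering can be sketched in plain Python to show the role of each layer. This is a toy under stated assumptions, not the project's PySpark code: the `zone` and `fare` fields, the validation rule, and all values are made up for illustration.

```python
from collections import defaultdict

# Bronze: raw records kept exactly as received (made-up illustrative values).
bronze = [
    {"zone": "A", "fare": "12.5"},
    {"zone": "A", "fare": "bad"},
    {"zone": "B", "fare": "8.0"},
    {"zone": "B", "fare": "-3.0"},
]


def to_silver(rows):
    """Silver: parse types and drop rows that fail basic validation."""
    clean = []
    for row in rows:
        try:
            fare = float(row["fare"])
        except ValueError:
            continue  # unparseable fare, skip the row
        if fare >= 0:  # hypothetical rule: fares cannot be negative
            clean.append({"zone": row["zone"], "fare": fare})
    return clean


def to_gold(rows):
    """Gold: aggregate to a reporting-ready view (average fare per zone)."""
    fares_by_zone = defaultdict(list)
    for row in rows:
        fares_by_zone[row["zone"]].append(row["fare"])
    return {zone: sum(f) / len(f) for zone, f in fares_by_zone.items()}


silver = to_silver(bronze)
gold = to_gold(silver)
print(gold)
```

Keeping the raw Bronze data untouched means the Silver and Gold layers can be rebuilt if a cleaning rule changes.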

Global Health Data Analysis

Analyzed WHO life expectancy data to uncover socio-economic factors affecting lifespan:

  • Python · Pandas · NumPy · Seaborn · Jupyter
  • Extensive EDA, visualizations, and correlation analysis
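
For readers unfamiliar with correlation analysis, here is a minimal standard-library sketch of the Pearson coefficient. The column names and numbers are invented for illustration and are not WHO figures or results from the notebook, which uses Pandas rather than this hand-rolled function.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Made-up toy values (assumption): years of schooling vs. life expectancy.
schooling = [8.0, 10.0, 12.0, 14.0]
life_expectancy = [60.0, 66.0, 71.0, 78.0]

r = pearson(schooling, life_expectancy)
print(round(r, 3))
```

A value near 1 or -1 indicates a strong linear relationship; it does not by itself show that one factor causes the other.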

Full-Stack ETF Intelligence App · Portfolio project for Mirae Asset ($721B AUM, 19 global markets)

  • React · JavaScript · REST APIs
  • Investment research & portfolio visualization platform
  • Designed to mirror real-world fintech product UX

Full-Stack Stock Portfolio Tracker

A complete stock tracking app from frontend to backend:

  • React · Chakra UI · Golang (Gin) · MySQL
  • Real-time portfolio monitoring & trade tracking

Blood Bank Management System

A database-driven management system built with:

  • Java · MySQL
  • CRUD operations for donors, inventory, and requests

📊 By The Numbers

🗂️ Projects | 🌐 Languages | ☁️ Cloud & Big Data | 🔧 Tools & Frameworks
6 public repos | 5 languages | 3 platforms | 10+ frameworks
End-to-end pipelines | Python · Go · Java · JS · SQL | Databricks · Airflow · Docker | React · Gin · dbt · Spark

🧠 Language Breakdown (across pinned projects)

Python          ████████████████░░░░  ~45%   (pipelines, ML, analytics)
JavaScript      ████████░░░░░░░░░░░░  ~20%   (React frontends)
Go              ██████░░░░░░░░░░░░░░  ~15%   (Gin backend APIs)
Java            █████░░░░░░░░░░░░░░░  ~12%   (backend systems)
SQL / dbt       ███░░░░░░░░░░░░░░░░░   ~8%   (transformations)

🤝 Let's Connect

LinkedIn GitHub Email


Open to Data Engineering · Full-Stack · Backend · Analytics roles

If you're building something exciting, let's talk!

Pinned

  1. ecommerce-pipeline (Public)

    End-to-end data engineering pipeline: Python · dbt · DuckDB · Airflow · Metabase · Docker

    Python

  2. nyc-taxi-lakehouse (Public)

    Production-grade data lakehouse on Databricks: PySpark · Delta Lake · Spark Streaming · MLflow · 19.4M real NYC taxi trips

    Jupyter Notebook

  3. who_lifeexpectancy_analysis (Public)

    Analyzed WHO life expectancy data using Python (Pandas, NumPy, Seaborn) to uncover key health and socio-economic factors affecting lifespan. Performed EDA, built visualizations, and identified corr…

    Jupyter Notebook

  4. miraescope_etf_intelligence_platform (Public)

    MiraeScope is a full-stack React application built as a portfolio project for Mirae Asset Financial Group, one of Asia's largest asset managers with $721B AUM across 19 global markets. The platfor…

    JavaScript

  5. React-Go-Stocks-Portfolio (Public)

    A full-stack stock portfolio tracker application built with React, Chakra UI, Golang (Gin), and MySQL. This platform enables users to manage and monitor their stock investments efficiently by track…

    JavaScript

  6. blood-bank-management (Public)

    Welcome to this BloodBank Management project using Java and MySQL.

    Java