Python Developer · Data Engineering & Analytics
BS in Data Science and Applications @ IIT Madras · Open Source Builder
Building reproducible, production-grade Python tooling for data engineering and analytics.
BS in Data Science and Applications @ IIT Madras, focused on building practical Python systems for data engineering and analytics.
Interested in:
- Data Engineering & Pipeline Design
- Python Development & Developer Tooling
- Analytics and Reproducible Workflows
- Open Source Software
I build production-grade Python tools for data engineering and analytics, with a focus on reproducible workflows, automation, and developer tooling.
- Modular ETL frameworks with CLI orchestration
- Reproducible environments and automated workflows
- Data analysis and visualization pipelines
- Test before release — pytest on every project
- Automate repetitive workflows — GitHub Actions CI/CD
- Prefer reproducible environments — uv, pinned dependencies
- Build installable, maintainable software — packaged wheels, clean APIs
- Enforce code quality — Ruff linting, pre-commit hooks
End-to-end Python data engineering framework for ingestion, preprocessing, feature engineering, model evaluation, and reporting — 16 submodules, CI/CD, PyArrow Parquet I/O, automated testing, and installable via wheel.
Built with: Python · pytest · GitHub Actions · Ruff · uv · PyArrow
Python tool with a browser-based timeline viewer for tracing and validating object trace outputs — includes CI workflow and headless trace server runner.
Built with: Python · HTML · GitHub Actions · pytest · flake8
| Area | Tools |
|---|---|
| Languages | Python, SQL (fundamentals) |
| Data Engineering | ETL pipelines, Parquet, PyArrow, Pandas |
| Dev Practices | CI/CD, pytest, Ruff, Git, uv |
| Currently Learning | Advanced SQL, data modeling |
Building open-source Python tooling for data engineering and analytics.