Skip to content
View RaazSareen's full-sized avatar

Block or report RaazSareen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
RaazSareen/README.md

Hi, I'm Raaz 👋

Python Developer · Data Engineering & Analytics
BS in Data Science and Applications @ IIT Madras · Open Source Builder

Building reproducible, production-grade Python tooling for data engineering and analytics.


About Me

BS in Data Science and Applications @ IIT Madras, focused on building practical Python systems for data engineering and analytics.

Interested in:

  • Data Engineering & Pipeline Design
  • Python Development & Developer Tooling
  • Analytics and Reproducible Workflows
  • Open Source Software

🔧 What I Build

I build production-grade Python tools for data engineering and analytics, with a focus on reproducible workflows, automation, and developer tooling.

  • Modular ETL frameworks with CLI orchestration
  • Reproducible environments and automated workflows
  • Data analysis and visualization pipelines

⚙️ Engineering Principles

  • Test before release — pytest on every project
  • Automate repetitive workflows — GitHub Actions CI/CD
  • Prefer reproducible environments — uv, pinned dependencies
  • Build installable, maintainable software — packaged wheels, clean APIs
  • Enforce code quality — Ruff linting, pre-commit hooks

🚀 Projects

End-to-end Python data engineering framework for ingestion, preprocessing, feature engineering, model evaluation, and reporting — 16 submodules, CI/CD, PyArrow Parquet I/O, automated testing, and installable via wheel.

Built with: Python · pytest · GitHub Actions · Ruff · uv · PyArrow


Python tool with a browser-based timeline viewer for tracing and validating object trace outputs — includes CI workflow and headless trace server runner.

Built with: Python · HTML · GitHub Actions · pytest · flake8


🛠️ Tech Stack

Area Tools
Languages Python, SQL (fundamentals)
Data Engineering ETL pipelines, Parquet, PyArrow, Pandas
Dev Practices CI/CD, pytest, Ruff, Git, uv
Currently Learning Advanced SQL, data modeling

📫 Let's Connect


Building open-source Python tooling for data engineering and analytics.

Pinned Loading

  1. pipelineforge pipelineforge Public

    Modular Python framework for data engineering and analytics workflows — ETL pipelines, reproducible environments, automation, and CI/CD.

    Python

  2. object_trace_visualizer object_trace_visualizer Public

    Forked from 23f3002694-rishi/object_trace_visualizer

    Windows CI smoke workflow, headless trace server runner, and a browser timeline viewer for visualizing and validating object trace outputs.

    Python