Skip to content
View Joyan9's full-sized avatar

Block or report Joyan9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Joyan9/README.md

Hi there, I'm Joyan ๐Ÿ‘‹

Welcome to my GitHub profile!

About Me

  • Currently a Masters of Computer Science student at IU Berlin
  • Over 2+ years experience as analytics consultant
  • Strong passion for learning and acquiring knowledge

๐Ÿ› ๏ธ Technologies & Tools

  • Python, SQL, JavaScript
  • Apache Spark, Apache Flink
  • Azure Data Factory, Azure Databricks, Synapse Analytics, BigQuery
  • Looker Studio, Power BI

๐Ÿš€ Projects

  • Technologies Used: Python, Apache Spark, Docker, dlt, DuckDB, Jupyter Lab
  • Designed and implemented a scalable microservices-based data pipeline to process 9+ million farmer query records from India's Kisan Call Centre API.
  • Four core services:
    • data ingestion using DLT for incremental/backfill loads
    • Spark-based processing transforming raw data into star schema
    • interactive analytics dashboards
    • comprehensive testing suite
  • Containerized architecture handles high-volume batch processing with configurable data limits and API integration.
  • Technologies Used: Excel, SQL, Google Sheets, Looker Studio, Canva
  • Analyzed market penetration patterns across 10 Indian cities
  • Conducted comprehensive analysis of trip volumes, fares, and customer behaviors
  • Identified strategic insights:
  • Uncovered correlation between retention rates and customer satisfaction
  • Revealed operational differences between tourist and business markets
  • Highlighted pricing and service enhancement opportunities
  • Developed actionable recommendations for business growth
  • Technologies Used: Python (DuckDB), SQL, AWS EC2, AWS S3, Parquet
  • Created a data pipeline to collect and analyze data for Berlin tram line M13
  • Extracted data using BVG REST API and filtered as the requirements
  • Used Python for the Extract and Load (EL) process
  • Loaded the results as parquet files into a S3 bucket
  • Scheduled the job on an EC2 instance using the Crontab scheduler.
  • Fetches hourly weather forecasts from the Open-Meteo API
  • Suggests clothing items for morning, daytime, and evening
  • Adds weather-specific accessories like umbrellas, sunglasses, or windbreakers
  • Sends daily outfit recommendations via email
  • Runs automatically every morning with GitHub Actions
  • Dockerized for easy deployment
  • Includes unit and integration testing

๐Ÿ“ซ Connect with Me

Thanks for visiting my profile! Feel free to check out my repositories and get in touch if you want to collaborate on any projects.

Pinned Loading

  1. good_cabs_analysis good_cabs_analysis Public

  2. pyspark-learning-journey pyspark-learning-journey Public

    Jupyter Notebook 10 4

  3. 8-weeks-sql-challenge 8-weeks-sql-challenge Public

    Jupyter Notebook

  4. bvg-open-data-project bvg-open-data-project Public

    Data Engineering Project on extracting data from BVG API for a particular Tram line (M13)

    Jupyter Notebook

  5. learning_data_engineering learning_data_engineering Public

    Jupyter Notebook 3