I'm a second-year IT student who loves turning raw data into meaningful stories. Currently diving deep into data analysis, visualisation & ML β making numbers make sense.
- π BE in Information Technology (Expected 2028)
- π Focus: Data Analysis β’ Data Engineering β’ AI/ML
- π± Learning: PySpark, NLP, Data Pipelines
- π Strong interest in real-world datasets & scalable systems
- π Built ETL pipelines processing 100K+ records using PySpark
- π Analyzed 20K+ global data points to uncover environmental trends
- π€ Developed chatbot with 100% query accuracy (10/10 test cases)
- βοΈ Strong foundation in data structures, OOP, and system design
- Data Cleaning & Preprocessing
- Feature Engineering & Transformation
- Exploratory Data Analysis (EDA)
- Scalable Data Pipelines
- Visualization & Insight Generation
Languages
Data & ML
Tools
End-to-end PySpark-based ETL pipeline built on the Olist dataset.
- Processed 100K+ orders across multiple tables
- Designed modular pipeline for scalability
- Implemented feature engineering for delivery performance
- Stored optimized output in Parquet format
Tech: PySpark β’ ETL β’ Parquet β’ Big Data
A retrieval-based chatbot using NLP techniques.
- Implemented TF-IDF + Cosine Similarity
- Built hybrid system for FAQs + small talk
- Applied confidence threshold filtering
- Visualized performance metrics
Tech: Python β’ NLP β’ scikit-learn β’ Matplotlib
EDA project analyzing pollution trends across global cities.
- Worked on 20K+ records across 50+ cities
- Identified seasonal PM2.5 trends
- Created insightful visualizations
Tech: Pandas β’ Matplotlib
Java-based system using OOP principles.
- Applied inheritance & polymorphism
- Implemented file-based persistence
- Automated stock tracking & reporting
Tech: Java β’ OOP β’ Serialization
- βοΈ Advanced data pipelines using PySpark
- π§ NLP-based intelligent systems
- ποΈ Data engineering concepts (warehousing, ETL optimization)
- π Improving data visualization & storytelling
β If you like my work, consider giving a star to my repositories!
