Skip to content
View bindhusaahithi's full-sized avatar

Block or report bindhusaahithi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bindhusaahithi/README.md

Hi 👋, I'm Bindhu Saahithi

Data Science Graduate Student · ML · POWERBI · SQL · Cloud · EDA


👩‍💻 About Me

  • 🎓 Pursuing M.S. in Data Science at University of Massachusetts Dartmouth
  • 🔍 I build ML models, analyze data, and create dashboards to solve real-world problems
  • 💡 Passionate about turning raw data into meaningful business insights
  • 🌱 Currently exploring: Model Deployment · Advanced NLP · Data Engineering
  • 🎯 Open to: Data Scientist · Data Analyst · ML Engineer roles

🛠️ Skills & Tools

Languages & Libraries

Python SQL Pandas NumPy Scikit-learn Matplotlib Seaborn

Machine Learning

Regression Classification Clustering CNN Hypothesis Testing

Analytics & Visualization

Power BI Tableau Excel

Cloud & Data Engineering

AWS Azure Databricks ETL

Databases

MySQL SQL Server MongoDB Amazon Redshift

Tools

Git Jupyter VS Code


💼 Experience

🔸 Graduate Assistant — Business Analytics

University of Massachusetts Dartmouth · Jan 2026 – May 2026

  • Analyzed large datasets using Excel, SQL, and Python to generate actionable insights for academic and administrative decision-making
  • Built interactive dashboards in Power BI/Tableau to track KPIs and performance metrics
  • Automated weekly/monthly reports, reducing manual reporting time by 30–50%
  • Conducted EDA to uncover patterns and support research projects

🔸 Data Analyst Intern — Tredence Analytics

India · Aug 2023 – Sep 2023

  • Analyzed structured business datasets using SQL and Python to identify trends and support data-driven decisions
  • Designed interactive Power BI dashboards to track KPIs and operational metrics
  • Cleaned and transformed raw datasets to improve data quality and reporting accuracy
  • Collaborated with cross-functional teams to translate business requirements into analytical solutions

🔸 Data Engineer Intern — Anblicks

India · May 2023 – Jul 2023

  • Built ETL pipelines using Python and SQL to ingest, transform, and load data from multiple sources
  • Managed cloud-based data storage and processing workflows on Azure Data Lake
  • Optimized data pipelines to improve performance and reliability
  • Implemented data validation and monitoring checks to ensure data consistency

🚀 Featured Projects

Analyzed large-scale sales data to uncover revenue trends, regional performance, and product profitability. Built interactive dashboards to support business decision-making.
Python SQL Pandas Tableau EDA Power BI


Segmented customers using clustering and predicted Customer Lifetime Value to drive smarter marketing and retention strategies.
Python Scikit-learn K-Means Clustering Regression Pandas


Built a CNN-based deep learning model to detect lung cancer from CT scan images, improving classification accuracy for medical diagnosis.
Python CNN Deep Learning Medical Imaging Scikit-learn


📌 Currently Working On

  • 🚀 Deploying Customer Segmentation as a live Streamlit app
  • 📊 Building my Kaggle profile with project notebooks
  • 📝 Writing my first Medium article on end-to-end ML model building

🎯 Goals

  • 🤖 Build end-to-end data science and machine learning pipelines
  • ☁️ Strengthen cloud and data engineering skills (AWS · Azure · Databricks)
  • 📊 Work on impactful real-world datasets and open-source contributions

📊 GitHub Stats


Pinned Loading

  1. Customer-Segmentation-Customer-Lifetime-Value-Prediction Customer-Segmentation-Customer-Lifetime-Value-Prediction Public

    Customer segmentation and 90-day CLV prediction using RFM analysis, K-Means clustering, and machine learning on e-commerce transaction data.

    HTML

  2. Lung-cancer-detection Lung-cancer-detection Public

    Lung Cancer Detection using CNN on CT Scan Images

    Jupyter Notebook

  3. Netflix-Analysis Netflix-Analysis Public

    Netflix Movies & TV Shows Data Analysis using Python, Pandas, Matplotlib, and Seaborn.

    Jupyter Notebook

  4. Sales-Analytics Sales-Analytics Public

    Retail Sales Analytics project using Python, SQL, and Tableau to analyze revenue trends, regional performance, category insights, and profitability.

    Jupyter Notebook