☁️ Azure ML Labs

A comprehensive collection of machine learning projects, experiments, and labs built on Azure Machine Learning. This repository showcases end-to-end ML workflows including data preprocessing, model training, hyperparameter tuning, batch scoring, and deployment.

📌 Overview

This repository contains my work and projects completed using Azure ML. It demonstrates practical implementations of:

Automated ML pipelines
Hyperparameter tuning (HyperDrive)
Batch and real-time scoring
MLflow tracking
Responsible AI dashboards
Model deployment to managed endpoints

📁 Project Structure

Azure-ML-labs/ ├── Deployments/ # Deployment configurations and scripts ├── Mlflow/ # MLflow tracking experiments ├── Pipelines/ # Azure ML pipeline definitions ├── diabetes-data/ # Diabetes dataset and preprocessing ├── finalgolddata/ # Gold price prediction data ├── logs and AI dash/ # Logs and Responsible AI dashboards ├── Bank Customer Churn Prediction.csv ├── batch_score.py # Batch scoring script ├── conda_env_v_1_0_0.yml # Conda environment configuration ├── hyperdrive.txt # HyperDrive tuning configuration ├── model.pkl # Trained model pickle file ├── online_score.py # Real-time scoring script ├── prep.py # Data preprocessing ├── preprocess.py # Feature engineering ├── score.py # Scoring logic ├── scoring_file_v_2_0_0.py ├── sweep_train.py # Hyperparameter sweep training ├── train.py # Main training script ├── train_mlflow.py # Training with MLflow tracking ├── train_pipeline.py # Pipeline training script └── ... (logs and artifacts)

text

🚀 Featured Projects

1. Gold Price Predictor

Model: RandomForestRegressor (n_estimators=150)
R² Score: 0.9994
MAE: $10.82
Features: Open, High, Low, Close, Volume, Price_Range, Daily_Return, MA_5, MA_20

2. Bank Customer Churn Classifier

Model: RandomForestClassifier
Features: Customer demographics, account data, transaction history
Pipeline: End-to-end preprocessing + training

3. Diabetes Prediction

Goal: Predict diabetes progression using medical indicators
Approach: Linear regression with feature scaling and hyperparameter tuning

🛠️ Tech Stack

Category	Tools
Cloud Platform	Azure Machine Learning
Languages	Python 3.10
ML Libraries	scikit-learn, pandas, numpy
Tracking	MLflow
Deployment	Azure Container Instances, Managed Endpoints
Environment	Conda, Docker

📦 Setup & Installation

Prerequisites

Azure subscription
Azure ML workspace
Python 3.10+

Clone & Configure

# Clone repository
git clone https://github.com/Tee808-bigD/Azure-ML-labs.git
cd Azure-ML-labs

# Create conda environment
conda env create -f conda_env_v_1_0_0.yml
conda activate azure_ml_env

# Configure Azure CLI
az login
az account set --subscription "your-subscription-id"
az ml workspace connect --workspace-name "your-workspace" --resource-group "your-rg"
Run Training
bash
# Train gold price predictor
python train.py --config configs/gold_config.yaml

# Train churn classifier with MLflow
python train_mlflow.py --data_path ./Bank\ Customer\ Churn\ Prediction.csv

# Run hyperparameter sweep
python sweep_train.py
Scoring & Deployment
bash
# Batch scoring
python batch_score.py --input ./data/test_data.csv

# Real-time scoring endpoint
python online_score.py
📊 Key Learnings & Experiments
Experiment	What I Learned
Automated ML	How to let Azure AutoML find the best model and pipeline
HyperDrive	Tuning hyperparameters efficiently using Bayesian sampling
MLflow Tracking	Logging metrics, parameters, and models for experiment comparison
Responsible AI	Building fairness, explainability, and error analysis dashboards
Batch vs Online Scoring	Trade-offs between latency, cost, and throughput
Pipeline Reusability	Creating reusable ML pipelines with reusable components
🔮 Future Work
Add more datasets (fraud detection, time series forecasting)

Implement CI/CD for model retraining and deployment

Create interactive dashboards with Azure Managed Grafana

Add LLMOps experiments with Azure AI Foundry

🤝 Contributing
Feel free to fork this repository and submit pull requests. For major changes, please open an issue first.

📄 License
This project is licensed under the MIT License - see the LICENSE file for details.

📧 Contact
Thando Mzobe

GitHub: @Tee808-bigD

LinkedIn: Thando Mzobe

Email: thandomzobe9@gmail.com

🙏 Acknowledgments
Microsoft Learn for Azure ML documentation and training

Azure ML community for best practices

Built with ☁️ on Azure Machine Learning

text

## How to Add This to Your Repository:

1. Go to your repository: https://github.com/Tee808-bigD/Azure-ML-labs
2. Click on `README.md` (or create it if it doesn't exist)
3. Click the pencil icon (Edit)
4. **Copy and paste** the entire markdown above
5. Scroll down and click **Commit changes**

Your README will now look professional and showcase all your Azure ML work! 🚀

Would you like me to adjust any section or add more details about specific experiments?# Azure-ML-labs
work and projects done with Azure

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Deployments		Deployments
Mlflow		Mlflow
Pipelines		Pipelines
diabetes-data		diabetes-data
finalgolddata		finalgolddata
logs and AI dash		logs and AI dash
Bank Customer Churn Prediction.csv		Bank Customer Churn Prediction.csv
LICENSE		LICENSE
README.md		README.md
amlignore		amlignore
azureml_automl.log		azureml_automl.log
batch_score.py		batch_score.py
conda_env_v_1_0_0.yml		conda_env_v_1_0_0.yml
cs-capability (1).log		cs-capability (1).log
cs-capability.log		cs-capability.log
data-capability.log		data-capability.log
execution-wrapper (1).log		execution-wrapper (1).log
execution-wrapper (2).log		execution-wrapper (2).log
execution-wrapper.log		execution-wrapper.log
finalgolddata.csv		finalgolddata.csv
hosttools-capability (1).log		hosttools-capability (1).log
hosttools-capability (2).log		hosttools-capability (2).log
hosttools-capability.log		hosttools-capability.log
hyperdrive.txt		hyperdrive.txt
lifecycler (1).log		lifecycler (1).log
lifecycler.log		lifecycler.log
metrics-capability (1).log		metrics-capability (1).log
metrics-capability.log		metrics-capability.log
model.pkl		model.pkl
online_score.cpython-310.pyc		online_score.cpython-310.pyc
online_score.py		online_score.py
prep.py		prep.py
preprocess.cpython-310.pyc		preprocess.cpython-310.pyc
preprocess.py		preprocess.py
rslex.log.2026-04-09-03		rslex.log.2026-04-09-03
run_aggregate_log.txt		run_aggregate_log.txt
score.py		score.py
scoring_file_v_2_0_0.py		scoring_file_v_2_0_0.py
secrets-capability.log		secrets-capability.log
snapshot-capability (1).log		snapshot-capability (1).log
snapshot-capability.log		snapshot-capability.log
std_log (1).txt		std_log (1).txt
std_log.txt		std_log.txt
sweep_train.cpython-310.pyc		sweep_train.cpython-310.pyc
sweep_train.py		sweep_train.py
train (1).py		train (1).py
train.cpython-310 (1).pyc		train.cpython-310 (1).pyc
train.cpython-310.pyc		train.cpython-310.pyc
train.py		train.py
train_mlflow.py		train_mlflow.py
train_pipeline.py		train_pipeline.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

☁️ Azure ML Labs

📌 Overview

📁 Project Structure

🚀 Featured Projects

1. Gold Price Predictor

2. Bank Customer Churn Classifier

3. Diabetes Prediction

🛠️ Tech Stack

📦 Setup & Installation

Prerequisites

Clone & Configure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

☁️ Azure ML Labs

📌 Overview

📁 Project Structure

🚀 Featured Projects

1. Gold Price Predictor

2. Bank Customer Churn Classifier

3. Diabetes Prediction

🛠️ Tech Stack

📦 Setup & Installation

Prerequisites

Clone & Configure

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages