Databricks 14-Days AI Challenge-1 is designed to help beginners build a strong foundation in Databricks through daily learning, hands-on practice, and real-world problem solving.
𝐖𝐡𝐚𝐭 𝐢𝐬 𝐭𝐡𝐞 𝐃𝐚𝐭𝐚𝐛𝐫𝐢𝐜𝐤𝐬 𝐂𝐡𝐚𝐥𝐥𝐞𝐧𝐠𝐞?
It’s a hands-on learning challenge where you don’t just read concepts, you apply them. You work on a managed cloud platform using:
⚡ SQL
⚡Python
⚡Spark
There’s also a free Databricks Community Edition, making it easy for anyone to start exploring real datasets.
🔗 Reference: Databricks Community Edition https://lnkd.in/gE8pgVRJ
𝐖𝐡𝐲 𝐢𝐬 𝐭𝐡𝐢𝐬 𝐢𝐦𝐩𝐨𝐫𝐭𝐚𝐧𝐭 (𝐞𝐬𝐩𝐞𝐜𝐢𝐚𝐥𝐥𝐲 𝐟𝐨𝐫 𝐚𝐧𝐚𝐥𝐲𝐬𝐭𝐬)?
Most analysts already know SQL and Python — but this challenge helps you:
🔸 Query and analyze data directly in Databricks using SQL
🔸Build dashboards straight from the SQL editor
🔸Understand how data moves through pipelines
🔸Get exposure to Jobs & Pipelines, which are core data engineering concepts
With AI becoming part of almost every data role, understanding data engineering fundamentals significantly boosts an analyst’s career potential.
𝐖𝐡𝐚𝐭 𝐭𝐡𝐢𝐬 𝐜𝐡𝐚𝐥𝐥𝐞𝐧𝐠𝐞 𝐭𝐚𝐮𝐠𝐡𝐭 𝐦𝐞?
Progress doesn’t happen in a single day, it needs hardwork, consistent practice and passion for learning. You don’t truly remember concepts by just reading or watching them You remember them when you build, break, fix, and rebuild
🧠 What actually happened in these 14 days:
- Spark stopped looking scary
- Delta Lake started making sense
- Bronze, Silver, Gold stopped sounding like jewellery 💍
- SQL dashboards became my comfort zone
- MLflow taught me why tracking experiments matters
- AI (Genie & Mosaic AI) showed up and said: “Why type SQL when you can just ask questions?”
| Day | Challenge | Status |
|---|---|---|
| Day 01 | Platform Setup & First Steps | ✅ Completed |
| Day 02 | Apache Spark Fundamentals | ✅ Completed |
| Day 03 | PySpark Transformations Deep Dive | ✅ Completed |
| Day 04 | Delta Lake Introduction | ✅ Completed |
| Day 05 | Delta Lake Advanced | ✅ Completed |
| Day 06 | Medallion Architecture | ✅ Completed |
| Day 07 | Workflows & Job Orchestration | ✅ Completed |
| Day 08 | Unity Catalog Governance | ✅ Completed |
| Day 09 | SQL Analytics & Dashboards | ✅ Completed |
| Day 10 | Performance Optimization | ✅ Completed |
| Day 11 | Statistical Analysis & ML Prep | ✅ Completed |
| Day 12 | MLflow Basics | ✅ Completed |
| Day 13 | Model Comparison & Feature Engineering | ✅ Completed |
| Day 14 | AI-Powered Analytics: Genie & Mosaic AI | ✅ Completed |
-
Daily Workflow:
- Navigate to the day's folder (e.g.,
Day-01) - Read the challenge description in the README.md
- Write/Create solution file
- Document approach and learnings in the README.md
- Navigate to the day's folder (e.g.,
-
Adding Solutions:
- Each day's folder contains a README.md template
- Add DataBricks solution files
- Update the main README table with completion status
-
Tracking Progress:
- Update the status in the table above as you complete each challenge
- Use emojis: ✅ Completed | 🚧 In Progress
- DataBricks Free Edition ( Delta Lake , Pyspark , SparkSQL , MLflow , Genie , Mosaic AI )
- DataBricks Fundamentals
This is a personal learning repository for the Databricks-14-Days-Challenge. Feel free to star my repo or fork it and use the structure for your own learning journey!
- Databricks Documentation
- PySpark API
- Delta Lake
- MLflow
- Kaggle Dataset
- Databricks 14 day challenge notion link
@databricks @codebasics @indiandataclub #DatabricksWithIDC
