Skip to content

lk-learner/Databricks-14-Days-Challenge-1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

194 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Databricks-14-Days-Challenge-1

Databricks 14-Days AI Challenge-1 is designed to help beginners build a strong foundation in Databricks through daily learning, hands-on practice, and real-world problem solving.



𝐖𝐡𝐚𝐭 𝐢𝐬 𝐭𝐡𝐞 𝐃𝐚𝐭𝐚𝐛𝐫𝐢𝐜𝐤𝐬 𝐂𝐡𝐚𝐥𝐥𝐞𝐧𝐠𝐞?

It’s a hands-on learning challenge where you don’t just read concepts, you apply them. You work on a managed cloud platform using:

⚡ SQL

⚡Python

⚡Spark

There’s also a free Databricks Community Edition, making it easy for anyone to start exploring real datasets.

🔗 Reference: Databricks Community Edition https://lnkd.in/gE8pgVRJ

𝐖𝐡𝐲 𝐢𝐬 𝐭𝐡𝐢𝐬 𝐢𝐦𝐩𝐨𝐫𝐭𝐚𝐧𝐭 (𝐞𝐬𝐩𝐞𝐜𝐢𝐚𝐥𝐥𝐲 𝐟𝐨𝐫 𝐚𝐧𝐚𝐥𝐲𝐬𝐭𝐬)?

Most analysts already know SQL and Python — but this challenge helps you:

🔸 Query and analyze data directly in Databricks using SQL

🔸Build dashboards straight from the SQL editor

🔸Understand how data moves through pipelines

🔸Get exposure to Jobs & Pipelines, which are core data engineering concepts

With AI becoming part of almost every data role, understanding data engineering fundamentals significantly boosts an analyst’s career potential.

𝐖𝐡𝐚𝐭 𝐭𝐡𝐢𝐬 𝐜𝐡𝐚𝐥𝐥𝐞𝐧𝐠𝐞 𝐭𝐚𝐮𝐠𝐡𝐭 𝐦𝐞?

Progress doesn’t happen in a single day, it needs hardwork, consistent practice and passion for learning. You don’t truly remember concepts by just reading or watching them You remember them when you build, break, fix, and rebuild

🧠 What actually happened in these 14 days:

  • Spark stopped looking scary
  • Delta Lake started making sense
  • Bronze, Silver, Gold stopped sounding like jewellery 💍
  • SQL dashboards became my comfort zone
  • MLflow taught me why tracking experiments matters
  • AI (Genie & Mosaic AI) showed up and said: “Why type SQL when you can just ask questions?”

📅 Daily Challenges

Day Challenge Status
Day 01 Platform Setup & First Steps ✅ Completed
Day 02 Apache Spark Fundamentals ✅ Completed
Day 03 PySpark Transformations Deep Dive ✅ Completed
Day 04 Delta Lake Introduction ✅ Completed
Day 05 Delta Lake Advanced ✅ Completed
Day 06 Medallion Architecture ✅ Completed
Day 07 Workflows & Job Orchestration ✅ Completed
Day 08 Unity Catalog Governance ✅ Completed
Day 09 SQL Analytics & Dashboards ✅ Completed
Day 10 Performance Optimization ✅ Completed
Day 11 Statistical Analysis & ML Prep ✅ Completed
Day 12 MLflow Basics ✅ Completed
Day 13 Model Comparison & Feature Engineering ✅ Completed
Day 14 AI-Powered Analytics: Genie & Mosaic AI ✅ Completed

🎯 How to Use This Repository

  1. Daily Workflow:

    • Navigate to the day's folder (e.g., Day-01)
    • Read the challenge description in the README.md
    • Write/Create solution file
    • Document approach and learnings in the README.md
  2. Adding Solutions:

    • Each day's folder contains a README.md template
    • Add DataBricks solution files
    • Update the main README table with completion status
  3. Tracking Progress:

    • Update the status in the table above as you complete each challenge
    • Use emojis: ✅ Completed | 🚧 In Progress

🛠️ Technologies

  • DataBricks Free Edition ( Delta Lake , Pyspark , SparkSQL , MLflow , Genie , Mosaic AI )

📖 Learning Goals

  • DataBricks Fundamentals

🤝 Contributing

This is a personal learning repository for the Databricks-14-Days-Challenge. Feel free to star my repo or fork it and use the structure for your own learning journey!


📚 Additional Resources


@databricks @codebasics @indiandataclub #DatabricksWithIDC

About

Indian Data Club : Databricks 14-Days Challenge-1 is designed to help beginners build a strong foundation in Databricks through daily learning, hands-on practice, and problem solving.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors