Course Information
Course Overview
Master Databricks from Scratch: Architecture, Core Components, SQL, Python, Delta Lake, and hands-on Machine Learning
Welcome to Introduction to Databricks, your practical, beginner-friendly guide to one of the most in-demand data platforms in the industry today.
My name is Dimitar Shtev, and I’ve spent nearly a decade working with data across analytics, engineering, and machine learning. In this course, I will walk you step-by-step through everything you need to confidently start using Databricks—even if you’ve never worked with it before.
We begin by exploring what Databricks is, why organizations rely on it, and how its major components fit together. You’ll learn about the Data Lakehouse, Delta Lake, SQL, Databricks Runtime, and the architecture that powers scalable data processing.
From there, we move into hands-on learning using Databricks’ free-for-life Community Edition. You will run Python scripts, configure compute resources, organize your workspace, load CSV files, and build interactive visualizations and dashboards.
Finally, we put everything into practice with a real machine learning project:
You will collect real Bitcoin price data, prepare it for modeling, convert SQL tables into PySpark objects, train a regression model, and display future price predictions inside Databricks.
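To give a feel for the modeling step described above, here is a minimal sketch of fitting a linear trend and projecting it forward. It is not the course's code: the course works with PySpark inside Databricks, while this sketch uses plain Python so it runs anywhere. The prices are made-up placeholder values (not real Bitcoin data), and the names `fit_line` and `forecast` are hypothetical helpers introduced only for illustration.

```python
# Minimal sketch: ordinary least-squares fit of price against day index,
# followed by a short forward projection. Plain Python, no Spark required.

def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and sum of cross-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def forecast(slope, intercept, start_day, horizon):
    """Evaluate the fitted line for `horizon` consecutive days from `start_day`."""
    return [intercept + slope * day for day in range(start_day, start_day + horizon)]


# Made-up daily closing prices used purely as placeholder input.
prices = [100.0, 102.0, 104.0, 106.0, 108.0, 110.0]
days = list(range(len(prices)))

slope, intercept = fit_line(days, prices)
future = forecast(slope, intercept, start_day=len(prices), horizon=3)
print(future)
```

A straight-line fit is the simplest form of regression and is used here only to show the shape of the workflow: prepare the inputs, fit the model, then evaluate it on future dates.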
Whether you’re a data analyst, data engineer, BI professional, developer, or aspiring data scientist, Databricks is becoming an essential tool, and this course gives you both the foundations and the confidence to use it in real-world scenarios.
By the end, you’ll have a functional Databricks environment, hands-on experience, and practical workflows you can apply at work the very next day.
Course Content
- 5 sections
- 28 lectures
- Section 1: Welcome to Introduction to Databricks
- Section 2: Databricks Core Components
- Section 3: Databricks Architecture
- Section 4: Databricks in Action
- Section 5: Machine Learning - Regression Analysis
What You’ll Learn
- Understand what Databricks is and how organizations use it
- Explain core components such as workspaces, clusters, notebooks, jobs, and repos
- Describe Databricks architecture, Delta Lake, Medallion architecture, and the Lakehouse concept
- Navigate the Databricks workspace and use the free Databricks Community Edition
- Run Python scripts inside Databricks and configure compute resources
- Load, clean, transform, and visualize data using Databricks tools
- Build dashboards and create SQL tables with Python
- Convert data between SQL tables and PySpark DataFrames
- Apply basic data science concepts in Databricks
Reviews
- Javier Ivan Ramirez Borquez: A good basic overview of the tool.