Course Information
Course Overview
Databricks Data Engineer Associate Course with Practical Examples & Hands-On Training, Master Databricks Skills
Welcome to our comprehensive course on Databricks Certified Data Engineer Associate certification. This course is designed to help you master the skills required to become a certified Databricks data engineer associate.
Databricks is a cloud-based data analytics platform that offers a unified approach to data processing, machine learning, and analytics. With the growing demand for data engineers, Databricks has become one of the most sought-after skills in the industry.
In this course, you'll learn the core concepts of Databricks, including Databricks Lakehouse Platform, ELT with Spark SQL and Python, Incremental Data Processing, Production Pipelines, and Data Governance.
This course is designed by industry experts with years of experience in Databricks and data engineering. This course has theoretical concepts and hands-on labs to help you apply the concepts learned in the course.
Upon completion of the course, you'll be able to take the Databricks Certified Data Engineer Associate exam with confidence and succeed in your career as a data engineer.
At the end of this course you should be able to:
Understand how to use and the benefits of using the Databricks Lakehouse Platform and its tools, including:
Data Lakehouse (architecture, descriptions, benefits)
Data Science and Engineering workspace (clusters, notebooks, data storage)
Delta Lake (general concepts, table management, manipulation, optimizations)
Build ETL pipelines using Apache Spark SQL and Python, including:
Relational entities (databases, tables, views)
ELT (creating tables, writing data to tables, cleaning data, combining and reshaping tables, SQL UDFs)
Python (facilitating Spark SQL with string manipulation and control flow, passing data between PySpark and Spark SQL)
Incrementally process data, including:
Structured Streaming (general concepts, triggers, watermarks)
Auto Loader (streaming reads)
Multi-hop Architecture (bronze-silver-gold, streaming applications)
Delta Live Tables (benefits and features)
Build production pipelines for data engineering applications and Databricks SQL queries and dashboards, including:
Jobs (scheduling, task orchestration, UI)
Dashboards (endpoints, scheduling, alerting, refreshing)
Understand and follow best security practices, including:
Unity Catalog (benefits and features)
Entity Permissions (team-based permissions, user-based permissions)
Enroll now and take the first step towards becoming a certified Databricks data engineer associate.
Course Content
- 10 section(s)
- 85 lecture(s)
- Section 1 Introduction
- Section 2 Getting started with Databricks
- Section 3 Databricks Clusters
- Section 4 Databricks Notebooks
- Section 5 Databricks Lakehouse Platform
- Section 6 ELT with Spark SQL and Python
- Section 7 Accessing Data from Azure Data Lake Storage (ADLS)
- Section 8 Structured Streaming and Auto Loader
- Section 9 Delta Live Tables
- Section 10 Jobs in Databricks
What You’ll Learn
- Databricks Clusters, Notebooks, data storage
- Databricks Lakehouse Platform (architecture, descriptions, benefits)
- Delta Lake
- ELT with Spark SQL and Python
- Relational entities (databases, tables, views)
- Accessing Data from Azure Data Lake Storage (ADLS)
- Structured Streaming, Auto Loader
- Delta Live Tables, Multi-hop Architecture
- Databricks Jobs
- Databricks Dashboards
- Data Governance
Skills covered in this course
Reviews
-
CCARLOS DA SILVA AMARAL
The content is well-structured, clear, and practical, making complex concepts easy to understand. The quizzes and hands-on exercises are extremely helpful for reinforcing knowledge and preparing for the certification exam.
-
CChristian Ivan Jacho Castillo
Exclente. Me pareció interesante la parte del roleplay, sin embargo me gustaría ver mi progreso según voy contestando los temas, para saber si debo profundizar o continuar.
-
TTuncay YAYLALI
İyi dizayn edilmiş ve akıcı bir eğitim
-
GGert Schrijvers
Good course, good examples. The course is a bit outdated, but Databricks is also evolving rapidly. This course has a better structure than the one of Databricks Academy.