Udemy

Databricks Fundamentals & Apache Spark Core

Enroll Now
  • 29,584 Students
  • Updated 9/2023
  • Certificate Available
4.4
(2,966 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
12 Hour(s) 8 Minute(s)
Language
English
Taught by
Wadson Guimatsa
Certificate
  • Available
  • *The delivery and distribution of the certificate are subject to the policies and arrangements of the course provider.
Rating
4.4
(2,966 Ratings)

Course Overview

Databricks Fundamentals & Apache Spark Core

Learn how to process big-data using Databricks & Apache Spark 2.4 and 3.0.0 - DataFrame API and Spark SQL

Welcome to this course on Databricks and Apache Spark 2.4 and 3.0.0

Apache Spark is a Big Data Processing Framework that runs at scale.
In this course, we will learn how to write Spark Applications using Scala and SQL.

Databricks is a company founded by the creator of Apache Spark.
Databricks offers a managed and optimized version of Apache Spark that runs in the cloud.

The main focus of this course is to teach you how to use the DataFrame API & SQL to accomplish tasks such as:

  • Write and run Apache Spark code using Databricks

  • Read and Write Data from the Databricks File System - DBFS

  • Explain how Apache Spark runs on a cluster with multiple Nodes

Use the DataFrame API and SQL to perform data manipulation tasks such as

  • Selecting, renaming and manipulating columns

  • Filtering, dropping and aggregating rows

  • Joining DataFrames

  • Create UDFs and use them with DataFrame API or Spark SQL

  • Writing DataFrames to external storage systems

List and explain the element of Apache Spark execution hierarchy such as

  • Jobs

  • Stages

  • Tasks


Course Content

  • 8 section(s)
  • 72 lecture(s)
  • Section 1 Setup
  • Section 2 Introduction to Databricks and Apache Spark
  • Section 3 The DataFrame API: Basics
  • Section 4 The DataFrame API: Transforming Data
  • Section 5 Spark SQL & SQL Fundamentals
  • Section 6 Working with different type of data
  • Section 7 Data Sources
  • Section 8 Become Apache Spark Certified

What You’ll Learn

  • Databricks
  • Apache Spark Architecture
  • Apache Spark DataFrame API
  • Apache Spark SQL
  • Selecting, and manipulating columns of a DataFrame
  • Filtering, dropping, sorting rows of a DataFrame
  • Joining, reading, writing and partitioning DataFrames
  • Aggregating DataFrames rows
  • Working with User Defined Functions
  • Use the DataFrameWriter API

Skills covered in this course


Reviews

  • C
    Chandu Chandu
    4.0

    Not much covered on Databricks. It is mainly about Apache Spark only.

  • G
    Gaurav Dhase
    1.0

    You are teaching on very basic version of databricks which is not relevant in todays world. I am not liking your videos. Please refund me if possible.

  • C
    Chidozie Nkwor
    4.0

    Yes its a good match for me.

  • J
    JULIO CESAR SILVA DA LUZ
    5.0

    This course is Brilliant.

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed