Udemy

Data Engineering with Google Dataflow and Apache Beam on GCP

Enroll Now
  • 4,848 Students
  • Updated 5/2023
4.4
(1,005 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
2 Hour(s) 20 Minute(s)
Language
English
Taught by
Cassio Alessandro de Bolba
Rating
4.4
(1,005 Ratings)

Course Overview

Data Engineering with Google Dataflow and Apache Beam on GCP

First steps to Extract, Transform and Load data using Apache Beam and Deploy Pipelines on Google Dataflow

This course wants to introduce you to the Apache Foundation's newest data pipeline development framework: The Apache Beam, and how this feature is becoming popular in partnership with Google Dataflow. In a summary, we want to cover the following topics:


1. Understand your inner workings

2. What are your benefits

3. Explain how to use on your local machine without installation via Google Colab for development

4. Its main functions

5. Configure Apache Beam python SDK locallyvice

6. How to deploy this resource on Google Dataflow to a Batch pipeline


This course is dynamic, you will be receiving updates whenever possible.

It is important to remember that this course does not teach Python, but uses it. So, get comfortable with knowing Python basics, defining a function, creating objects and data types.

Also, if you are interested in learning section 4, which consists of deploying a pipeline on Google Dataflow, you will need to have a free counter in GCP. It's a simple process, but it requires a credit card!


I kindly ask you you to consider all the efforts to put this course together and give a nice rate at the end of the course, even tough the course is simple, it was made with all good intent to share knowledge for cheap price. Thanks and hope you enjoy!

___________________________________________________________________________________________________________


Requirements:

· Basic knowledge of Python

· Have Python 3.7 or greater installed locally (from section 4)

· Free account at GCP (from section 4)


Schedule:

· Section 2 – Concepts

· Section 3 – Main Functions

· Section 4 – Apache Beam on Google Dataflow

Course Content

  • 3 section(s)
  • 21 lecture(s)
  • Section 1 Apache Beam Concepts
  • Section 2 Apache Beam Main Functions
  • Section 3 Apache Beam + GCP = Dataflow

What You’ll Learn

  • Apache Beam
  • ETL
  • Python
  • Google Cloud
  • DataFlow
  • Google Cloud Storage
  • Big Query


Reviews

  • A
    Alejandro Retuert
    3.0

    Ideal para comenzar. Mi sugerencia es que no consuma tanto tiempo en repetir pasos previos y se centre en más opciones disponibles en GCP (uso de Pub/Sub, entre muchos otros servicios), en ver casos en que las cosas van mal y como visualizar los errores en logging, etc.

  • J
    Juan Pablo Vallejo Figueroa
    3.5

    Introductory, but not production level, lack of structure for real projects

  • P
    Prasad Kalekar
    5.0

    clear voice would have been great

  • P
    Pradeep Reddy C N
    4.0

    good course

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed