Udemy

Pyspark Foundation for Data Engineering | Beginners

  • 810 students
  • Updated 1/2025
Rating: 4.3 (70 ratings)
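Course information

Enrollment date: year-round
Course level: (not listed)
Learning mode: (not listed)
Duration: 1 hour 13 minutes
Language of instruction: English
Instructor: Akash Sunil Pawar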

Course overview

Data Engineering, PySpark, Coding exercise

This course prepares you for the basics of a real-world Data Engineer role!

Learn to code PySpark like a real-world developer. The main focus is on practical applications of PySpark, bridging the gap between academic knowledge and practical skill.

In this course we will get to know and apply a few of the most essential, basic PySpark functions that are used frequently in scripting for any PySpark-based project.


About PySpark:

Learn the latest Big Data Technology - Spark! And learn to use it with one of the most popular programming languages, Python!

One of the most valuable technology skills is the ability to analyze huge data sets, and this course is specifically designed to bring you up to speed on one of the best technologies for this task, Apache Spark! The top technology companies like Google, Facebook, Netflix, Airbnb, Amazon, NASA, and more are all using Spark to solve their big data problems!

Spark can run some workloads up to 100x faster than Hadoop MapReduce, which has caused an explosion in demand for this skill! Because the Spark 2.0 DataFrame framework is so new, you now have the ability to quickly become one of the most knowledgeable people in the job market!


What you will learn:

  • SparkSession and imports

  • Spark DataFrame and its characteristics

  • Syntax and example

  • Print results

  • Understanding the data

  • Number of records

  • Columns in a DataFrame

  • Describe a DataFrame

  • Schema of a DataFrame

  • Create a new column

  • Arithmetic operations on Data

  • Change column data type

  • Create a column with integer as constant

  • Apply what we know

  • Rounding of digits

  • Sorting operation

  • Drop columns

  • Rename columns

  • Create a column with string as constant

  • Conditional Statements

  • Changing case of a column

  • Filter operations

  • Grouping and aggregations


Prerequisites:

  • Some basic programming skills (not mandatory)

  • Willingness to put theoretical knowledge into practice.


Who this course is for:

  • Beginners who want to learn Big Data or experienced people who want to transition to a Big Data role

  • Big data beginners who want to learn how to code in the real world

  • Aspiring candidates for a data engineering role

Course chapters

  • 1 chapter
  • 24 lectures
  • Chapter 1: Lets Begin!

Course content

  • Fundamentals of PySpark
  • Hands-on experience in PySpark
  • Understanding of data using PySpark
  • Performing various operations on DataFrame

Reviews

  • Juhi Singh
    2.5

    knowledge is ok but where to practice is not told

  • David
    1.0

    Too basic, I could have found a better tutorial on YouTube, I thought that because it was paid it would have better content but it wasn't like that

  • Laxmi Bogam
    4.0

    Very helpful for me.

  • Lili Duan
    5.0

    It is benefit to beginners very much. These are about the basic concept and functions. Hope there will be something connected with the project.
