Course Information
Course Overview
Learn all the fundamentals of PySpark
Spark is one of the most in-demand Big Data processing frameworks right now.
This course will take you through the core concepts of PySpark. We will work to enable you to do most of the things you’d do in SQL or Python Pandas library, that is:
Getting hold of data
Handling missing data and cleaning data up
Aggregating your data
Filtering it
Pivoting it
And Writing it back
All of these things will enable you to leverage Spark on large datasets and start getting value from your data.
Let’s get started.
Course Content
- 5 section(s)
- 20 lecture(s)
- Section 1 Introduction
- Section 2 A Scenario To Get Us Started
- Section 3 Core Concepts
- Section 4 Challenge
- Section 5 Conclusion
What You’ll Learn
- PySpark, Apache Spark, Big Data Analytics, Big Data Processing, Python
Skills covered in this course
Reviews
-
SSeeni Kannan Manikka
The expected topics were covered with clear explanation. thanks
-
AAtmakuru Madhav Tarun
There was no explanation of spark and it's components.
-
NNAGESH Vaasa
Good for basics. good if you can add different types of JSON reading and explode function. also, more coverage on calling sql end along with reading real time streaming data.
-
AAvish Kadam
Training is very good with more practical examples