Course Information
Course Overview
Provide higher-level language to facilitate large-data processing.
This course is part of “Big data Internship Program” which is aligned to a typical Big data project life cycle stage.
- Foundation
- Ingestion
- Storage
- Processing
- Visualization
This course is focused on Data Processing in Big data.This course is suitable for developers, data analysts and business analysts. Experience with SQL and scripting languages is recommended, but is not required.
You will learn
- Understanding of Hive core concept and architecture.
- How to create and manipulate tables using Hive.
- Advanced features of Hive.
- Hive Best Practices
- Performing real-time, complex queries on datasets
- Pig’s Architecture
- Reading and Writing Data with Pig
- Pig Best Practices
Project work -
- Provide Data in Hive and manipulate the data for Our Book Recommendation project.
- One Ad-on project -- Data Masking with hive and sqoop
Course Content
- 5 section(s)
- 34 lecture(s)
- Section 1 Data Processing Introduction in Big Data
- Section 2 HIve
- Section 3 Pig
- Section 4 Data Processing in Recommendation Project
- Section 5 Ad-on Project Data Masking
What You’ll Learn
- Have excellent understanding of Apache Hive and Pig tool with hands-on experience ., Understand the working of a project in real-world scenario., Work experience in end-to-end Project ( Data Masking) and can mention in Resume .
Skills covered in this course
Reviews
-
PPattabi Srikanth
It is very use full to students
-
SShaik Mustapha
Very much informative and crystal clear, thanks to instructor
-
EEmmanuel Adzotor
Clear and good presentation
-
SSridharan Sundrarajan
It is very good content to understand high level concepts and usage. However, there is no PIG specific lab section with sample jobs yet. May be i have to go through remaining half and comment further.