Udemy

From 0 to 1: Hive for Processing Big Data

Enroll Now
  • 8,521 Students
  • Updated 1/2018
4.4
(1,038 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
15 Hour(s) 15 Minute(s)
Language
English
Taught by
Loony Corn
Rating
4.4
(1,038 Ratings)
8 views

Course Overview

From 0 to 1: Hive for Processing Big Data

End-to-End Hive : HQL, Partitioning, Bucketing, UDFs, Windowing, Optimization, Map Joins, Indexes

Prerequisites: Hive requires knowledge of SQL. The course includes and SQL primer at the end. Please do that first if you don't know SQL. You'll need to know Java if you want to follow the sections on custom functions.


Taught by a 4 person team including 2 Stanford-educated, ex-Googlers and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data.


Hive is like a new friend with an old face (SQL). This course is an end-to-end, practical guide to using Hive for Big Data processing.


Let's parse that


A new friend with an old face: Hive helps you leverage the power of Distributed computing and Hadoop for Analytical processing. It's interface is like an old friend : the very SQL like HiveQL. This course will fill in all the gaps between SQL and what you need to use Hive.


End-to-End: The course is an end-to-end guide for using Hive: whether you are analyst who wants to process data or an Engineer who needs to build custom functionality or optimize performance - everything you'll need is right here. New to SQL? No need to look elsewhere. The course has a primer on all the basic SQL constructs, .


Practical: Everything is taught using real-life examples, working queries and code .


What's Covered:


Analytical Processing: Joins, Subqueries, Views, Table Generating Functions, Explode, Lateral View, Windowing and more


Tuning Hive for better functionality: Partitioning, Bucketing, Join Optimizations, Map Side Joins, Indexes, Writing custom User Defined functions in Java. UDF, UDAF, GenericUDF, GenericUDTF, Custom functions in Python, Implementation of MapReduce for Select, Group by and Join


For SQL Newbies: SQL In Great Depth

Course Content

  • 19 section(s)
  • 87 lecture(s)
  • Section 1 You, Us & This Course
  • Section 2 Introducing Hive
  • Section 3 Hadoop and Hive Install
  • Section 4 Hadoop and HDFS Overview
  • Section 5 Hive Basics
  • Section 6 Built-in Functions
  • Section 7 Sub-Queries
  • Section 8 Partitioning
  • Section 9 Bucketing
  • Section 10 Windowing
  • Section 11 Understanding MapReduce
  • Section 12 MapReduce logic for queries: Behind the scenes
  • Section 13 Join Optimizations in Hive
  • Section 14 Custom Functions in Python
  • Section 15 Custom functions in Java
  • Section 16 SQL Primer - Select Statemets
  • Section 17 SQL Primer - Group By, Order By and Having
  • Section 18 SQL Primer - Joins
  • Section 19 Appendix

What You’ll Learn

  • Write complex analytical queries on data in Hive and uncover insights, Leverage ideas of partitioning, bucketing to optimize queries in Hive, Customize hive with user defined functions in Java and Python , Understand what goes on under the hood of Hive with HDFS and MapReduce


Reviews

  • K
    Kittipong Sirisatjanulak
    4.5

    good

  • K
    Kylie Ma
    4.5

    1

  • S
    Subramaniam Panakudi Lakshmanan
    5.0

    Very detailed course.

  • A
    Alex Epperly
    5.0

    Although this course does not have the most viewers of any Hive class on Udemy, it is by far the best and most thorough one I have found.

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed