Udemy

Java Parallel Computation on Hadoop

Enroll Now
  • 14,940 Students
  • Updated 8/2014
4.3
(115 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
2 Hour(s) 46 Minute(s)
Language
English
Rating
4.3
(115 Ratings)

Course Overview

Java Parallel Computation on Hadoop

Learn to write real, working data-driven Java programs that can run in parallel on multiple machines by using Hadoop.

Build your essential knowledge with this hands-on, introductory course on the Java parallel computation using the popular Hadoop framework:


- Getting Started with Hadoop


- HDFS working mechanism


- MapReduce working mecahnism


- An anatomy of the Hadoop cluster


- Hadoop VM in pseudo-distributed mode


- Hadoop VM in distributed mode


- Elaborated examples in using MapReduce


Learn the Widely-Used Hadoop Framework


Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware. Hadoop is an Apache top-level project being built and used by a global community of contributors and users. It is licensed under the Apache License 2.0.


All the modules in Hadoop are designed with a fundamental assumption that hardware failures (of individual machines, or racks of machines) are common and thus should be automatically handled in software by the framework. Apache Hadoop's MapReduce and HDFS components originally derived respectively from Google's MapReduce and Google File System (GFS) papers.


Who are using Hadoop for data-driven applications?


You will be surprised to know that many companies have adopted to use Hadoop already. Companies like Alibaba, Ebay, Facebook, LinkedIn, Yahoo! is using this proven technology to harvest its data, discover insights and empower their different applications!


Contents and Overview


As a software developer, you might have encountered the situation that your program takes too much time to run against large amount of data. If you are looking for a way to scale out your data processing, this is the course designed for you. This course is designed to build your knowledge and use of Hadoop framework through modules covering the following:


- Background about parallel computation


- Limitations of parallel computation before Hadoop


- Problems solved by Hadoop


- Core projects under Hadoop - HDFS and MapReduce


- How HDFS works


- How MapReduce works


- How a cluster works


- How to leverage the VM for Hadoop learning and testing


- How the starter program works


- How the data sorting works


- How the pattern searching


- How the word co-occurrence


- How the inverted index works


- How the data aggregation works


- All the examples are blended with full source code and elaborations


Come and join us! With this structured course, you can learn this prevalent technology in handling Big Data.

Course Content

  • 12 section(s)
  • 43 lecture(s)
  • Section 1 Overview
  • Section 2 Background knowledge about Hadoop
  • Section 3 The Hadoop Ecosystem
  • Section 4 Get Ready in pseudo-distributed mode
  • Section 5 Get Ready in distributed mode
  • Section 6 Large-scale Word Counting
  • Section 7 Large-scale Data Sorting
  • Section 8 Large-scale Pattern Searching
  • Section 9 Large-scale Item Co-occurrence
  • Section 10 Large-scale Inverted Index
  • Section 11 Large-scale Data Aggregation
  • Section 12 Data Preparation

What You’ll Learn

  • Know the essential concepts about Hadoop, Know how to setup a Hadoop cluster in pseudo-distributed mode, Know how to setup a Hadoop cluster in distributed mode (3 physical nodes), Know how to develop Java programs to parallelize computations on Hadoop

Reviews

  • W
    Wojciech Domalewski
    3.0

    Subject is very interesting but presentation is far from perfect.

  • S
    Shubham Jagannath Patil
    4.5

    Audio intensity is low.

  • S
    Subhankar sarkar
    3.0

    Yess this course is very helpful for my upcoming career

  • V
    Vivek Kalewar
    5.0

    IT GREAT TO LEARN WITH UDEMY THEY EXPLAIN IN EASY WAY

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed