2023 2024 Student Forum > Management Forum > Entrance Exams

 
  #2  
21st December 2015, 11:30 AM
Super Moderator
 
Join Date: Apr 2013
Re: Hadoop Developer Training

Hadoop Developer Training Course is a comprehensive study of Big Data using Hadoop offered by the Intellipaat

The course topics include

Introduction to Hadoop and its Ecosystem,
MapReduce and HDFS and MapReduce Abstraction.
Introduction,
Installation and Implementation of advanced platforms like Hive, Pig, Flume, Sqoop, Oozie and Yarn.

This course also trains on Hbase Architecture, Advance MapReduce Jobs and Hadoop Cluster Management.

Pre–Requisites:
Some prior experience in Core Java and good analytical skills

Basic knowledge of Unix, sql scripting

Prior knowledge of Apache Hadoop is not required

Course content
Module 1 – Introduction to Hadoop and its Ecosystem, Map Reduce and HDFS
Big Data, Factors constituting Big Data
Hadoop and Hadoop Ecosystem
Map Reduce -Concepts of Map, Reduce, Ordering, Concurrency, Shuffle, Reducing, Concurrency
Hadoop Distributed File System (HDFS) Concepts and its Importance
Deep Dive in Map Reduce – Execution Framework, Partitioner, Combiner, Data Types, Key pairs
HDFS Deep Dive – Architecture, Data Replication, Name Node, Data Node, Data Flow
Parallel Copying with DISTCP, Hadoop Archives

Assignment – 1

Module 2 – Hands on Exercises
Installing Hadoop in Pseudo Distributed Mode, Understanding Important configuration files, their Properties and Demon Threads
Accessing HDFS from Command Line
Map Reduce – Basic Exercises
Understanding Hadoop Eco-system
Introduction to Sqoop, use cases and Installation
Introduction to Hive, use cases and Installation
Introduction to Pig, use cases and Installation
Introduction to Oozie, use cases and Installation
Introduction to Flume, use cases and Installation
Introduction to Yarn

Assignment -2 and 3

Mini Project – Importing Mysql Data using Sqoop and Querying it using Hive

Module 3 – Deep Dive in Map Reduce and Yarn
How to develop Map Reduce Application, writing unit test
Best Practices for developing and writing, Debugging Map Reduce applications
Joining Data sets in Map Reduce
Hadoop API’s
Introduction to Hadoop Yarn
Difference between Hadoop 1.0 and 2.0

Module 3.1
Project 1- Hands on exercise – end to end PoC using Yarn or Hadoop 2.
Real World Transactions handling of Bank
Moving data using Sqoop to HDFS
Incremental update of data to HDFS
Running Map Reduce Program
Running Hive queries for data analytics
Project 2- Hands on exercise – end to end PoC using Yarn or Hadoop 2.0

Running Map Reduce Code for Movie Rating and finding their fans and average rating

Assignment -4 and 5

Module 4 – Deep Dive in Pig
1. Introduction to Pig
What Is Pig?
Pig’s Features
Pig Use Cases
Interacting with Pig

2. Basic Data Analysis with Pig
Pig Latin Syntax
Loading Data
Simple Data Types
Field Definitions
Data Output
Viewing the Schema
Filtering and Sorting Data
Commonly-Used Functions
Hands-On Exercise: Using Pig for ETL Processing

3. Processing Complex Data with Pig
Complex/Nested Data Types
Grouping
Iterating Grouped Data
Hands-On Exercise: Analyzing Data with Pig

Assignment – 6

Module 5 – Deep Dive in Hive
1. Introduction to Hive
What Is Hive?
Hive Schema and Data Storage
Comparing Hive to Traditional Databases
Hive vs. Pig
Hive Use Cases
Interacting with Hive

2. Relational Data Analysis with Hive
Hive Databases and Tables
Basic HiveQL Syntax
Data Types
Joining Data Sets
Common Built-in Functions
Hands-On Exercise: Running Hive Queries on the Shell, Scripts, and Hue

3. Hive Data Management
Hive Data Formats
Creating Databases and Hive-Managed Tables
Loading Data into Hive
Altering Databases and Tables
Self-Managed Tables
Simplifying Queries with Views
Storing Query Results
Controlling Access to Data
Hands-On Exercise: Data Management with Hive

4. Hive Optimization
Understanding Query Performance
Partitioning
Bucketing
Indexing Data

Assignment – 7

Module 6 – Introduction to Hbase architecture
What is Hbase
Where does it fits
What is NOSQL

Assignment -8

Module 7 – Hadoop Cluster Setup and Running Map Reduce Jobs
Hadoop Multi Node Cluster Setup using Amazon ec2 – Creating 4 node cluster setup
Running Map Reduce Jobs on Cluster

Assignment – 9, 10

Module 8 – Advance Mapreduce
Delving Deeper Into The Hadoop API
More Advanced Map Reduce Programming, Joining Data Sets in Map Reduce
Graph Manipulation in Hadoop

Assignment – 11, 12

Module 9 – Job and certification support
Major Project, Hadoop Development, cloudera Certification Tips and Guidance and Mock Interview Preparation, Practical Development Tips and Techniques, certification preparation

Contact Details:
Bangalore
1st Floor, 10th Cross, 28th Main, HSR Layout, Bangalore - 102, Karnataka , India
+91 9784286179

Rajasthan
A16 A,Van Vihar Colony, Tonk Road, Jaipur-302015 (Rajasthan) India.
Registered office: 94 A, Vasundhara Colony, Tonk Road, Jaipur-302018
( Rajasthan) India.

UK
Flat 16 Bluepoint Court, 203 Station Road, Harrow, Middlesex HA1 2TS, UK.

USA
1219 E. Hillsdale Blvd. Suite 205, Foster City, CA 94404


Quick Reply
Your Username: Click here to log in

Message:
Options




All times are GMT +5. The time now is 05:56 AM.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.
SEO by vBSEO 3.6.0 PL2

1 2 3 4