Hands-On Labs
Throughout the course, hands-on labs help students build their knowledge and apply the concepts being discussed. By the end of the course, participants will be able to import and analyze their own data in Apache Hadoop.
Labs include:
Importing flat-file data into HDFS
Running MapReduce jobs
Writing MapReduce code in Java, or using the Hadoop Streaming API
Importing data into HDFS from relational database management systems
Implementing an inverted index in Hadoop
Manipulating data with Hive and Pig
Creating pipelines of MapReduce jobs with Oozie
This four-day training course from Cloudera is for developers who want to learn to use Apache Hadoop to build powerful data processing applications.
You will learn:
How MapReduce and the Hadoop Distributed File System work
How to write MapReduce code in Java or other programming languages
What issues to consider when developing MapReduce jobs
How to implement common algorithms in Hadoop
Best practices for Hadoop development and debugging
How to leverage other project such as Apache Hive, Apache Pig, Sqoop and Oozie
Advanced Hadoop API topics required for real-world data analysis
Certification Exam
Following the training, attendees will have an opportunity to take become a Cloudera Certified Developer for Apache Hadoop (CCDH).
Course Pre-Requisites
This course is designed for developers with some programming experience (preferably Java). Existing knowledge of Hadoop is not required.
Course Contents
The course covers the following topics:
The Motivation For Hadoop
Problems with traditional large-scale systems
Requirements for a new approach
Hadoop: Basic Concepts
An Overview of Hadoop
The Hadoop Distributed File System
Hands-On Exercise
How MapReduce Works
Hands-On Exercise
Anatomy of a Hadoop Cluster
Other Hadoop Ecosystem Components
Writing a MapReduce Program
The MapReduce Flow
Examining a Sample MapReduce Program
Basic MapReduce API Concepts
The Driver Code
The Mapper
The Reducer
Hadoop's Streaming API
Using Eclipse for Rapid Development
Hands-On Exercise
Integrating Hadoop Into The Workflow
Relational Database Management Systems
Storage Systems
Importing Data from RDBMSs With Sqoop
Hands-On Exercise
Importing Real-Time Data with Flume
Accessing HDFS Using FuseDFS and Hoop
Delving Deeper Into The Hadoop API
Using Combiners
Using LocalJobRunner Mode for Faster Development
Reducing Intermediate Data with Combiners
The configure and close methods for MapReduce Setup and Teardown
Writing Partitioners for Better Load Balancing
Directly Accessing HDFS
Using The Distributed Cache
Hands-On Exercise
Common MapReduce Algorithms
Sorting and Searching
Indexing
Machine Learning with Mahout
Term Frequency - Inverse Document Frequency
Word Co-Occurrence
Hands-On Exercise
Using Hive and Pig
Hive Basics
Pig Basics
Hands-On Exercise
Practical Development Tips and Techniques
Testing with MRUnit
Debugging MapReduce Code
Using LocalJobRunner Mode for Easier Debugging
Retrieving Job Information with Counters
Logging
Splittable File Formats
Determining the Optimal Number of Reducers
Map-Only MapReduce Jobs
Implementing Multiple Mappers using ChainMapper
Hands-On Exercise
More Advanced MapReduce Programming
Custom Writables and WritableComparables
Saving Binary Data using SequenceFiles and Avro Files
Creating InputFormats and OutputFormats
Hands-On Exercise
Joining Data Sets in MapReduce Jobs
Map-Side Joins
The Secondary Sort
Reduce-Side Joins
Hands-On Exercise
Graph Manipulation in Hadoop
Introduction to graph techniques
Representing Graphs in Hadoop
Implementing a sample algorithm: Single Source Shortest Path
Creating Workflows with Oozie
The Motivation for Oozie
Oozie's Workflow Definition Format
Hands-On Exercise
Cloudera Certified Developer Exam
Feed Readers (RSS/XML)
SUBSCRIBE
Loading...
Is this your event?
Claim it
Cloudera Developer Training for Apache Hadoop - SF bay area - Feb 6-9
Friday, Feb 10 9:00a
at
Seaport Conference Center,
Redwood City,
CA
Age Suitability:
None Specified
Tags:
conferences, seminars, training, pig, cloud, hadoop, apache, hive, data warehousing, big data, mapreduce, hdfs, cloudera, nosql, hbase, training-developer, mapr
Category:
Other
Creator: eventbrite
Creator: eventbrite
Location & Nearby Info
Show nearby:
Don't Miss This
Sponsored Listings
Hot Tickets
More »
ON SALE NOW
-
Fri 6/8 8:00p
-
Sun 6/3 7:30p
-
Fri 6/1 12:00p
-
Tue 6/12 8:00p
-
Tue 7/10 8:00p
-
Sat 6/2 8:00p
Other Events
| 6/7 | 9:00a | Hadoop Training with MapReduce |
| 7/16 | 9:00a | Hadoop Administration - July 16, 2012 |
| 7/19 | 9:00a | Hadoop Overview for Managers - July 19, 2012 |
| 8/23 | 9:00a | Hadoop Training with MapReduce - August 20, 2012 |
| 9/6 | 9:00a | Hadoop Administration |
| 9/20 | 9:00a | Hadoop Overview for Managers - September 20, 2012 |
| 10/13 | 9:00a | Hadoop Training with MapReduce - October 10, 2012 |
| 11/12 | 9:00a | Hadoop Administration - November 12, 2012 |
| 11/22 | 9:00a | Hadoop Overview for Managers - November 22, 2012 |
| 12/20 | 9:00a | Hadoop Training with MapReduce - December 17, 2012 |
add to our listings








