Mail : training@ecorptrainings.com
India : +91-8143-111-555
USA : +1-703-445-4802
Whats app : +91-8143-110-555
Facebook Twitter Google Plus Pinit Stumbleupon Youtube Blog

Workday HCM Demo New Batches Starting from Wednesday... 14-12-2016
Search Course Here




Live Chat
Support


Cloudera Developer Apache Hadoop

overview

Big Data Analytics with R and Hadoop is focused on the techniques of integrating R and Hadoop by various tools such as RHIPE and RHadoop. A powerful data analytics engine can be built, which can process analytics algorithms over a large scale dataset in a scalable manner.
prerequisties
  • should have programming experience
  • knowledge of Java
Duration
Online
  • It is a 16 days program and extends up to 2hrs each.
  • The format is 40% theory, 60% Hands-on.

Corporate
  • It is a 4 days program and extends up to 8hrs each.
  • The format is 40% theory, 60% Hands-on.
Classroom
    Private Classroom arranged on request and minimum attendies for batch is 4.
course content
  • The Motivation For Hadoop
    • Problems with traditional large-scale systems
    • Requirements for a new approach
  • Hadoop Basic Concepts
    • An Overview of Hadoop
    • The Hadoop Distributed File System
    • Hands-On Exercise
    • How MapReduce Works
    • Hands-On Exercise
    • Anatomy of a Hadoop Cluster
    • Other Hadoop Ecosystem Components
  • Writing a MapReduce Program
    • The MapReduce Flow
    • Examining a Sample MapReduce Program
    • Basic MapReduce API Concepts
    • The Driver Code
    • The Mapper
    • The Reducer
    • Hadoop's Streaming API
    • Using Eclipse for Rapid Development
  • Integrating Hadoop Into The Workflow
    • Relational Database Management Systems
    • Storage Systems
    • Creating workflows with Oozie
    • Importing Data from RDBMSs With Sqoop
    • Hands-On Exercise
    • Importing Real-Time Data with Flume
    • Accessing HDFS Using FuseDFS and Hoop
  • Delving Deeper Into The Hadoop API
    • Using Combiners
    • Using LocalJobRunner Mode for Faster Development
    • Reducing Intermediate Data with Combiners
    • The configure and close methods for MapReduce Setup and Teardown
    • Writing Partitioners for Better Load Balancing
    • Directly Accessing HDFS
    • Using The Distributed Cache
  • Using Hive and Pig
    • Hive Basics
    • Pig Basics
  • Common MapReduce Algorithms
    • Sorting and Searching
    • Indexing
    • Machine Learning with Mahout
    • Term Frequency - Inverse Document Frequency
    • Word Co-Occurrence
  • Practical Development Tips and Techniques
    • Testing with MRUnit
    • Debugging MapReduce Code
    • Using LocalJobRunner Mode for Easier Debugging
    • Eclipse development techniques
    • Retrieving Job Information with Counters
    • Logging
    • Splittable File Formats
    • Determining the Optimal Number of Reducers
    • Map-Only MapReduce Jobs
    • Implementing Multiple Mappers using ChainMapper
  • More Advanced MapReduce Programming
    • Custom Writables and WritableComparables
    • Saving Binary Data using SequenceFiles and Avro Files
    • Creating InputFormats and OutputFormats
  • Joining Data Sets in MapReduce Jobs
    • Map-Side Joins
    • The Secondary Sort
    • Reduce-Side Joins
  • Graph Manipulation in Hadoop
    • Introduction to graph techniques
    • Representing Graphs in Hadoop
    • Implementing a sample algorithm: Single Source Shortest Path
  • Creating Workflows with Oozie
    • The Motivation for Oozie
    • Oozie's Workflow Definition Format
Videos
For Videos Click Here Videos

Flash News


AngularJS New Batch Start From 09th DEC & 10th DEC.


Hadoop Dev New Batch Start From 10th DEC & 11th DEC.


IBM COGNOS TM New Batch Start From 11th DEC & 12th DEC.


Informatica Dev New Batch Start From 12th DEC & 13th DEC.


Mean Stack New Batch Start 13th DEC & 14th DEC.

SAP BODS new Batch Starting From 14th DEC & 15th DEC.

SAP S/4 HANA New Batch Start From 15th DEC & 16th DEC
.

Tableau New Batch Start From 16th DEC & 17th DEC

PUBLIC DEMO


(1) Workday Technical Demo Training

Demo Schedule : 09:30A.M EST / 08:30A.M CST / 6:30A.M PST on 13th DEC & 07:00A.M IST on 14th DEC

SOLVE YOUR QUERIES ONLINE
Email :
Rediff Bol :
ecorptrainings@rediffmail.com
Google Talk :
ecorptrainings@gmail.com
MSN Messenger :
ecorptrainings@hotmail.com
Yahoo Messenger :
ecorptrainings@yahoo.com
Skype Talk :
ecorptrainings