Hadoop Scalable Distributed Computing


Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. It is an open-source project sponsored by the Apache Software Foundation.
  • Knowledge of Hadoop and Distributed Computing.
  • It is a 20-day program with sessions of up to 2 hours each.
  • The format is 40% theory, 60% hands-on.

  • It is a 5-day program with sessions of up to 8 hours each.
  • The format is 40% theory, 60% hands-on.
    A private classroom can be arranged on request; the minimum number of attendees per batch is 4.
Course Content
  • Getting Started with Hadoop Core
    • Understanding Data, Data Storage and Data Analysis
    • Introducing the MapReduce Model
    • Introducing Hadoop
    • Tracing the Hadoop History
    • Installing Hadoop
    • Running Hadoop Examples and Tests
    • Troubleshooting
  • Understanding MapReduce
    • An Example Dataset
    • Analyzing the Data with Unix Tools
    • Analyzing the Data with Hadoop
    • Scaling Out
    • Hadoop Streaming
    • Hadoop Pipes
  • Understanding the Hadoop Distributed Filesystem
    • The Design of HDFS
    • HDFS Concepts
    • The Command-Line Interface
    • Hadoop Filesystems
    • The Java Interface
    • Data Flow
    • Parallel Copying with distcp
    • Hadoop Archives
  • Working with Hadoop I/O
    • Data Integrity
    • Compression
    • Serialization
    • File-Based Data Structures
  • Developing a MapReduce Application
    • The Configuration API
    • Configuring the Development Environment
    • Writing a Unit Test
    • Running Locally on Test Data
    • Running on a Cluster
    • Tuning a Job
    • MapReduce Workflows
  • Explaining How MapReduce Works
    • Anatomy of a MapReduce Job Run
    • Failures
    • Job Scheduling
    • Shuffle and Sort
    • Task Execution
  • Discussing MapReduce Types and Formats
    • MapReduce Types
    • Input Formats
    • Output Formats
  • Explaining MapReduce Features
    • Counters
    • Sorting
    • Joins
    • Side Data Distribution
    • MapReduce Library Classes
  • Setting Up a Hadoop Cluster
    • Cluster Specification
    • Cluster Setup and Installation
    • SSH Configuration
    • Hadoop Configuration
    • Post Install
    • Benchmarking a Hadoop Cluster
    • Hadoop in the Cloud
  • Administering Hadoop
    • HDFS
    • Monitoring
    • Maintenance