Home    |    Instructor-led Training    |    Online Training     
         
 
Courses
ADA
Adobe
Agile
AJAX
Android
Apache
AutoCAD
Big Data
BlockChain
Business Analysis
Business Intelligence
Business Objects
Business Skills
C/C++/Go programming
Cisco
Citrix
Cloud Computing
COBOL
Cognos
ColdFusion
COM/COM+
CompTIA
CORBA
CRM
Crystal Reports
Data Science
Datawarehousing
DB2
Desktop Application Software
DevOps
DNS
Embedded Systems
Google Web Toolkit (GWT)
IPhone
ITIL
Java
JBoss
LDAP
Leadership Development
Lotus
Machine learning/AI
Macintosh
Mainframe programming
Mobile
MultiMedia and design
.NET
NetApp
Networking
New Manager Development
Object oriented analysis and design
OpenVMS
Oracle
Oracle VM
Perl
PHP
PostgreSQL
PowerBuilder
Professional Soft Skills Workshops
Project Management
Rational
Ruby
Sales Performance
SAP
SAS
Security
SharePoint
SOA
Software quality and tools
SQL Server
Sybase
Symantec
Telecommunications
Teradata
Tivoli
Tomcat
Unix/Linux/Solaris/AIX/
HP-UX
Unisys Mainframe
Visual Basic
Visual Foxpro
VMware
Web Development
WebLogic
WebSphere
Websphere MQ (MQSeries)
Windows programming
XML
XML Web Services
Other
HADOOP FOR SYSTEMS ADMINISTRATORS
Big Data Training Overview

This course covers the essentials of deploying and managing an Apache™ Hadoop® cluster. The course is lab intensive with each participant creating their own Hadoop cluster using either the CDH (Cloudera's Distribution, including Apache Hadoop) or Hortonworks Data Platform stacks. Core Hadoop services are explored in depth with emphasis on troubleshooting and recovering from common cluster failures. The fundamentals of related services such as Ambari, Zookeeper, Pig, Hive, HBase, Sqoop, Flume, and Oozie are also covered. The course is approximately 60% lecture and 40% labs.

Supported Distributions

Red Hat Enterprise Linux 6

Big Data Training Prerequisites

Qualified participants should be comfortable with the Linux commands and have some systems administration experience, but do not need previous Hadoop experience

Big Data Training Course Duration

3 Days

Big Data Training Course outline

  1. Hadoop: The Big Picture
    1. Data Analysis
    2. Big Data
    3. Hadoop Core Architecture
    4. Hadoop Ecosystem
    5. Hadoop Ecosystem continued
    6. Running Commands on Multiple Systems
    Lab Tasks
    1. Running Commands on Multiple Hosts
    2. Preparing to Install Hadoop
  2. HDFS
    1. Design Goals
    2. Design
    3. Blocks
    4. Block Replication
    5. Namenode Daemon
    6. Secondary Namenode Daemon
    7. Datanode Daemon
    8. Accessing HDFS
    9. Permissions and Users
    10. Adding and Removing Datanodes
    11. Balancing
    Lab Tasks
    1. Single Node HDFS
    2. Multi-node HDFS
    3. Files and HDFS
    4. Managing and Maintaining HDFS
  3. MapReduce
    1. MapReduce
    2. Terminology and Data Flow
    3. MapReduce Daemons
    4. YARN
    5. MapReduce Essential Configuration
    6. Failure and Recovery
    Lab Tasks
    1. MapReduce
  4. MapReduce Schedulers
    1. Working with Jobs
    2. Scheduling Concepts
    3. FIFO Scheduler
    4. Fair Scheduler
    5. Fair Scheduler - Configuration
    Lab Tasks
    1. MapReduce Schedulers
  • A. Installing Hadoop with Ambari Lab Tasks
    1. Install Ambari

Please contact your training representative for more details on having this course delivered onsite or online

Training Outlines - the one stop shopping center for IT training.
© Training Outlines All rights reserved