- This event has passed.
Hadoop Administration
May 27, 2018 - May 27, 2022
$44

Professional Hadoop Administration Training
Spoorthy’s Hadoop Administration Certification Training helps you gain expertise in maintaining large, complex Hadoop clusters. You will learn Hadoop administration activities such as planning, installation, configuration, monitoring, and tuning. You will also learn to implement security with Kerberos and work with Hadoop v2 through industry-level case studies.

Course Content
INTRODUCTION
Big Data Introduction
What is Big Data?
Big Data – Why
Big Data – Journey
Big Data Statistics
Big Data Analytics
Big Data Challenges
Technologies Supported By Big Data
Hadoop Introduction
What Is Hadoop?
History Of Hadoop
Breakthroughs Of Hadoop
Future of Hadoop
Who Is Using?
Basic Concepts
The Hadoop Distributed File System – At a Glance
Hadoop Daemon Processes
Anatomy Of A Hadoop Cluster
Hadoop Distributions
HADOOP DISTRIBUTED FILE SYSTEM (HDFS)
What is HDFS?
Distributed File System (DFS)
Hadoop Distributed File System (HDFS)
HDFS Cluster Architecture and Block Placement
NameNode
DataNode
JobTracker
TaskTracker
Secondary NameNode
HDFS Concepts
Typical Workflow
Data Replication
Replica Placement
Replication Policy
Hadoop Rack Awareness
Anatomy of a File Read
Anatomy of a File Write
MAPREDUCE
STAGES OF MAPREDUCE
DAEMONS
JobTracker
TaskTracker
TASK FAILURES
Child Task Failures
TaskTracker Failures
JobTracker Failures
HDFS Failures
YARN
HOW TO PLAN A CLUSTER
VERSIONS AND FEATURES
HARDWARE SELECTION
Master Hardware
Slave Hardware
Cluster sizing
OPERATING SYSTEM SELECTION
Deployment Layout
Software Packages
Hostname, DNS
Users, Groups, Privileges
DISK CONFIGURATION
Choose a FileSystem
INSTALLATION AND CONFIGURATION
APACHE HADOOP
Tarball Installation
Package Installation
CONFIGURATION
XML Configuration
Environment Variables
Logging Configuration
HDFS
Optimization and Tuning
MAPREDUCE
Optimization and Tuning
AUTHENTICATION
KERBEROS AND HADOOP
Kerberos
Configuring Hadoop Security
RESOURCE MANAGEMENT
WHAT IS RESOURCE MANAGEMENT?
MAPREDUCE SCHEDULER
Capacity Scheduler
Fair Scheduler
CLUSTER MAINTENANCE
MANAGING HADOOP PROCESSES
Starting and stopping processes with init scripts
Starting and stopping processes manually
HDFS MAINTENANCE
Adding and Decommissioning a DataNode
Balancing HDFS Block Data
Dealing with a Failed Disk
MAPREDUCE MAINTENANCE
Adding and Decommissioning a TaskTracker
Killing a MapReduce Job or Task
Dealing with a Blacklisted TaskTracker
TROUBLESHOOTING
COMMON FAILURES AND PROBLEMS
HDFS AND MAPREDUCE CHECKS
BACKUP AND RECOVERY
DATA BACKUP
Distributed copy
Parallel data ingestion
NAMENODE METADATA
COURSE DELIVERABLES
Workshop style coaching
Interactive approach
Course material
Hands-on practice exercises
Quiz at the end of each major topic
Tips and techniques for the Cloudera certification examination
Mock interviews for each participant, conducted as needed
Resume preparation and guidance