Module 0: Giveaways
· Linux / UNIX Course
· 100 Solved Queries on Day-to-Day Hadoop Administration Activities
· Guidelines to create an AWS account.
Module 1: Introduction to Hadoop Administration
· Understanding Big Data
· Common big data domain scenarios
· Analyze Limitations of Traditional Solutions
· Roles and Responsibilities
· Case Studies
Module 2: Hadoop Architecture and MapReduce
· Introduction to Hadoop
· Hadoop Architecture
· Difference between Hadoop 1.x, Hadoop 2.x and Hadoop 3.x
· Hadoop 1.x Ecosystem tools and Core System
· Hadoop 2.x Ecosystem tools and Core System
· HDFS File System
o Introduction to NameNode, DataNode and Secondary NameNode
o Anatomy of Write and Read Operations
o Replication Pipeline
· YARN Framework
o Role and function of YARN in Hadoop
o MapReduce Theory
§ Cluster Testing Using MapReduce Code in a YARN Environment
Module 3: Cluster Planning
· Types of Racks
· General Principles for Selecting CPU, Memory and Hardware
· Understand Hardware Considerations
· Machine Requirements per Daemon
· Learn Best Practices for Selecting Hardware
· Know the Network Considerations
Module 4: Hadoop Cluster Administration, Backup, Recovery and Maintenance
· SafeMode
· Decommissioning, Commissioning and Re-Commissioning of Nodes
· Trash Functionality
· DistCp
· Rack Awareness
· HDFS / Hadoop Balancer
Module 5: Managing Resources and Scheduling
· Scheduler: Explanation and demo
o Capacity Scheduler
Module 6: HDFS Federation and High Availability
· Understand the YARN framework
· Understand the Federation
· Understand High Availability
· High Availability Implementation Using Quorum Journal Manager
Module 7: Cloudera Setup and Performance Tuning
· Cloudera Distribution of Hadoop (CDH)
· Cloudera Features
· Cloudera Manager Editions
· Cloudera Manager Web UI
· CDH Installation
Module 8: Security
· Basics of Hadoop Platform Security
· Securing the Platform
· Understand Kerberos
· Configuring Kerberos on a Cloudera Hadoop Cluster Using LDAP Authentication