Hortonworks Data Platform Administration Fast Track Training

Online Training by Agilitics


This 5-day training course is designed primarily for systems administrators and platform architects who need to understand HDP cluster capabilities and manage HDP clusters.

Topics include:
Understanding HDP capabilities, Apache Hadoop, Apache YARN, HDFS, and other Hadoop ecosystem components.
Students will understand how to administer, manage, and monitor HDP clusters.

PREREQUISITES
Students should be familiar with server or platform software concepts and have a basic understanding of system administration.

TARGET AUDIENCE
Students ranging from those who understand server software concepts to system administrators and platform architects who plan to administer HDP clusters.

FORMAT
50% Lecture/Discussion, 50% Hands-on Labs

AGENDA SUMMARY
Day 1: Introduction to Hadoop and Ambari
Day 2: Managing HDFS, YARN Architecture and Management
Day 3: The YARN Capacity Scheduler, High Availability, Monitoring and Backups
Day 4: Advanced HDFS & YARN Services
Day 5: Additional HDP Components and Tuning

DAY 1 OBJECTIVES
• Describe Apache Hadoop
• Summarize the Purpose of the Hortonworks Data Platform Software Frameworks
• List Hadoop Cluster Management Choices
• Describe Apache Ambari
• Identify Hadoop Cluster Deployment Options
• Plan for Hadoop Cluster Deployments
• Perform an Interactive HDP Installation using Apache Ambari
• Install Apache Ambari
• Describe the Differences Between Hadoop Users, Hadoop Service Owners and Ambari Users
• Manage Users, Groups and Permissions
• Identify Hadoop Configuration Files
• Summarize Operations of the Web UI Tool
• Manage Hadoop Service Configuration Properties using the Ambari Web UI
• Manage Client Configuration Files Using the Command-line Interface

DAY 1 LABS
• Setting Up the Lab Environment
• Installing HDP
• Managing Apache Ambari Users and Groups
• Managing Hadoop Services

DAY 2 OBJECTIVES
• Describe the Hadoop Distributed File System (HDFS)
• Perform HDFS Shell Operations
• Use the Ambari Files View
• Use WebHDFS
• Protect Data using HDFS Access Control Lists (ACLs)
• Describe HDFS Architecture and Operation
• Manage HDFS using Ambari Web, NameNode and DataNode UIs
• Manage HDFS using Command-line Tools
• Enable and Manage HDFS Quotas
• Identify Reasons to Add, Replace and Delete Worker Nodes
• Configure and Run HDFS Balancer
• Decommission and Re-commission a Worker Node
• Move a Master Component
• Summarize the Purpose and Benefits of Rack Awareness
• Configure Rack Awareness

DAY 2 LABS
• Using Hadoop Storage
• Using WebHDFS
• Using HDFS Access Control Lists
• Managing Hadoop Storage
• Managing HDFS Quotas
• Adding, Decommissioning, and Re-commissioning Worker Nodes
• Configuring Rack Awareness

DAY 3 OBJECTIVES
• Describe YARN Resource Management
• Summarize YARN Architecture and Operation
• Identify and Use YARN Management Options
• Summarize YARN Response to Component Failure
• Understand the Basics of Running a Sample YARN Application, Including:
  o MapReduce and Tez
  o Apache Pig
  o Apache Hive
• Summarize the Purpose and Operation of the YARN Capacity Scheduler
• Configure and Manage YARN Queues
• Control Access to YARN Queues

DAY 3 LABS
• Managing the YARN Service Using the Apache Ambari Web UI
• Managing the YARN Service Using the CLI Commands
• Running Sample YARN Applications
• Setting Up for the Capacity Scheduler
• Managing YARN Containers and Queues
• Managing YARN ACLs and User Limits
• Working with YARN Node Labels

DAY 5 OBJECTIVES
• Configure YARN Queues, Tez, and Hive Properties to Support Performance Goals
• Recall Basic Facts About Hive and the Hive Architecture
• Recall the Requirements and Benefits of Hive HA
• Summarize the Hive HA Architecture and Operation
• Configure and Test Hive HA
• Recall the Purpose, Job Types, Structure and Benefits of Oozie
• Install and Configure Oozie using Ambari
• Deploy and Manage a Sample Oozie Workflow
• Identify Characteristics of Ambari Local Versus LDAP Users and Groups
• Integrate Ambari Server with LDAP
• Summarize the Purpose and Benefits of Ambari Blueprints
• Recall the Process Used to Deploy a Cluster Using Ambari Blueprints
• Configure Ambari Blueprints Logical Cluster Configuration Files
• Recall the Definition of an HDP Stack and Interpret its Version Number
• View the Current Stack and Identify Compatible Ambari Software Versions
• Recall the Types and Methods of Upgrades Available in HDP
• Describe the Rolling Upgrade Process, Restrictions, and Pre-Upgrade Checklist
• Perform a Rolling Upgrade Using the Ambari Web UI

DAY 5 LABS
• Configuring Apache Hive High Availability
• Managing Workflows Using Apache Oozie
• Integrating Apache Ambari with AD/LDAP
• Automating Cluster Provisioning using Apache Ambari
• Performing an HDP Upgrade
Agilitics Pte. Ltd. is a renowned Big Data analytics firm headquartered in Singapore with operations in multiple countries. The company specializes in big data and believes in spreading knowledge for the betterment of the Big Data community and in building a larger, stronger talent pool for the industry.