HDP Operations: Hadoop Admin II

Online Training by Agilitics

HORTONWORKS DATA PLATFORM (HDP®) OPERATIONS: ADMINISTRATION 2

This course is designed for experienced administrators who manage Hortonworks Data Platform (HDP) 2.3 clusters with Ambari. It covers upgrades, configuration, application management, and other common tasks.

PREREQUISITES
Attendees should have attended HDP Operations: Hadoop Administration 1 or possess equivalent knowledge and experience. Attendees should be familiar with basic HDP administration and Linux environments.

TARGET AUDIENCE

IT administrators and operators responsible for configuring, managing, and supporting an HDP 2.3 deployment in a Linux environment using Ambari.

FORMAT
50% Lecture/Discussion, 50% Hands-on Labs

AGENDA SUMMARY

Day 1: Performing a Rolling Upgrade, Configuring Heterogeneous HDFS Storage

Day 2: Deploying Applications with Slider, Integrating Ambari with LDAP, Hive Tuning

Day 3: Apache Oozie High Availability, Introduction to Falcon, Automating Using Ambari Blueprints

DAY 1 OBJECTIVES
• List the HDP Upgrade Types
• List the HDP Upgrade Path Restrictions
• Prepare Databases and HDFS
• Describe the Process Used to Register a New HDP Version
• Execute Automated Installation of and Upgrades to HDP Clusters
• Identify the Purpose and Operation of Heterogeneous HDFS Storage
• Identify and Describe the HDFS Storage Types and Policies
• Identify HDFS NFS Gateway Use Cases
• Recall HDFS NFS Gateway Architecture and Operation
• Install and Configure an HDFS NFS Gateway
• Configure an HDFS NFS Gateway Client
• Summarize the Purpose and Operation of HDFS Centralized Caching
• Configure HDFS Centralized Cache
• Define and Manage Cache Pools and Cache Directives
• Summarize the Importance of File Format and Compression Algorithm Selection
• Summarize the Benefits and Considerations When Using Compression in Hadoop
• Describe the Administrator and Non-Administrator Roles in Managing Compression
• List and Describe Splittable Compression Formats
• Configure Default MapReduce and Tez Compression Algorithms

DAY 1 LABS
• Setting Up the Lab Environment
• Performing a Rolling Upgrade
• Configuring HDFS Storage Policies
• Configuring an NFS Gateway
• Configuring HDFS Centralized Cache

DAY 2 OBJECTIVES
• Summarize the Purpose and Operation of YARN Node Labels
• Create, Add, Modify and Remove Node Labels
• Configure Queues to Access Node Label Resources
• Run Test Jobs to Confirm Node Label Behavior
• Recall the Purpose, Benefits and Components of Apache Slider
• Install and Manage an Apache Slider Application Package Using the Slider View
• Identify Characteristics of Ambari Local Versus LDAP Users and Groups
• Integrate Ambari Server with LDAP
• Configure YARN Queues, Tez and Hive Properties to Support Performance Goals
• Recall Basic Facts About Hive and the Hive Architecture
• Recall the Requirements and Benefits of Hive High Availability (HA)
• Summarize the Hive HA Architecture and Operation
• Configure and Test Hive HA
• Recall the Purpose, Job Types, Structure and Benefits of Apache Oozie
• Install and Configure Apache Oozie Using Ambari
• Deploy and Manage a Sample Apache Oozie Workflow

DAY 2 LABS
• Configuring YARN Node Labels
• Deploying Applications Using Apache Slider
• Integrating Ambari with LDAP
• Configuring Hive High Availability
• Managing Workflows Using Apache Oozie

DAY 3 OBJECTIVES
• Recall the Purpose and Architecture of Apache Oozie
• Recall the Benefits of Apache Oozie High Availability (HA)
• Summarize the Apache Oozie HA Architecture and Operation
• Configure Apache Oozie HA
• Understand the Challenges of Data Governance in Large, Complex Environments
• Recall the Purpose and Capabilities of Falcon
• Understand the Purpose and Configuration of Cluster, Feed and Process Entities
• Create a Cluster Entity and Set Up Mirroring Using the Falcon UI
• Summarize the Purpose and Benefits of Ambari Blueprints
• Recall the Processes Used to Deploy a Cluster Using Ambari Blueprints
• Configure Ambari Blueprints Logical Cluster Configuration Files
• Configure Ambari Blueprints Cluster Creation Configuration Files
• Configure Ambari Blueprints Host Creation Configuration Files
• Identify Ambari Blueprints Configuration Property Precedence and Best Practices

DAY 3 LABS
• Configuring Apache Oozie High Availability
• Configuring Data Replication Using Apache Falcon
• Automating Cluster Provisioning Using Apache Ambari
Agilitics Pte. Ltd. is a renowned Big Data analytics firm headquartered in Singapore with operations in multiple countries. Its team of big data experts believes in spreading knowledge for the betterment of the Big Data community and in building a bigger and better talent pool for the industry.