We've noticed this is not your region.
Redirect me to my region
What do you want to learn today?

Hortonworks Data Platform (HDP) Operations: Apache Hbase Advanced Management

Online Training by  Agilitics
Inquire Now
Online / Training

Details

Hortonworks Data Platform (HDP) Operations: Apache Hbase Advanced Management

This course is designed for administrators who will be installing, configuring and managing HBase clusters. It covers installation with Ambari, configuration, security and troubleshooting HBase implementations. The course includes an end-of-course project in which students work together to design and implement an HBase schema.

PREREQUISITES
Students must have basic familiarity with data management systems. Familiarity with Hadoop or databases is helpful but not required. Students new to Hadoop are encouraged to take the HDP Overview: Apache Hadoop Essentials course.

TARGET AUDIENCE

Architects, software developers, and analysts responsible for implementing non-SQL databases in order to handle sparse datasets commonly found in big data use cases. 

FORMAT 50% Lecture/Discussion 50% Hands-on Labs

AGENDA SUMMARY
Day 1: An Apache HBase Overview and Installing HBase

Day 2: Using the HBase Shell and Ingest/ImportTSV

Day 3: Managing HA Clusters and Log Files, Backup Recovery and Security

Day 4: Monitoring HBase, Maintenance, Troubleshooting and Class Project 

DAY 1 OBJECTIVES
• Describe the Characteristics and Operation of HDFS • Describe the Responsibilities of the NameNode and DataNode • Describe the Purpose of YARN, including the: o ResourceManager o NodeManager o ApplicationMaster • Describe the Primary Differences Between Hadoop 1.x and 2.x • Describe the Function and Purpose of HBase • List HBase Features and Components • Describe an HBase Table as a Set of Key-Value Mappings • Idenfity HBase as Either a Row-or- Column-Oriented Database • Describe HBase Operations • List the Options for HBase Installation • List the HBase Minimum System Requirements • Describe the Process for Installing HBase Using Ambari • Describe the Process for Confirming a Successful Installation

DAY 1 LABS
• Installing and Configuring HBase with Ambari • Manually Installing an HBase Cluster

DAY 2 OBJECTIVES
• Work with Basic HBase Shell Commands • List the Categories of Shell Commands Including: o General o Table Management o Data Manipulation o Surgery Tools o Cluster Replication Tools o Security Tools • Work with Cluster Administration Commands • Describe the Function and Purpose of the Regionserver • Identify the Purpose of Key-Value Pairs • Identify the Purpose of Row Keys • Identify the Purpose of Column Families • Describe How to Read and Write Data in HBase • Describe the Flush Process • Describe the Compaction Process • Perform a Bulk Ingest Using ImportTSV • Describe the Function and Purpose of a CopyTable

DAY 2 LABS
• Using HBase Shell Commands • Ingesting Data with ImportTSV

DAY 3 OBJECTIVES
• List the Steps Required to Upgrade HBase • Configure HBase for High Availability • View Log Files • Describe the Function and Purpose of HBase Coprocessors • Describe the Function and Purpose of HBase Filters • Describe the Process for Using Filters for Scans • Describe the Process for Protecting HBase Data with Backups • Describe the Function and Benefits of Snapshots in HBase • Describe the Process for Performing Snapshots in HBase • Describe the Process for HBase Replication • Configure HBase Cluster Replication • Describe the Purpose of HBase Authentication • Describe the Purpose and Benefits of HBase Authorization Via ACLs • Describe the Benefits of Ranger and Knox for HBase Security • Describe the Process Used to Configure Simple Authentication • Describe the Secure Bulk Load Process

DAY 3 LABS
• Enabling HBase High Availability • Viewing Log Files • Configuring and Enabling Snapshots • Configuring Cluster Replication • Enabling Authentication and Authorization in HBase Tables

DAY 4 OBJECTIVES
• List Important Metrics to Monitor for an HBase Cluster • Monitor an HBase Cluster Using Ambari • Describe the Benefits of OpenTSDB as a Took for Monitoring • Describe How to Identify a Region Hot Spot • Design a Row-Key Schema to Avoid Hot Spotting • Configure an HBase Table Using Pre-Splitting • Describe the Region Splitting Process • Describe the Function of the Load Balancer • Define Region Sizing • Describe the Process of Manual Splitting and Merging • Describe the Process of Resolving Regions Overlap Issues • Use the Zookeeper Command Line Tool to Check Zookeeper Status and State • Monitor JVM Garbage Collection Metrics on Regionservers • Resolve Startup Errors for Masterserver and Regionservers • Tune HBase for Better Performance • Tune HDFS for Better HBase Performance

DAY 4 LABS
• Diagnosing and Resolving Hot Spotting • Region Splitting • Monitoring JVM Garbage Collection • End of Course Lab Project – Designing an HBase Schema
Reviews
Be the first to write a review about this course.
Write a Review
Agilitics Pte. Ltd. is a reknowned Big Data Analytics firm headquartered in Singapore with operations in mulitple countries. They are expert of big data and belive and spreading the knowledging for betterment of the Big Data community and generating bigger and better talent pool for industry.
Sending Message
Please wait...
× × Speedycourse.com uses cookies to deliver our services. By continuing to use the site, you are agreeing to our use of cookies, Privacy Policy, and our Terms & Conditions.