We've noticed this is not your region.
Redirect me to my region
What do you want to learn today?

Details

The objective of this training program is to convert a layman into Big Data Hadoop Professional. During the course you will learn from basic to advance concepts of Big Data Hadoop. Setup a minimum 4 Node Hadoop Cluster, Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as:

  • Apache Ambari, Cloudera Manager features that make managing your clusters easier, such as aggregated logging, configuration management, resource management, reports, alerts, and service management.
  • The internals of YARN, MapReduce, Spark, Kafka, Storm and HDFS
  • Determining the correct hardware and infrastructure for your cluster
  • Proper cluster configuration and deployment to integrate with the data center
  • How to load data into the cluster from dynamically-generated files using Flume and from RDBMS using Sqoop
  • Configuring the FairScheduler to provide service-level agreements for multiple users of a cluster
  • Best practices for preparing and maintaining Apache Hadoop in production
  • Troubleshooting, diagnosing, tuning, and solving Hadoop issues
  • Advanced topics in real time event processing using Apache Kafka, Storm, NiFi
Who Should Attend Hadoop Administration Training?

There is no strict prerequisite to start learning Hadoop administration. This course is best suited who have basic Unix/Linux fundamentals. Prior knowledge of Apache Hadoop is not required.

  • Linux / Unix Administrator
  • Database Administrator
  • Windows Administrator
  • Infrastructure Administrator
  • System Administrator
  • Support engineers
  • Big Data Architects
  • IT Managers
  • Freshers can easily jump their career into Hadoop / Big Data

Benefits of Learning Hadoop Administration Training
  • Get Paid Higher Than What Your Earning Now Hadoop Administrators on average get paid 30% more. Hadoop job market is expected to grow 25 times by 2020.
  • Better Career Opportunities: The requirement for processing zettabytes of unstructured big data is generating demand for professionals with Hadoop skills to work with unstructured data. Career opportunities for Hadoop professionals are emerging across various business industries, from financial firms to retailers, healthcare, agriculture, sports, energy, utility and media.
  • During Internal Job Postings, Hadoop Skills helps you move up the ladder and accelerates your career in existing organization.
  • Within three to five years, half of the world’s data will be processed on Hadoop……there will be huge demand for thousands and thousands of individuals who are trained in Hadoop" Said By : Bob Mahan (Senior Director of Worldwide Field Services).
  • Large companies who are hiring Hadoop Administrators are Cisco, HP, Tata, LinkedIn, Oracle, ebay, IBM, Amazon, Google, Microsoft, Yahoo and many more.

Outline

Introduction to Apache Hadoop

The Case for Apache Hadoop

  • Why Hadoop is needed
  • What problems Hadoop solves
  • What comprises Hadoop and the Hadoop Ecosystem

HDFS

  • What features HDFS provides
  • How HDFS reads and writes files
  • How the NameNode uses memory
  • How Hadoop provides file security
  • How to use the NameNode Web UI
  • How to use the Hadoop File Shel

Getting Data Into HDFS

  • How to import data into HDFS with Flume
  • How to import data into HDFS with Sqoop
  • What REST interfaces Hadoop provides
  • Best practices for importing data

MapReduce

  • What MapReduce is
  • What features MapReduce provides
  • What the basic concepts of MapReduce are
  • What the architecture of MapReduce is
  • What featurs MapReduce version 2 provides
  • How MapReduce handles failure
  • How to use the JobTracker Web UI

Planning, Installing, and Configuring a Hadoop Cluster

Planning Your Hadoop Cluster

  • What issues to consider when planning your Hadoop cluster
  • What types of hardware are typically used for Hadoop nodes
  • How to optimally configure your network topology
  • How to select the right operating system and Hadoop distribution
  • How to plan for cluster management

Hadoop Installation and Initial Configuration

  • The different installation configurations avaialable in Hadoop
  • How to install Hadoop
  • How to specify Hadoop configuration
  • How to configure HDFS
  • How to configure MapReduce How to locate and configure Hadoop log files

Installing and Configuring Hive, Impala,and Pig

  • Hive features and basic configuration
  • Impala features and basic configuration
  • Pig features and installation

Hadoop Clients

  • What Hadoop clients are
  • How to install and configure Hadoop clients
  • How to install and configure Hue
  • How Hue authenticates and authorizes user access

Advanced Cluster Configuration

  • Advanced Configuration Parameters
  • Configuring Hadoop Ports
  • Explicitly including and Excluding Hosts
  • Configuring HDFS for Rack Awareness
  • Configuring HDFS High Availability

Hadoop Security

  • Why security is important for Hadoop
  • How Hadoop's security model evolved
  • What Kerberos is and how it relates to Hadoop
  • What to consider when securing Hadoop

Cluster Operations and Maintenance

Managing and Scheduling Jobs

  • How to view and stop jobs running on a cluster
  • The options available for scheduling Hadoop jobs
  • How to configure the Fair Scheduler

Cluster Maintenance

  • How to check the status of HDFS
  • How to copy data between clusters
  • How to add and remove nodes
  • How to rebalance the cluster
  • How to upgrade your cluster

Cluster Maintenance and Troubleshooting

  • What general system conditions to monitor
  • How to monitor a Hadoop cluster
  • Some techniques for troubleshooting problems on a Hadoop cluster
  • Some common misconfigurations, and their resolutions

Security and HDFS Federation

Kerberos Configuration

  • What are the phases required for a client to access a service
  • Kerberos Client Commands
  • Configuring HDFS Security
  • Configuring MapReduce Security
  • Troubleshooting Hadoop Security

Configuring HDFS Federation

  • What is HDFS Federation
  • Benefits of HDFS Federation
  • How HDFS Federation works
  • Federation Configuration

ADVANCED TOPICS (Real Time Event Processing)

APACHE SPARK

  • What is Spark
  • How Spark works
  • Spark Use Cases
  • Installing and configuring Spark
  • Real time event processing with Spark

APACHE KAFKA

  • What is Kafka
  • How Kafka works
  • Installing and configuring kafka
  • Real time event processing with Kafka.

APACHE STORM

  • What is Storm
  • How Storm works
  • Installing and configuring Storm
  • Real time event processing with Storm
Reviews
Be the first to write a review about this course.
Write a Review
AADS Education was established a decade ago and has rich global experience in Education, online Media and IT services. Educational training programs offered by us have been very beneficial to both employed individuals,unemployed individuals and corporates. AADS Education is well established player in offering job oriented trainings,short term professional training and long term professional training in various fields.
Sending Message
Please wait...
× × Speedycourse.com uses cookies to deliver our services. By continuing to use the site, you are agreeing to our use of cookies, Privacy Policy, and our Terms & Conditions.