

Apache Hadoop Data Analyst

Hadoop Introduction:
  • Why we need Hadoop
  • Why Hadoop is in demand in the market nowadays
  • Where expensive SQL-based tools are failing
  • Key points: why Hadoop is a leading tool in the current IT industry
  • Definition of Big Data
  • Hadoop nodes
  • Introduction to Hadoop Release-1
  • Hadoop daemons in Hadoop Release-1
  • Introduction to Hadoop Release-2
  • Hadoop daemons in Hadoop Release-2
  • Hadoop cluster and racks
  • Hadoop cluster demo
  • New projects on Hadoop
  • How open source tools are able to run jobs in less time
  • Hadoop storage: HDFS (Hadoop Distributed File System)
  • Hadoop processing framework (MapReduce / YARN)
  • Alternatives to MapReduce
  • Why NoSQL is in much demand instead of SQL
  • Distributed warehouse for HDFS
  • Hadoop ecosystem and its usages
  • Data import/export tools

 

Hadoop Installation and Hands-on on Hadoop Machine:
  • Hadoop installation
  • Introduction to Hadoop FS and the processing environment's UIs
  • How to read and write files
  • Basic Unix commands for Hadoop
  • Hadoop FS shell
  • Hadoop releases practical
  • Hadoop daemons practical

ETL Tool (Pig) Introduction Level-1 (Basics)
  • Pig introduction
  • Why Pig if MapReduce is there?
  • How Pig is different from programming languages
  • Pig data flow introduction
  • How schema is optional in Pig
  • Pig data types
  • Pig commands: Load, Store, Describe, Dump
  • MapReduce job started by Pig commands
  • Execution plan

 

ETL Tool (Pig) Level-2 (Complex)
  • Pig UDFs
  • Pig use cases
  • Pig assignment
  • Complex use cases on Pig
  • Real-time scenarios on Pig
  • When we should use Pig
  • When we shouldn't use Pig

 

Hive Warehouse
  • Hive introduction
  • Meta storage and metastore
  • Introduction to Derby database
  • Hive data types
  • HQL
  • DDL, DML and sub-languages of Hive
  • Internal, external and temporary tables in Hive
  • Differences between an SQL-based data warehouse and Hive

 

Hive Level-2 (Complex)
  • Hive releases
  • Why Hive is not the best solution for OLTP
  • OLAP in Hive
  • Partitioning
  • Bucketing
  • Hive architecture
  • Hue interface for Hive
  • How to analyze data using a Hive script
  • Differences between Hive and Impala
  • UDFs in Hive
  • Complex use cases in Hive
  • Hive advanced assignment

Introduction to MapReduce
  • How MapReduce works as a processing framework
  • End-to-end execution flow of a MapReduce job
  • Different tasks in a MapReduce job
  • Why the Reducer is optional while the Mapper is mandatory
  • Introduction to Combiner
  • Introduction to Partitioner
  • Programming languages for MapReduce
  • Why Java is preferred for MapReduce programming

 

NoSQL Databases and Introduction to HBase
  • Introduction to NoSQL
  • Why NoSQL if SQL has been in the market for several years
  • Databases in the market based on NoSQL
  • CAP theorem
  • ACID vs. CAP
  • OLTP solutions with different capabilities
  • Which NoSQL-based solution is capable of handling specific requirements
  • Examples of companies that use NoSQL-based databases
  • HBase Architecture of column families

 

Zookeeper and Sqoop
  • Introduction to Zookeeper
  • How Zookeeper helps in the Hadoop ecosystem
  • How to load data from relational storage into Hadoop
  • Sqoop basics
  • Sqoop practical implementation
  • Sqoop alternative
  • Sqoop connector

Flume, Oozie and YARN
  • How to load streaming data without a fixed schema
  • How to load unstructured and semi-structured data into Hadoop
  • Introduction to Flume
  • Hands-on on Flume
  • How to load Twitter data into HDFS using Hadoop
  • Introduction to Oozie
  • How to schedule jobs using Oozie
  • What kind of jobs can be scheduled using Oozie
  • How to schedule time-based jobs
  • Hadoop releases
  • From where to get Hadoop and other components to install
  • Introduction to YARN
  • Significance of YARN

Apache Spark Basics
  • Introduction to Spark
  • Basic features of Spark and Scala available in Hue
  • Why Spark demand is increasing in the market
  • How we can use Spark with the Hadoop ecosystem
  • Datasets for practice purposes

 

Emerging Trends of Big Data
  • YARN
  • Emerging technologies of Big Data
  • Emerging use cases, e.g. IoT, Industrial Internet, new applications
  • Certifications and job opportunities
Agilitics Pte. Ltd. is a renowned Big Data analytics firm headquartered in Singapore with operations in multiple countries. They are experts in Big Data and believe in spreading knowledge for the betterment of the Big Data community and in generating a bigger and better talent pool for the industry.