20/09/2018
Duration : 40+ hours
Weekends : 4
Days : Saturday & Sunday
Dates : Nov 9, 10, 16, 17, 23, 24, 30 and Dec 1
Time : 1.30 pm to 6.30 pm
Training mode : Classroom/Online Training
Fee : 20,000 INR (Inclusive of GST)
contact : [email protected]
Pre-requisite : Basics of Database, Unix
Who can attend this course:
1. College graduates who want to do their main project in Data Engineering or build a career in it, which carries more weight in interviews
2. Teaching professionals who want to upgrade their skills with a stronger hands-on focus
3. PhD students who have an idea but are stuck on the technology or coding
4. IT professionals who are looking for an opportunity to shift their career track toward recent trends and technologies
5. Non-IT professionals who want to build a career in IT
Topics covered
Week 1 : Introduction to Big Data 10 Hrs
----------------------------------------------
Introduction to Data Science, Big Data and the Hadoop Framework
Hadoop Pseudo-Distributed (Single-Node) Cluster Setup, Installation and Details of the Property Files
Deep Dive into Hadoop Commands & Use Cases
Week 2 : Data Ingestion & Workflow Management 10 Hrs
----------------------------------------------------------
Data Ingestion : Sqoop
Workflow Management : Oozie Workflow
Sqoop:
Introduction to Sqoop
Sqoop Architecture
Integrate Sqoop with MySQL
Import/Export data using Sqoop
Use cases for Sqoop
Oozie:
Introduction to Oozie workflow and its architecture
Implementation of Oozie workflow
Week 3 : Data Processing 10 Hrs
----------------------------------------------
Hive :
Introduction to Hive and its Architecture
Simple and complex Data types
Hive Query Language Statements
Joins and their types
Internal/External Tables
Partition/Bucketing
Pig :
Introduction to Pig and its Architecture
Modes of execution
Simple and complex Data types
Data transformation using Pig Commands
Joins and their types
Week 4 : Data Streaming 10 Hrs
----------------------------------------------
Flume:
Introduction to Flume and its architecture
Real-time usage of Flume
Flume Data streaming
Kafka :
Introduction to Kafka
Kafka Architecture
Components of Kafka
Data Streaming using RDD
Configuration of Producer & Consumer
Spark :
Introduction to Spark
Spark installation and setting up environment parameters
Enable your Spark cluster
DataFrames and RDDs
Process data using Spark SQL and Spark Streaming