Академический Документы
Профессиональный Документы
Культура Документы
www.easylearning.guru
Agenda
What is Big Data ?
Different Kinds of Big Data
www.easylearning.guru
www.easylearning.guru
www.easylearning.guru
Sources of Data
Mobile Devices
(Tracking all the objects all the time)
Sensor Technology & Networks
(Measuring all kinds of data)
Scientific Instruments
(Collecting all sorts of data)
www.easylearning.guru
www.easylearning.guru
Facebook Scenario
Facebook on an average generates 70 thousand MB in 1 minute.
1 hour
1 Day
1 week
4 weeks
52 weeks
= 70,000 MB *60
= 4.2 Million MB
= 4.2 Million *24 MB = 10.8 Billion MB = 98438 GB
= 6.9 thousand GB = 690 TB
= 690 TB * 4 = 2756 TB = 2.7 PB
= 2.7 PB * 52 = 143.3 PB
www.easylearning.guru
50
40
30
20
10
0
Implemented Big Data
2012
Filled
DATA SCIENTIST
BIG DATA VISUALIZER
BIG DATA RESEARCH ANALYST
2014
2015
2016
2017
Unfilled
82
18
77
23
69
31
44
56
43
57
2013
50
50
FILLED/VACANCY(%)
www.easylearning.guru
www.easylearning.guru
120
100
38%
80
As of February 2014
60
14%
40
8%
20
2%
2%
3%
8%
10%
11%
4%
www.easylearning.guru
Sources : Dice, LinkedIn.
What is Hadoop ?
Hadoop was created by Doug Cutting and Mike Cafarella.
Hadoop provides the reliable shared storage and analysis
system.
It is designed to scale up from a single server to thousand of
machines, with a high degree of fault tolerance.
www.easylearning.guru
Hadoop History
www.easylearning.guru
www.easylearning.guru
www.easylearning.guru
MapReduce Flow
www.easylearning.guru
MapReduce Framework
Map Reduce works by breaking the processing into two phases :
Map Phase and Reduce Phase.
www.easylearning.guru
www.easylearning.guru
What we offer
www.easylearning.guru
www.easylearning.guru
Syllabus
Introduction
Hive
a)Big Data
a)Hive 1
b)Hadoop
b)Hive 2
Hadoop
Hbase
a)HDFS
Zookeeper
b)MapReduce
Sqoop
PIG
a)Pig 1
b)Pig 2
Yarn
Project Class
www.easylearning.guru
Skype Id : easylearning.guru
Website : www.easylearning.guru
Your queries are always welcome.
www.easylearning.guru