Cloud computing
阳振坤
Senior scientist, Baidu
yangzhenkun@baidu.com
2009.6.26
Agenda
• Background information
• Power consumption in IDC
• Cutting power consumption
• Summary
What is cloud computing
• An emerging computing technology that uses the
internet and central remote servers (IDC) to
maintain data and applications
• Enabling much more efficient computing by
centralizing storage, memory, processing and
bandwidth
  - Machine idle ratio: ~70% vs. 90% in traditional
• A kind of large-scale distributed (usually
heterogeneous) system
IDC: some numbers
• The data center in The Dalles, Oregon: ~50 MW
  - 50 MW * 0.8 / 200 W = 0.2M (machines)
  - Average electricity consumption in USA:
    ~900 kWh/month/family, or 1.25 kW
• Power consumption is the major cost and constraint
of IDC
• About 7000 IDCs in USA
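The two figures on this slide can be checked with a few lines of arithmetic. The sketch below assumes that the 0.8 factor is the share of facility power that reaches the servers and that 200 W is the draw of one machine; the slide gives the numbers but not these labels, so the parameter names are illustrative. The household comparison at the end is derived here and is not stated on the slide.

```python
def machines_supported(total_power_w, it_fraction, watts_per_machine):
    """Number of machines a power budget can feed.

    it_fraction and watts_per_machine are assumed readings of the
    slide's 0.8 and 200 W figures.
    """
    return total_power_w * it_fraction / watts_per_machine


def average_power_kw(kwh_per_month, hours_per_month=30 * 24):
    """Convert monthly energy use into an average power draw in kW."""
    return kwh_per_month / hours_per_month


machines = machines_supported(50e6, 0.8, 200)   # 50 MW data center
household_kw = average_power_kw(900)            # 900 kWh/month/family

# Derived comparison (not on the slide): households with the same total draw.
equivalent_households = 50e6 / (household_kw * 1000)

print(f"machines: {machines:,.0f}")
print(f"household average power: {household_kw} kW")
print(f"equivalent households: {equivalent_households:,.0f}")
```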
IDC's infrastructure
[Figure: IDC infrastructure diagram, with power feeds labeled ~AC-1 and ~AC-2; remaining labels not recoverable]
Cluster: an example
• Requirements
  - 50 TB of data, 2,000,000 queries/s and 90% cache hit rate
  - Single machine: 100~400 queries/s, 0.15~1.8 TB disk capacity
• Cluster planning
  - 2,000,000*(1-90%)/400 = 500 machines
  - 50 TB*3/500 = 0.3 TB/machine
• Can we make half or 1/3 of machines standby or
hibernated?
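The cluster-planning arithmetic can be written out as a small function. This is a sketch under two assumptions: the factor 3 is read as the number of data replicas, and the machine count uses the upper end of the per-machine throughput range (400 queries/s), as the slide does. The function and parameter names are illustrative.

```python
import math


def plan_cluster(data_tb, qps, cache_hit_rate, per_machine_qps, replicas):
    """Return (machine count, disk needed per machine in TB)."""
    # Only cache misses reach the backend machines.
    backend_qps = round(qps * (1 - cache_hit_rate))
    machines = math.ceil(backend_qps / per_machine_qps)
    # Total stored data (all replicas) spread evenly over the machines.
    disk_per_machine_tb = data_tb * replicas / machines
    return machines, disk_per_machine_tb


machines, disk_tb = plan_cluster(
    data_tb=50,
    qps=2_000_000,
    cache_hit_rate=0.90,
    per_machine_qps=400,
    replicas=3,  # assumed meaning of the "*3" on the slide
)
print(machines, disk_tb)

# The per-machine disk need should fall inside the 0.15~1.8 TB range.
assert 0.15 <= disk_tb <= 1.8
```

With these inputs the machine count is driven by query throughput rather than by disk capacity, which is why the slide then asks whether part of the cluster could be put on standby when load is lower.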
Cluster: an example (cont'd)
• Make machines standby/hibernated one by one
  - May lead to mass data shuffle, or
  - Requires data distribution
[Figure: diagram not recoverable from extraction]
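To illustrate why the data layout matters when machines are put on standby one by one, here is a minimal sketch. It is not the presenter's method: the greedy rule, the function name `choose_standby`, and both placement maps are hypothetical. A machine is moved to standby only if every data block still has a copy on some machine that stays active, so no data shuffle is needed.

```python
def choose_standby(placement, max_standby):
    """Greedily pick machines to put on standby, one by one.

    placement maps a machine name to the set of block ids it stores.
    A machine is taken only if all blocks remain available elsewhere.
    """
    all_blocks = set().union(*placement.values())
    active = set(placement)
    standby = []
    for machine in sorted(placement):
        if len(standby) == max_standby:
            break
        remaining = active - {machine}
        covered = set().union(*(placement[m] for m in remaining))
        if covered >= all_blocks:
            active = remaining
            standby.append(machine)
    return standby


# Hypothetical layouts: 6 blocks, 2 copies each, 4 machines.
scattered = {
    "m0": {0, 1, 2},
    "m1": {2, 3, 4},
    "m2": {0, 4, 5},
    "m3": {1, 3, 5},
}
grouped = {
    "m0": {0, 1, 2},
    "m1": {3, 4, 5},
    "m2": {0, 1, 2},
    "m3": {3, 4, 5},
}

print(choose_standby(scattered, max_standby=2))
print(choose_standby(grouped, max_standby=2))
```

In this toy case the scattered layout lets only one machine go on standby before some block would lose its last active copy, while the grouped layout allows half of the machines to be powered down without moving any data.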
Summary
• Power is the major constraint and cost of IDC
• To cut power consumption
  - Computer machines
  - Cooling system
  - UPS
  - System infrastructure…
• The whole industry should be involved
• Q&A