You are on page 1of 24

EMC Data Domain

Overview

Copyright 2014 EMC Corporation. All rights reserved.

EMC Data Domain

Protection Storage for Backup and Archive Data


Scale and performance

Reduce storage required by 1030x


Protect up to 100 PB of logical capacity in a single system
Complete backups fasterup to 31 TB per hour

Seamless integration

Integrates with backup, archiving, and enterprise applications

Reliable access and recovery

End-to-end data verification, fault detention, and self healing

Efficient resource utilization

Copyright 2014 EMC Corporation. All rights reserved.

Send only deduplicated data across the network to reduce


bandwidth required by up to 99%

EMC Data Domain:


Leadership and Innovation
A History of Industry Firsts
2003

2004

First deduplication
NAS

2005

2006

2007

2008

First deduplication
virtual tape library

First deduplication
volume replication

First deduplication
directory replication

2009

2010

Fastest backup
controller
Cascaded
replication

2011
First
deduplication
for long-term
retention of
backup data

First
distributed
deduplication
processing

Copyright 2014 EMC Corporation. All rights reserved.

2012

2013

2014

First
deduplication
optimized for
backup and
archive data

First inline
deduplication to
support retention
for compliance

First
deduplication
with secure
multi-tenancy

$M

3000

2500

2000

1500

MARKET LEADER

DATA PROTECTION

SOFTWARE &

STORAGE

1000

Appliance
500

Software

Source: IDC, Worldwide Purpose-Built Backup Appliance Tracker, Q42013 and IDC, Worldwide Storage Software QView, Q42013

Copyright 2014 EMC Corporation. All rights reserved.

Why Protection Storage?


Scalability

Data Integrity

Consolidation

Enable scale without cost


and complexity

Data Domain Data


Invulnerability Architecture
ensures data is recoverable
and accessible

Consolidate backup, archive,


and disaster recovery on a
single system

Inline deduplication
minimizes storage
footprint by 1030x

Copyright 2014 EMC Corporation. All rights reserved.

Protect a wide variety of


data sources

Data Domain Basics

Seamless Integration with Existing Environment


Control Tier
Backup Applications
EMC
Symantec
CommVault

VMware
HP
IBM

Archiving Applications
EMC
Symantec
CommVault

HP
Dell
OpenText

Enterprise Applications
Oracle
Pivotal
SQL

Target Tier

Disaster Recovery Tier

CIFS, NFS,
NDMP, DD Boost,
Ethernet

Virtual Tape
Library (VTL) or DD
Boost over
Fibre Channel
Data Domain System

Replication

Data Domain System

SAP
HANA
DB2

Copyright 2014 EMC Corporation. All rights reserved.

Data Deduplication: Technology Overview


Store More Backups in a Smaller Footprint
Friday Full Backup
A

Backup
Data

Logical

Estimated
Reduction

Physical

1 TB

24x

250 GB

Monday Incremental

50 GB

710x

5 GB

Tuesday Incremental

50 GB

710x

5 GB

Wednesday Incremental

50 GB

710x

5 GB

Thursday Incremental

50 GB

710x

5 GB

Second FRIDAY FULL

1 TB

5060x

18 GB

2.2 TB

7.6x

288 GB

FRIDAY FULL
A

Mon Incremental

Tues Incremental

Weds Incremental

Thurs Incremental

Second Friday Full Backup


B

A B C D E F G H

L
I

G
K L

Copyright 2014 EMC Corporation. All rights reserved.

TOTAL

Data Domain Data Invulnerability Architecture


Industrys Best Defense Against Data Integrity Issues
Stored Correctly

Inline Data
Verification

Copyright 2014 EMC Corporation. All rights reserved.

Stays Correct

Recovers Correctly

Continuous
Fault Detection
and SelfHealing

Recovery/Access
Verification

INLINE

POST-PROCESS

Deduplication Before Storing


Deduplication

Deduplication After Storing


Store

Deduplication

3x disk accesses
to shared store

Other activities
unimpeded
Predictable
Simpler

The more processes, the more resource


contention

Copy to tape: Too slow to stream tape


Recovery: Service level agreement predictability
Replication: Poor time-to-disaster-recovery
Deduplication: If interleaved with backup or
restore

More administration to fight these issues

Copyright 2014 EMC Corporation. All rights reserved.

CPU-Centric v Spindle-Bound
Performance
6,000

Throughput MB/s

Improvement since 2003:


Throughput:
~200x
Capacity:
~1650x

Data Domain

Fibre Channel

SATA

Most
deduplication
vendors
50
50

100

150

200

Number of Disk Spindles

Copyright 2014 EMC Corporation. All rights reserved.

10

Secure Multi-Tenancy

Delivering Data Protection as a Service

Enables enterprises to deliver Data


Domain in a private cloud

Enables service providers to deliver


Data Domain in a private/public
cloud

Features:
Logical data isolation
and administration
Roles for users and admin
Tenant management and reporting

Copyright 2014 EMC Corporation. All rights reserved.

Tenant A

Tenant
Unit A
Tenant
Unit B

Tenant B

Data Domain
system

11

Data Domain Software Options


Data Domain Boost

Data Domain Replicator

Advanced integration with apps

Network-efficient and encrypted

Speed backups by up to 50%

Consolidate up to 270 remote sites into


a single system

Data Domain Encryption

Data Domain Extended Retention

Inline encryption of data at rest

Long-term retention of backup

Protects against theft or loss of a physical


system

Up to 100 PB logical capacity

Data Domain Retention Lock

Data Domain Virtual Tape Library

Secure retention for archive data

Supports open systems and


IBM i operating environments

Satisfies governance and compliance

Copyright 2014 EMC Corporation. All rights reserved.

12

Data Domain Boost

Advanced Integration for Faster Backup

DD Boost

Copyright 2014 EMC Corporation. All rights reserved.

Advanced integration with leading backup


and enterprise applications
Speeds backups by up to 50%
Enables more efficient resource utilization
Provides application control of Data Domain
replication process

13

Data Domain Boost Ecosystem


NetWorker

NetBackup Backup Exec

vRanger

NetVault

VDP
Advanced

Data
Protector

Greenplum

RMAN

SAP

SAP
HANA

DB2

SQL

Backup
Backup
Server
Server

App
App
Server
Server

Avamar

Copyright 2014 EMC Corporation. All rights reserved.

DD
DD Boost
Boost

Supported over LAN

DD
DD Boost
Boost

Supported over SAN

DD
DD Boost
Boost

Supported over WAN

14

Data Domain Replicator

Network-Efficient Replication for Backup and Archive Data


Reduces bandwidth requirements up to 99%
Protects sensitive data when replicating over
untrusted networks
Accelerates time-to-disaster recovery (DR)
readiness
Consolidates backup and archive data from
hundreds of remote sites
Leverages multiple replication topologies
Disaster Recovery Site

Copyright 2014 EMC Corporation. All rights reserved.

15

Data Domain Encryption

Enhance the Security of Backup and Archive Data

Backup

Archive

Copyright 2014 EMC Corporation. All rights reserved.

Encrypts all data stored on a Data Domain


system
Encrypts data inline before its written to
disk
Leverage the internally generated static
default key or rotate keys for compliance

16

Data Domain Extended Retention


Long-Term Retention of Backup Data

z
Data Domain Controller

Active Tier

Separate tiers of storage for long-term


retention of data to eliminate reliance on tape
Cost-effective scalability
Fault isolation for access and recoverability of
long-term data
Granular replication for simplified disaster
recovery

Retention Tier

Copyright 2014 EMC Corporation. All rights reserved.

17

Data Domain Retention Lock


Governance and Compliance for Archive Data
Archive
Software

Efficiently store and manage governance and


compliance archive data on a single Data
Domain system

Meets the strictest regulatory requirements such


as SEC 17a-4(f)

Litigation hold protects archive data during legal


actions

Secure file locking of archive data at an


individual file level

Integrates seamlessly with industry-leading


archiving applications

Backup
Backup Data
Data
Archive
Archive Data
Data
Governance
Governance
Archive
Archive Data
Data
Compliance
Compliance
Archive
Archive Data
Data

Copyright 2014 EMC Corporation. All rights reserved.

18

Data Domain Virtual Tape Library

High-Speed, Inline Deduplication for SAN Environments


Eliminates physical tape challenges
Integrates seamlessly into existing Fibre
Channel SAN environments
Replicates virtual tape cartridges efficiently
offsite, over a wide area network (WAN)

Copyright 2014 EMC Corporation. All rights reserved.

19

Data Domain Management Center

Virtual Appliance for Aggregate Multi-system Management


Dashboards show the aggregate status of all
Data Domain systems
Manages and monitors up to 75 Data
Domain systems through a single interface
Role-based access control restricts access to
authorized users

Copyright 2014 EMC Corporation. All rights reserved.

20

Data Domain Systems

Protection Storage for Backup And Archive


Data Sources

Databases
Email Servers

Virtual Machines

Enterprise Applications

File Shares/Servers

Backup

Content Management

Archive
Archive Use Cases

Backup Use Cases


Database
Mainframe
IBM i
Big Data

File/Email
Big Data
Virtual Machine

File/Email
VMware
NAS
ROBO

Content Mgmt.
Storage Tiering
Database

Network
Replication
Over WAN

Disaster Recovery,
Long-Term Retention

On Premise or Cloud

Copyright 2014 EMC Corporation. All rights reserved.

21

Data Domain Systems


Data Domain Software

DD
DD
DD
DD

Boost
Encryption
Extended Retention
Management Center

Large Enterprise
Large Enterprise

DD Replicator
DD Retention Lock
DD Virtual Tape Library

Midsize Enterprise
Midsize Enterprise
Small Enterprise/ROBO
Small Enterprise/ROBO

DD160

DD2200

DD2500

DD4200

DD4500

DD7200

DD990

Speed (DD Boost)

1.1 TB/hr

4.7 TB/hr

13.4 TB/hr

22.0 TB/hr

22.0 TB/hr

26.0 TB/hr

31.0 TB/hr

Speed (other)

667 GB/hr

3.5 TB/hr

5.3 TB/hr

10.2 TB/hr

10.2 TB/hr

11.9 TB/hr

15.0 TB/hr

Logical capacity

40195 TB

172860 TB

1.36.6 PB

1.8-9.4 PB
5.6-28.4 PB1

2.8-14.2 PB
11.4-57.0 PB1

4.2-21.4 PB
17.1-85.61

5.728.5 PB
Up to 100 PB1

Usable capacity

Up to 3.98 TB

Up to 17.2 TB

Up to 133 TB

Up to 189 TB
Up to 569 TB1

Up to 285 TB
Up to 428 TB
Up to 570 TB
Up to 1.1 PB1
Up to 1.7 PB1
Up to 2.0 PB1
1
With DD Extended Retention software option

Copyright 2014 EMC Corporation. All rights reserved.

22

Why Data Domain?

Protection Storage for Backup and Archive Data


Industry-leading speed and scale
Seamless integration
Backup
Data

Reliable access and recovery


Efficient network utilization

Archive
Data

Copyright 2014 EMC Corporation. All rights reserved.

23