Вы находитесь на странице: 1из 12

AccessDatacomesfrommanysources,including

legacyapplicationandsystems,databases,modern
applications,variousXMLmessagesandnumeroustypes
ofdocuments(spreadsheets,projectplans,text
documents,etc).Identifyingandaccessingthesesources
isthefirststeptodataintegration.

Discovery - This involves bringing all data sources


out into the open, and documenting the uses and
structures of poorly understood or described
sources. This is also the point at which data
semantics (patterns or rules that emerge from its
structure and use) and quality issue should be
noted and flagged for further action.

CleansingDataiscleanedupforaccuracyand
integrity.Clean-upcaninvolvedetectingandcorrecting
errors,supplyingmissingelementsandvalue,enforcing
datastandards,validatingdataandpurgingduplicate
entries.

Integration-Thisstepinvolvesconsolidatingdata
acrossallsystemsandapplications,accessingtheir
fragmenteddata,creatinganaccurateandconsistentview
oftheirinformationassets,andleveragingthoseassetsto
drivebusinessdecisionsandoperations.Thisoftenmeans
resolvinginconsistentutilizationanddefinitionfor
identicaltermsacrossdifferentcontexts.

DeliveryCorrect,relevantdataismadeavailablein
properform,inatimelymanner,toallusersand
applicationsthatneedsuchaccess.Thismightmean
respondingtoqueriesthatresultinsinglerecordsorsmall
answersetstodeliveringentiredatasetsfortrendanalysis
orenterprise-widereporting.Thisstepalsoaddresses
needsfordatasecurity,availability,privacyand
compliancerequirementsrelatedtoaccessanduse.

DevelopmentandManagement-ThisiswhereXMLbasedtoolsetsenablethosewhomanagedata;business
analysts,architects,developersandmanagerstowork
togetherincreatingacomprehensivesetofdata
integrationrules,processes,practicesandprocedures,
therebycapturingandimplementingallthesubstantive
workdoneinthefiveprecedingsteps.Thisstepalso
tacklesissuesrelatedtoperformance,scalabilityand
reliabilityneedsforkeyenterpriseapplicationsand
services.

Auditing,MonitoringandReportingOnceits
semanticsanduseshavebeencaptured,omissions
remedied,errorscorrected,andqualityexaminedand
assured,ongoingobservationandanalysisisrequiredto
keepthedataclean,correct,reliableandavailable.This
partoftheprocessmakesitpossibletoflagpotential
issuesastheyoccurandtocyclethembackthroughthis
lifecycletomakesuretheyresolved.Auditingalsohelps
tomakesurethatdataremainsvisible,undercontrol,and
abletoguidefuturechangesandenhancements.

INTRODUCTION

Data integration focuses mainly on databases. A


database is an organized collection of data. It's
similar to a file system, which is an organizational
structure for files so they're easy to find, access
and manipulate.

WHY
The goal of data integration is to gather data from
different sources, combine it and present it in such
a way that it appears to be a unified whole.
Let's say you're about to leave on a trip and you
want to see what traffic is like before you decide
which route to take out of town. Here's how the
different approaches to data integration would
handle your query.
An integrated data solution makes it easy to keep
information up to date. One input can propagate
across all integrated systems, keeping your data
current. In fact, your data can even be real-time if
a server or cloud solution is part of the integration
strategy.

ADVANTAGES
Fast & Complex Query Process
High Volume of Data Processing
Less Costlier
Data Freshness
DISADVANTAGES
High Dependency
Slow Query Response

WHERE
a data warehouse is a database that stores information from other databases using a common
format. That's about as specific as you can get when describing data warehouses. There's no
unified definition that dictates what data warehouses are or how designers should build them. As a
result, there are several different ways to create data warehouses, and one data warehouse might
look and behave very differently from another.
In general, queries to a data warehouse take very little time to resolve. That's because the data
warehouse has already done the major work of extracting, converting and combining data. The
user's side of a data warehouse is called the front end, so from a front-end standpoint, data
warehousing is an efficient way to get integrated data.

Вам также может понравиться