Вы находитесь на странице: 1из 18

1

Jump to Page
1

You are on page 1of 31


Search inside document

Syllabus for unit test – ICS2032 DATA WAREHOUSING AND DATA MININGUNIT I DATA
WAREHOUSING
Data warehousing Components –Building a Data warehouse –- Mapping the Data Warehouse
toa M ul t i p ro c ess or Ar c hi t e ct u r e – D BM S S c h em as fo r D e ci si on S u pp ort –
D at a Ex t r a ct i on, Cleanup, and Transformation Tools –Metadata.
UNIT II BUSINESS ANALYSIS
Reporting and Query tools and Applications – Tool Categories – The Need for
Applications – Cognos Impromptu – Online Analytical Processing (OLAP) – Need –
Multidimensional DataModel –
Data Warehouse Introduction
A d at a w a r eh ous e i s a c ol l e ct i on o f da t a m a rt s r ep r es ent i n g hi st or i c al d at a f r
o m d i f f e re nt operations in the company. This data is stored in a structure optimized
for querying and dataanalysis as a data warehouse. Table design, dimensions and
organization should be consistentthroughout a data warehouse so that reports or queries
across the data warehouse are consistent.A data warehouse can also be viewed as a
database for historical data from different functions within a company.The term Data
Warehouse was coined by Bill Inmon in 1990, which he defined in thefollowing way:
"A warehouse is a subject-oriented, integrated, time-variant and non-volatilecollection
of data in support of management's decision making process". He defined the terms inthe
sentence as follows:
Subject Oriented:
Da t a t h a t gi v es i n fo rm at i on a bou t a p art i cu l a r su bj e ct i nst e a d o f a bou t acompa
ny's ongoing operations.
Integrated:
Data that is gathered into the data warehouse from a variety of sources and mergedinto a
coherent whole.
Time-variant:
All data in the data warehouse is identified with a particular time period.
Non-volatile:
Data is stable in a data warehouse. More data is added but data is never removed.This enables
management to gain a consistent picture of the business. It is a single, complete andconsistent
store of data obtained from a variety of different sources made available to end usersin what they
can understand and use in a business context. It can be

Used for decision Support

Used to manage and control business

Used by managers and end-users to understand the business and make judgments

Data Warehousing is an architectural construct of information systems that provides users


withcurrent and historical decision support information that is hard to access or present in
traditionaloperational data storesOther important terminology
Enterprise Data warehouse:
It c ol l e ct s a l l i n fo r m at i on a bo ut su bj ec t s (
customers, products, sales, assets, personnel
) that span the entire organizationData Mart: Departmental subsets that focus on selected
subjects. A data mart is a segment of adata warehouse that can provide data for reporting
and analysis on a section, unit, department or operation in the company, e.g. sales, payroll,
production. Data marts are sometimes completeindividual data warehouses which are
usually smaller than the corporate data warehouse.
Decision Support System (DSS):
In f o rm at i on t e chn ol o g y t o h el p t he k no wl ed ge w or k er (executive, manager, and
analyst) makes faster & better decisions
Drill-down:
Traversing the summarization levels from highly summarized data to the underlyingcurrent or
old detail
Metadata:
Data about data. Containing location and description of warehouse sy
s t e m components: names, definition, structure…Benefits of data warehousing

Data warehouses are designed to perform well with aggregate queries running on
largeamounts of data.

The structure of data warehouses is easier for end users to navigate, understand and queryagainst
unlike the relational databases primarily designed to handle lots of transactions.

Da t a wa r eh ous e s en abl e qu e ri es t h at cut a c ro ss di ff e r ent s e gm en t s o f a co m p
an y's operation. E.g. production data could be compared against inventory data even
if theywere originally stored in different databases with different structures.

Queries that would be complex in very normalized databases could be easier to build
andmaintain in data warehouses, decreasing the workload on transaction systems.

Data warehousing is an efficient way to manage and report on data that is from a varietyof
sources, non uniform and scattered throughout a company.

Data warehousing is an efficient way to manage demand for lots of information from lotsof
users.

Data warehousing provides the capability to analyze large amounts of historical data for nuggets
of wisdom that can provide an organization with competitive advantage.Operational and
informational Data• Operational Data:

Focusing on transactional function such as bank card withdrawals and deposits


Detailed

Updateable

Reflects current data• Informational Data:

Focusing on providing answers to problems posed by decision makers

Summarized

Non updateableData Warehouse Characteristics• A data warehouse can be viewed as an
information system with the following attributes: – It is a database designed for analytical tasks –
It’s content is periodically updated – It contains current and historical data to provide a historical
perspective of informationOperational data store (ODS)• ODS is an architecture concept to
support day-to-day operational decision support and containscurrent value data propagated from
operational applications• ODS is subject-oriented, similar to a classic definition of a Data
warehouse• ODS is
integratedHowever:O D S D A T A W
A R E H O U S E V o l a t i
l e N o n v o l a t i l e
You're reading a preview. Unlock full access with a free trial.
Pages 4 to 31 are not shown in this preview.

Download With Free Trial

You're Reading a Preview


Unlock full access with a free trial.
Download With Free Trial

Related Interests

 Data Warehouse
 Databases
 Metadata
 Parallel Computing
 Information Management

Documents Similar To Data Warehouse and Data Mining Notes


Carousel Next

Data Warehouse and Data Mining


UPLOADED BY

chiranjeeb_mimts

Data Warehousing and Data Mining Notes [Unit i and II]


UPLOADED BY

Thasleem Bin Aushiq Hussain

Data Warehouse complete notes


UPLOADED BY

shankarssr

Data warehouse
UPLOADED BY
nagaraju-g

Data warehousing and data mining


UPLOADED BY

Bridget Smith

Data Warehousing and Data Mining


UPLOADED BY

Camilo Amarcy

Data Warehousing and Data Mining


UPLOADED BY

Abbas Hashmi


DATA MINING AND DATA WAREHOUSING
UPLOADED BY

Bridget Smith

Indroduction to Data Warehousing (Alex Berson)


UPLOADED BY

Md Saif

Big Data Project Report


UPLOADED BY

HemanthAroumougam

SOFTWARE ENGINEERING LAB MANUAL


UPLOADED BY
PRIYA RAJI

what is a HMM
UPLOADED BY

afsana_shimu

Deferential Geometry (by Garrett Lisi) - WWW.OLOSCIENCE.COM


UPLOADED BY

Fausto Intilla

Operating System Notes:Spooling:Acronym for Simultaneous Peripheral


Operations on-line, Spooling Refers
UPLOADED BY

ppiiyyuuss

Programming_with_Solutions C
UPLOADED BY

anshu19

Mining Frequent Patterns Without Candidate FPGrowth 2004


UPLOADED BY

Vũ Đức Toàn

chapter 5
UPLOADED BY

harinima

Welcome to International Journal of Engineering Research and Development


(IJERD)
UPLOADED BY
IJERD

Lecture 8-9 Association Rule Mining.ppt


UPLOADED BY

Muhammad Usman

Modern Association Rule Mining Methods


UPLOADED BY

ijcsity

Slides: R Introduction
UPLOADED BY

Julio José


Algorithms
UPLOADED BY

Alexandra-Petronela

Optimization Algorithms for Association Rule Mining


UPLOADED BY

harinima

Minimizing Spurious Patterns Using Association Rule Mining


UPLOADED BY

seventhsensegroup

Efficient Temporal Association Rule Mining


UPLOADED BY
International Journal of Engineering Inventions (IJEI)

CHAPTER 2
UPLOADED BY

harinima

An Efficient Algorithm for Anonymization of Set-Valued Data and


Representation Using Fp-Tree
UPLOADED BY

ijaitjournal

Ensemble Methods in Data Mining


UPLOADED BY

Jagadeeswara Rao A

TOC FINAL
UPLOADED BY

harinima

Possible Algorithms of 2NF and 3NF for DBNorma- A tool for Relational
Database Normalization
UPLOADED BY

IDES

More From Samrat Saxena


Mobile-Databases Presentation
UPLOADED BY

Samrat Saxena


Security Issues Mobile Database
UPLOADED BY

Samrat Saxena

ui-poster
UPLOADED BY

Samrat Saxena

eRTOS
UPLOADED BY

Samrat Saxena

SecMobDB
UPLOADED BY
Samrat Saxena

Cake Php
UPLOADED BY

Samrat Saxena

Footer Menu
Back To Top
ABOUT

 About Scribd
 Press
 Our blog
 Join our team!
 Contact Us
 Invite Friends
 Gifts

SUPPORT

 Help / FAQ
 Accessibility
 Purchase help
 AdChoices
 Publishers

LEGAL

 Terms
 Privacy
 Copyright
Social Media

o
o
o
o
o

 Copyright © 2018 Scribd Inc.


 Browse Books
 Site Directory
 Site Language:

English

Вам также может понравиться