Академический Документы
Профессиональный Документы
Культура Документы
• Late 1980s-present
– Advanced Data Analysis
• Data warehouse and OLAP
• Data mining and knowledge discovery
• Advanced data mining appliations
• Data mining and socity
• 1990s-present:
– XML-based database systems
– Integration with information retrieval
– Data and information integreation
Present – future:
New generation of integrated data and information
system.
Task-relevant Data
Data Integration
Pattern evaluation
Data
Databases Warehouse
Dr. C.NAGARAJU HEAD OF CSE YSREC of
YVU Proddatur
Data Mining and Business
Intelligence
Increasing potential
to support
business decisions End User
Making
Decisions
Data Exploration
Statistical Analysis, Querying and Reporting
Similarity-based analysis
Information
Science Data Mining MachineLearning
Visualization Other
Disciplines
Dr. C.NAGARAJU HEAD OF CSE YSREC of
YVU Proddatur
Data Mining: Classification Schemes
General functionality
Descriptive data mining
Knowledge to be mined
Data warehousing:
The process of constructing and using data warehouses is called
datawarehousing
time,location,supplier
time,item,location 3-D cuboids
time,item,supplier item,location,supplier
4-D(base) cuboid
Dr. C.NAGARAJU
time, item, HEAD OF CSE YSREC of
location, supplier
YVU Proddatur
Conceptual Modeling of Data Warehouses
Modeling data warehouses: dimensions & measures
Star schema: A fact table in the middle connected to a set
of dimension tables
Snowflake schema: A refinement of star schema where
some dimensional hierarchy is normalized into a set of
smaller dimension tables, forming a shape similar to
snowflake
Fact constellations: Multiple fact tables share dimension
tables, viewed as a collection of stars, therefore called
galaxy schema or fact constellation
branch_key
branch location
location_key
location_key
branch_key
units_sold street
branch_name
city_key
branch_type city
dollars_sold
city_key
avg_sales city
Measures province_or_street
country
Dr. C.NAGARAJU HEAD OF CSE YSREC of
YVU Proddatur
Example of Fact
Constellation
time
time_key item Shipping Fact Table
day item_key
day_of_the_week Sales Fact Table item_name time_key
month brand
quarter time_key type item_key
year supplier_type shipper_key
item_key
branch_key from_location
Other operations
drill across: involving (across) more than one fact
table
Operational meta-data
Business data
business terms and definitions, ownership of data, charging
policies
Multi-Tier Data
Warehouse
Distributed
Data Marts
Enterprise
Data Data
Data
Mart Mart
Warehouse
mining
Layer2
MDDB
MDDB
Meta Data
C c3 61
c2 45
62 63 64
46 47 48
c1 29 30 31 32
c0
B13 14 15 16 60
b3 44
B 28 56
b2 9
40
24 52
b1 5
36
20
b0 1 2 3 4
a0 a1 a2 a3
A