Академический Документы
Профессиональный Документы
Культура Документы
Section – 1 Marks
Q.1 (A) How is a data warehouse different from a database? How are they similar? [5]
Q.1 (B) What is Big Data? Explain challenges of Big Data. [5]
Q.1 (C) Explain difference between OLTP and OLAP. [5]
OR
Q.1 (C) Explain Star Schema with example. [5]
Q.2 (A) Suppose that the data for analysis includes the attribute age. The age values for the data [5]
tuples are (in increasing order) 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30,
33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70.
Use smoothing by bin means to smooth the above data, using a bin depth of 3. Illustrate
your steps.
Q.2 (B) What is Sampling? Explain Data Sampling techniques. [5]
OR
Q.2 (A) Using the data for age given in Question – 2(A) as above, answer the following: [5]
(a) Use min-max normalization to transform the value 35 for age onto the range [0.0;
1.0].
(b) Use z-score normalization to transform the value 35 for age, where the standard
deviation of age is 12.94 years.
Q.2 (B) What are the value ranges of the following normalization methods? [5]
(a) min-max normalization
(b) z-score normalization
(c) normalization by decimal scaling
P.T.O.
Section – 2
Q.4 (A) Explain Apriori Algorithms with example. [5]
Q.5 (A) Briefly outline the major steps of decision tree classification. [5]
Q.6 (A) Differentiate the Big Data and traditional Enterprise Relational Data. [5]
OR
Q.6 (A) Explain major steps for Hadoop Implementation. [5]
Q.6 (B) Explain Map-Reduce programming model with any example. [5]