Вы находитесь на странице: 1из 2

KADI SARVA VISHWAVIDYALAYA

B.E. SEMESTER 7TH EXAMINATION APRIL – MAY, 2016

SUBJECT CODE: IT – 701 SUBJECT NAME: BIG DATA ANALYTICS

DATE: 20/04/2016 TIME: 10:00 am to 01:00 pm TOTAL MARKS: 70

Instructions: 1) All questions are compulsory.


2) Figures to the right indicate full marks.
3) Indicate clearly, the options you attempt along with its respective question number.
4) Use the last page of main supplementary for rough work.

Section – 1 Marks
Q.1 (A) How is a data warehouse different from a database? How are they similar? [5]
Q.1 (B) What is Big Data? Explain challenges of Big Data. [5]
Q.1 (C) Explain difference between OLTP and OLAP. [5]
OR
Q.1 (C) Explain Star Schema with example. [5]

Q.2 (A) Suppose that the data for analysis includes the attribute age. The age values for the data [5]
tuples are (in increasing order) 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30,
33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70.
Use smoothing by bin means to smooth the above data, using a bin depth of 3. Illustrate
your steps.
Q.2 (B) What is Sampling? Explain Data Sampling techniques. [5]
OR
Q.2 (A) Using the data for age given in Question – 2(A) as above, answer the following: [5]
(a) Use min-max normalization to transform the value 35 for age onto the range [0.0;
1.0].
(b) Use z-score normalization to transform the value 35 for age, where the standard
deviation of age is 12.94 years.
Q.2 (B) What are the value ranges of the following normalization methods? [5]
(a) min-max normalization
(b) z-score normalization
(c) normalization by decimal scaling

Q.3 (A) Explain Three – Tier Data Warehouse Architecture. [5]


Q.3 (B) Explain Market Basket Analysis. [5]
OR
Q.3 (A) Explain Enterprise Warehouse, Data Mart and Virtual Warehouse. [5]
Q.3 (B) Explain Frequent Itemsets, Closed Itemsets, and Association Rules [5]

P.T.O.
Section – 2
Q.4 (A) Explain Apriori Algorithms with example. [5]

Q.4 (B) Briefly explain KDD Process. [5]

Q.4 (C) What is Prediction? Explain prediction by Regression. [5]


OR
Q.4 (C) What is application of concept hierarchy? Draw concept hierarchy for location (country, [5]
state, city, street) and time (year, quarter, month, week, day).

Q.5 (A) Briefly outline the major steps of decision tree classification. [5]

Q.5 (B) Explain the major clustering methods. [5]


OR
Q.5 (A) Explain the criteria based on classification and prediction methods are compared. [5]

Q.5 (B) What is Cluster analysis? Explain its importance. [5]

Q.6 (A) Differentiate the Big Data and traditional Enterprise Relational Data. [5]

Q.6 (B) Explain various components of Hadoop ecosystem. [5]

OR
Q.6 (A) Explain major steps for Hadoop Implementation. [5]

Q.6 (B) Explain Map-Reduce programming model with any example. [5]

Вам также может понравиться