Вы находитесь на странице: 1из 6

B8IT106 Tools for Data Analytics

QQI
Higher Diploma in Science in Data Analytics
January 2016
Module Code:

B8IT106

Module Description:

Tools for Data Analytics

Examiner:

Thomas Fitzsimons

Internal Moderator:

Niall Larkin

External Examiner:

Dr Brett Becker
Date:
Time:

Friday 29th January 2016


10.00-12.00

INSTRUCTIONS TO CANDIDATES
Time allowed is 2 hours
Question 1 is a mandatory question.
Answer any 3 out of the remaining 4 Questions
All questions carry 25 marks
Answers to all questions to be written in answer books provided.

Page 1 of 6

B8IT106 Tools for Data Analytics

Question 1: Mandatory question.


1a.What open-source software was developed from Googles MapReduce concept?
A. Puppet
B. Splunk
C. Hadoop
D. MongoDB
(1 Mark)
1b. Write a brief appraisal of the open-source software that was developed from Googles
MapReduce concept.
(3 Marks)
2a.Of the following terms which is not one of the Vs of Big Data as formulated by Gartner
Research.
A. Value
B. Volume
C. Velocity
D. Variety
(1 Mark)
2b. Write a brief appraisal of two of the Vs of Big Data as formulated by Gartner Research.
(4 Marks)
3a.How did Hadoop get its name?
A. Its an acronym
B. Toy elephant
C. An imaginary friend
D. A fictional character from literature
(1 Mark)
3b.What year was Hadoop created, by whom and for what purpose was it originally developed.
(3 Marks)
4a.What is Hadoop file system called?
A. HUNK
B. YARN
C. HDFS
D. HBASE
(1 Mark)
4b. Write a brief appraisal of the Hadoop file system.
(3 Marks)

Page 2 of 6

B8IT106 Tools for Data Analytics

5a. Organisations engage in a variety of activities that generate records. Propose which of the
record types listed below are of most interest to those organisations.
A. Emails
B. Business Transactions
C. Social Media
D. Log Data
(1 Mark)
5b. Justify your answer.
(3 Marks)
6a. According to a very recent Jaspersoft survey, which of the following is the most popular data
store.
A. Relational Databases
B. Hadoop HDFS
C. Analytic Databases
D. MongoDB
(1 Mark)
6b. Justify your answer.
(3 Marks)
(Question 1 Total 25 Marks)

Question 2: Data Variety


A. Write a brief appraisal of the term Unstructured Data
(5 Marks)
B. Formulate a list of 5 examples of Unstructured Data sources.
(5 Marks)
C. Write a brief appraisal of the term Semi-Structured Data
(5 Marks)
D. Formulate a list of 5 examples of Semi-Structured Data.
(5 Marks)
E. Some analysts have extended the concept of the three Vs to four Vs. Identify and write a brief
appraisal of the 4th V.
(5 Marks)
(Question 2 Total 25 Marks)

Page 3 of 6

B8IT106 Tools for Data Analytics

Question 3: Data Analysis Tools


A. Select two of the three data analysis tools listed below and evaluate them against each of the
headings in the table.
1. Statistical Analysis System (SAS).
2. Python.
3. R.
Heading
Brief Description
Cost
System Specifications
Availability
Ease of Installation
Upgradeability
Support
Easy of Learning
Employment Opportunities

SAS

Python

(20 Marks)
B. Write a brief appraisal of the following statement: SAS is the preferred data analytics tool of
business companies and governments around the world.
(5 Marks)
(Question 3 Total 25 Marks)

Page 4 of 6

B8IT106 Tools for Data Analytics

Question 4: Business Intelligence


Select three of the four Business Intelligence (BI) technologies listed below and appraise them
against each of the headings in the table.
Business Intelligence (BI) technologies
1. (OLAP) Multidimensional data analysis
2. Data mining
3. Text mining
4. Web mining
Brief Description of the BI technology
Type of data the BI technology is used on.
Types of information obtainable from use of BI technology.
How does it use improve business performance?
How does it use improve decision making in the organisation?

Total (25 Marks)

(Question 4 Total 25 Marks)

Page 5 of 6

B8IT106 Tools for Data Analytics

Question 5 : Data Science


A. Critique an SQL Relational Database vs Hadoop. What are the differences, the advantages and
disadvantages of both systems?
(20 Marks)
B. Evaluate the following statement Data scientists are professionals with a unique skill set
(3 Marks)
C. Formulate a list of Data scientists skills.
(2 Marks)
(Question 5 Total 25 Marks)

END OF EXAMINATION

Page 6 of 6

Вам также может понравиться