Cps 8210 Assignment 2

Загружено:

Sarama Kamal Syed

0% нашли этот документ полезным (0 голосов)

119 просмотров3 страницы

assignment for data mining

Авторское право

Доступные форматы

PDF, TXT или читайте онлайн в Scribd

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Пожаловаться на этот документ

assignment for data mining

Авторское право:

Доступные форматы

Скачайте в формате PDF, TXT или читайте онлайн в Scribd

Отметить как неприемлемый контент

0% нашли этот документ полезным (0 голосов)

119 просмотров3 страницы

Cps 8210 Assignment 2

Загружено:

Sarama Kamal Syed

assignment for data mining

Авторское право:

Доступные форматы

Скачайте в формате PDF, TXT или читайте онлайн в Scribd

Отметить как неприемлемый контент

Перейти к странице

Вы находитесь на странице: 1из 3

Поиск в документе

CP8210

ASSIGNMENT 2

Assignment 2 Due date Oct 18 (15%)

PART 1 - Data Mining
Data Mining refers to analyzing massive amount of data for finding patterns, trends and
relationships to form model to be used in business decision making, prediction, simulations and
etc. Two approaches frequently used in data mining are clustering and classification.
a) By explaining two examples for classification and clustering methods explain what are the
differences between two approaches and when each of them should be used.
b) One of the common method of clustering is k-Means. General k-Means algorithm that is
shown below is from Chapter 28th of Database book by Elmasri detailed in reference).

First use the above algorithm and with explaining and using common similarity metric distance
between a record, and also using a value of 3 for K, cluster the data of following table. You can
assume that the records with RIDs 1, 3, and 5 are used for the initial cluster centroids (means).
Try to follow the algorithm and calculate the centroids in a way that clusters to be optimum.
RID
1
2
3
4
5
6

Dimension 1
8
5
2
2
2
8

Dimension 2
4
4
4
6
8
6

Reference Part 1: Fundamental of Database Systems, 7th (or 6th) edition , By Elmasri and
Navathe, Pearson Publication, Chapter 28, Data Mining concepts.

CP8210

ASSIGNMENT 2

PART2-Simulating Distributed Computing using MapReduce

Implement Matrix multiplication by simulating MapReduce using java code below. Modify the
code and show the output of mapper and reducer and explain your model in details. Assume you
have computing nodes working in parallel and show in your model the number of mappers and
reducers. You can use the MapReduce models explained in Sections 2.3.9 and 2.3.10 in Mining
of Massive Datasets book by Leskovec et. al.

Below is the Java code to perform Multiplication of two matrix A[i][j] and B[j][k].
Where,
C[i][j] = A[i][0] * B[0][j] + A[i][1] * B[1][j] + A[i][2] * B[2][j] + .... A[i][n-1] * B[n1][j]

import java.util.Scanner;
public class MatrixMultiplication {
public static void main(String[] args) {
Scanner s = new Scanner(System.in);
System.out.print("Enter number of rows in A: ");
int rowsInA = s.nextInt();
System.out.print("Enter number of columns in A / rows in B: ");
int columnsInA = s.nextInt();
System.out.print("Enter number of columns in B: ");
int columnsInB = s.nextInt();
int[][] a = new int[rowsInA][columnsInA];
int[][] b = new int[columnsInA][columnsInB];
System.out.println("Enter matrix A");
for (int i = 0; i < a.length; i++) {
for (int j = 0; j < a[0].length; j++) {
a[i][j] = s.nextInt();
}
}
System.out.println("Enter matrix B");
for (int i = 0; i < b.length; i++) {
for (int j = 0; j < b[0].length; j++) {
b[i][j] = s.nextInt();
}
}

CP8210

ASSIGNMENT 2

int[][] c = multiply(a, b);

System.out.println("Product of A and B is");
for (int i = 0; i < c.length; i++) {
for (int j = 0; j < c[0].length; j++) {
System.out.print(c[i][j] + " ");
}
System.out.println();
}
}
public static int[][] multiply(int[][] a, int[][] b) {
int rowsInA = a.length;
int columnsInA = a[0].length; // same as rows in B
int columnsInB = b[0].length;
int[][] c = new int[rowsInA][columnsInB];
for (int i = 0; i < rowsInA; i++) {
for (int j = 0; j < columnsInB; j++) {
for (int k = 0; k < columnsInA; k++) {
c[i][j] = c[i][j] + a[i][k] * b[k][j];
}
}
}
return c;
}
}

Bonus Mark : There will b up to 5% bonus marks if you implement Hadoop as an underline
platform for the implemented MapReduce model

Вам также может понравиться

Cis016-2, Cis116-2 & Pat001-2
Документ5 страниц
Cis016-2, Cis116-2 & Pat001-2
ruslanas re
Оценок пока нет
SAPBW Technical Specification Template
Документ30 страниц
SAPBW Technical Specification Template
mkumar26
100% (2)
PC210-240-7K M Ueam001704 PC210 PC230 PC240-7K 0310 PDF
Документ363 страницы
PC210-240-7K M Ueam001704 PC210 PC230 PC240-7K 0310 PDF
Carlos Israel Gomez
100% (10)
20BCS4585 - ANANYA SINGH - JAVA Worksheet-3
Документ6 страниц
20BCS4585 - ANANYA SINGH - JAVA Worksheet-3
Ananya Singh
Оценок пока нет
Assignment
Документ2 страницы
Assignment
Ramesh Rathod
Оценок пока нет
ICSE Class 10 Computer Applications
Документ68 страниц
ICSE Class 10 Computer Applications
satnamghai
60% (5)
List of Lab Exercises
Документ3 страницы
List of Lab Exercises
Ajay Raj Srivastava
Оценок пока нет
Pic Miceopeoject
Документ20 страниц
Pic Miceopeoject
413 YASH MANE
Оценок пока нет
Quiz1 Questions
Документ2 страницы
Quiz1 Questions
Pritesh Gethewale
Оценок пока нет
Sample Paper of Computer Science Class 12
Документ5 страниц
Sample Paper of Computer Science Class 12
Niti Arora
Оценок пока нет
Data Structures Algorithms
Документ21 страница
Data Structures Algorithms
Diluxan So
Оценок пока нет
BSC (H) Sem 3 Guidelines July 2012
Документ17 страниц
BSC (H) Sem 3 Guidelines July 2012
Priyanka D Singh
Оценок пока нет
Filename: MCA-Science Question - Paper PDF
Документ75 страниц
Filename: MCA-Science Question - Paper PDF
atulzende
Оценок пока нет
Final CSE 4361
Документ2 страницы
Final CSE 4361
Md. Sohel Rahman
Оценок пока нет
HCT108 Research Questions 1
Документ7 страниц
HCT108 Research Questions 1
Simple lyrics
Оценок пока нет
Exam I (Review List) - Answer Key
Документ13 страниц
Exam I (Review List) - Answer Key
Rylee Simth
Оценок пока нет
Unit Iii & Iv
Документ4 страницы
Unit Iii & Iv
SangeethRaj PS
Оценок пока нет
Welcome To International Journal of Engineering Research and Development (IJERD)
Документ5 страниц
Welcome To International Journal of Engineering Research and Development (IJERD)
IJERD
Оценок пока нет
Data Structure Using C and C++ Basic
Документ8 страниц
Data Structure Using C and C++ Basic
aman deeptiwari
Оценок пока нет
Choice Based Credit System: Semester Total Credit I
Документ18 страниц
Choice Based Credit System: Semester Total Credit I
Sumit Halder
Оценок пока нет
Software Process and Requirement
Документ25 страниц
Software Process and Requirement
Unicorn54
Оценок пока нет
Advanced Data Structures and Algorithms
Документ4 страницы
Advanced Data Structures and Algorithms
Toaster97
Оценок пока нет
A Practical Performance Comparison of Parallel Matrix Multiplication Algorithms On Networks of Workstations
Документ2 страницы
A Practical Performance Comparison of Parallel Matrix Multiplication Algorithms On Networks of Workstations
Phulturoo Khan
Оценок пока нет
Mining Weather Data Using Rattle
Документ6 страниц
Mining Weather Data Using Rattle
ijcsn
Оценок пока нет
Paper 16-Localisation of Numerical Date Field in An Indian Handwritten Document
Документ4 страницы
Paper 16-Localisation of Numerical Date Field in An Indian Handwritten Document
Editor IJACSA
Оценок пока нет
DataStructure SEM3 CE IT Degree
Документ1 страница
DataStructure SEM3 CE IT Degree
Dhwanil Bhatt
Оценок пока нет
Data Mining 2-5
Документ4 страницы
Data Mining 2-5
nirman kumar
Оценок пока нет
7.2 Designs of Algorithm
Документ12 страниц
7.2 Designs of Algorithm
Anisha Bushra Akond
Оценок пока нет
CS3301 Data Stuctures Important Questions
Документ7 страниц
CS3301 Data Stuctures Important Questions
Karnan Suganya
Оценок пока нет
Data Structure & Algorithm Assingment 1
Документ1 страница
Data Structure & Algorithm Assingment 1
sunil kumar
Оценок пока нет
Sir Jamal's Assignment # 2:: Data Structure & Algorithm
Документ10 страниц
Sir Jamal's Assignment # 2:: Data Structure & Algorithm
Syed Jibran Ali Bukhari
Оценок пока нет
Exam I Review List
Документ12 страниц
Exam I Review List
Rylee Simth
Оценок пока нет
F. Y. B. Sc. (Computer Science) Examination - 2010: Total No. of Questions: 5) (Total No. of Printed Pages: 4
Документ76 страниц
F. Y. B. Sc. (Computer Science) Examination - 2010: Total No. of Questions: 5) (Total No. of Printed Pages: 4
Amarjeet Das
Оценок пока нет
Data Structure Laboratory Exercises: Iii Semester Information Science & Enginerring
Документ6 страниц
Data Structure Laboratory Exercises: Iii Semester Information Science & Enginerring
pksharma75
Оценок пока нет
HW2 For Students PDF
Документ5 страниц
HW2 For Students PDF
msk123123
Оценок пока нет
Data Stream Clustering
Документ3 страницы
Data Stream Clustering
john949
Оценок пока нет
ECS305 (OOS) 2nd Sessional
Документ3 страницы
ECS305 (OOS) 2nd Sessional
Ashutosh Singh
Оценок пока нет
K-Means Clustering Method For The Analysis of Log Data
Документ3 страницы
K-Means Clustering Method For The Analysis of Log Data
idescitation
Оценок пока нет
DSE 513: Programming Data Structure & Algorithm Mid Semester Assignment
Документ4 страницы
DSE 513: Programming Data Structure & Algorithm Mid Semester Assignment
tibowe8
Оценок пока нет
Write A Program To Fin Tte Sum of Iumbers II Ai Array Usiig Poiiters
Документ6 страниц
Write A Program To Fin Tte Sum of Iumbers II Ai Array Usiig Poiiters
Adi
Оценок пока нет
Capital University of Science and Technology Department of Computer Science CS 3163: Design and Analysis of Algorithms (3) : Fall 2020
Документ4 страницы
Capital University of Science and Technology Department of Computer Science CS 3163: Design and Analysis of Algorithms (3) : Fall 2020
Malik Naveed
Оценок пока нет
B.SC IT Hons PDF
Документ52 страницы
B.SC IT Hons PDF
Sumit
Оценок пока нет
University of Wah: Department of Computer Science
Документ4 страницы
University of Wah: Department of Computer Science
Marriam Nawaz
Оценок пока нет
DS May 19 Solved
Документ24 страницы
DS May 19 Solved
crazygamernikhil922
Оценок пока нет
Mock Exam
Документ11 страниц
Mock Exam
poker
Оценок пока нет
DS Lab Manual
Документ78 страниц
DS Lab Manual
dhinerao11032005
Оценок пока нет
SQLDM - Implementing K-Means Clustering Using SQL: Jay B.Simha
Документ5 страниц
SQLDM - Implementing K-Means Clustering Using SQL: Jay B.Simha
Moh Ali M
Оценок пока нет
Computer Application ICSE E
Документ5 страниц
Computer Application ICSE E
shauryasahu2004
Оценок пока нет
Gtavm t2 Hye C11a2 Cs SK
Документ7 страниц
Gtavm t2 Hye C11a2 Cs SK
S. Lakshanya
Оценок пока нет
Gujarat Technological University: Instructions
Документ2 страницы
Gujarat Technological University: Instructions
ektasj
Оценок пока нет
Worksheet Summer 2022
Документ2 страницы
Worksheet Summer 2022
Samuel Godad
Оценок пока нет
DST Questions
Документ2 страницы
DST Questions
Keith Tanaka Magaka
Оценок пока нет
COMP 4710 Assignment 1 - Clustering Total Marks
Документ2 страницы
COMP 4710 Assignment 1 - Clustering Total Marks
api-279173920
Оценок пока нет
Tutorial 4
Документ8 страниц
Tutorial 4
POEASO
Оценок пока нет
Using The Confusion Matrix For Improving Ensemble Classifiers
Документ5 страниц
Using The Confusion Matrix For Improving Ensemble Classifiers
Ritu Yadav
Оценок пока нет
Matrix Chain Multiplication
Документ20 страниц
Matrix Chain Multiplication
Harsh Tibrewal
100% (1)
Tutorial Sheet 1 - SIT 433
Документ3 страницы
Tutorial Sheet 1 - SIT 433
Nzelle Bide
Оценок пока нет
Assignment-1 Course Code:CAP205: Date: 08/09/10
Документ9 страниц
Assignment-1 Course Code:CAP205: Date: 08/09/10
Manish Kinwar
Оценок пока нет
Ar 10steps Secinfowatch Us 0612
Документ7 страниц
Ar 10steps Secinfowatch Us 0612
Zahid Mashhood
Оценок пока нет
Lab 6
Документ4 страницы
Lab 6
Samuel Tan
Оценок пока нет
Uipath Guide PDF
Документ6 страниц
Uipath Guide PDF
Ibrahim Syed
Оценок пока нет
Advanced C Concepts and Programming: First Edition
От Everand
Advanced C Concepts and Programming: First Edition
Gayatri
Рейтинг: 3 из 5 звезд
3/5 (1)
CRM
Документ15 страниц
CRM
Pradeep Chintada
Оценок пока нет
Ductle Iron Spec1
Документ8 страниц
Ductle Iron Spec1
윤병택
Оценок пока нет
Hydraulic Cartridge Systems
Документ14 страниц
Hydraulic Cartridge Systems
Jas Sum
Оценок пока нет
Ref: Bboneblk - SRM Beaglebone Black System Reference Manual Rev B
Документ125 страниц
Ref: Bboneblk - SRM Beaglebone Black System Reference Manual Rev B
hernangyc
Оценок пока нет
Lab - 17-WAN Configuration
Документ12 страниц
Lab - 17-WAN Configuration
Muhammad Asghar Khan
100% (1)
Case Study Analysis of Apex Corporation PDF
Документ2 страницы
Case Study Analysis of Apex Corporation PDF
AJ
Оценок пока нет
Scope of Work Diesel Fuel Tank For The Rifle-Garfield County Regional Airport Fuel Farm IFB-GC-AP-01-14 - Diesel Fuel Tank
Документ4 страницы
Scope of Work Diesel Fuel Tank For The Rifle-Garfield County Regional Airport Fuel Farm IFB-GC-AP-01-14 - Diesel Fuel Tank
MS
Оценок пока нет
TDS 9-11SA Mechanical Troubleshooting
Документ34 страницы
TDS 9-11SA Mechanical Troubleshooting
ahmed.kareem.khanjer
Оценок пока нет
Principles of Management 07
Документ8 страниц
Principles of Management 07
knockdwn
Оценок пока нет
Memory QVL 3rd Gen AMD Ryzen Processors PDF
Документ14 страниц
Memory QVL 3rd Gen AMD Ryzen Processors PDF
ნიკო ქარცივაძე
Оценок пока нет
Government College of Engineering SALEM 636011.: Electronics and Communication Engineering Curriculum and Syllabus
Документ111 страниц
Government College of Engineering SALEM 636011.: Electronics and Communication Engineering Curriculum and Syllabus
Salma Mehajabeen Shajahan
Оценок пока нет
CCI Control Valves For Fossil Applications
Документ2 страницы
CCI Control Valves For Fossil Applications
Gabrieldiaz
Оценок пока нет
Confined Spaces: Avoiding Common Mistakes in Gas Detection
Документ1 страница
Confined Spaces: Avoiding Common Mistakes in Gas Detection
trravi1983
Оценок пока нет
6.hydraulic Pressure Spesification
Документ3 страницы
6.hydraulic Pressure Spesification
TLK Channel
Оценок пока нет
Sanjay Project
Документ41 страница
Sanjay Project
Prynka Rawat
Оценок пока нет
08L76 HR3 A21
Документ6 страниц
08L76 HR3 A21
liebofreak
Оценок пока нет
Plotting in Matlab
Документ7 страниц
Plotting in Matlab
pride3351
Оценок пока нет
Terminal - Exam - Section 4A-4B - Signal & System FA14 COMSAT
Документ3 страницы
Terminal - Exam - Section 4A-4B - Signal & System FA14 COMSAT
Ali Raza
Оценок пока нет
Cs9152 DBT Unit IV Notes
Документ61 страница
Cs9152 DBT Unit IV Notes
Nivitha
Оценок пока нет
Adopter Categories
Документ6 страниц
Adopter Categories
Caroline Mputhia
100% (1)
Global Deduplication Array Administration Guide: DD OS 5.0
Документ70 страниц
Global Deduplication Array Administration Guide: DD OS 5.0
Rajesh Kumar
Оценок пока нет
Rescue Boat Lsa 5.1
Документ4 страницы
Rescue Boat Lsa 5.1
Celal Bozdogan
Оценок пока нет
Ad Agency Synopsis
Документ19 страниц
Ad Agency Synopsis
Raj Bangalore
Оценок пока нет
VirtualHost Examples - Apache HTTP Server
Документ9 страниц
VirtualHost Examples - Apache HTTP Server
SaitejaTallapelly
Оценок пока нет
Ms 95-2018 Solved Assignment
Документ15 страниц
Ms 95-2018 Solved Assignment
Pramod Shaw
Оценок пока нет
Technology and Culture - Reading
Документ3 страницы
Technology and Culture - Reading
Braulio Pezantes
100% (1)
Ecofracsmart: A New Stock-Preparation Process For Testliner
$Ecofracsmart: A New Stock-Preparation Process For Testliner$
Документ14 страниц
Ecofracsmart: A New Stock-Preparation Process For Testliner
Hgagselim Selim
Оценок пока нет
Abu Dhabi Certification Scheme For Assistant Engineer
Документ12 страниц
Abu Dhabi Certification Scheme For Assistant Engineer
suresh
Оценок пока нет