
TEST 1 BFM4633 DATA ANALYTIC

MUHAMMAD HUZAIFI BIN YUSNIMAN

FB15029

Question 1

A) How to load data and visualize data

1. Open Orange and click “New” to start a blank workflow.


2. Click the “File” widget to add it to the canvas; it is used to load data stored on your computer.
3. Double-click the File widget to open it.
4. Click the browse button and select the data file you want to load. The widget then
lists all the features in the selected file.

5. Next, add a “Data Table” widget. Connect File to Data Table by dragging from the dashed
line at the edge of the File widget; a connection line appears between the two.
6. Once they are connected, double-click Data Table and a pop-up window appears
showing the data you loaded.
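
The same load-and-inspect steps can also be sketched in a script. This is a minimal illustration and not part of the Orange workflow itself: it assumes the data is a comma-separated file with a header row, and the names `load_table` and `preview` are made up for this example.

```python
import csv


def load_table(path):
    """Read a CSV file and return (feature_names, rows)."""
    with open(path, newline="") as handle:
        reader = csv.reader(handle)
        header = next(reader)            # first line holds the feature names
        rows = [row for row in reader]   # every remaining line is one instance
    return header, rows


def preview(header, rows, limit=5):
    """Return the header and the first `limit` rows as printable text."""
    lines = [", ".join(header)]
    lines += [", ".join(row) for row in rows[:limit]]
    return "\n".join(lines)
```

Calling `load_table` on your own file and printing `preview(header, rows)` gives a rough text version of the Data Table view; `len(rows)` and `len(header)` give the number of instances and features.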
B) Examine and identify

i. There are 420 instances available in the data.

ii. There are 25 features:

• Axis ratio
• Circularity
• Rectangularity
• Form factor
• Perimeter
• Contrast
• Correlation
• Energy
• Homogeneity
• Entropy
• Av1/A, Av2/A, Av3/A, Av4/A, Av4/Av1
• Area, Convex Area
• Width, Length, Diameter
• Solidity
• Perimeter ratio of diameter
• Perimeter ratio of length
• Apex Angle
• Variety

iii. Clusters available.


C)

D)
QUESTION 2

A)

f3

Mean = (33.6 + 23.3 + 31.0 + 23.3 + 25.6 + 31.0 + 35.5) / 7

= 29.04

Median

23.3, 23.3, 25.6, 31.0, 31.0, 33.6, 35.5

= 31.0
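
The two summary values for f3 (the sum divided by 7, and the middle of the sorted list) can be checked with a short script. This is only a sketch using Python's standard `statistics` module; the list is copied from the sum above, and the same calls apply to the f4 values.

```python
from statistics import mean, median

# f3 values as they appear in the sum above
f3 = [33.6, 23.3, 31.0, 23.3, 25.6, 31.0, 35.5]

f3_mean = round(mean(f3), 2)   # arithmetic average, rounded to 2 decimals
f3_median = median(f3)         # middle value of the sorted list

print(f3_mean, f3_median)
```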

Linear method

(x - 33.6) / (23.3 - 33.6) = (31 - 50) / (32 - 50)

x ≈ 22.73

f4

Mean = (50 + 31 + 32 + 21 + 32 + 26 + 29) / 7

= 31.57

Median

21, 26, 29, 31, 32, 32, 50


= 31

Linear method

(25.6 - 23.3) / (31.0 - 23.3) = (x - 32) / (26 - 32)

x ≈ 30.2, or 30 to the nearest whole number
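
Both linear-method results can be reproduced with one small helper. This is a sketch, not necessarily the exact procedure intended by the question: the function name `interpolate` is illustrative, and the pairing of neighbouring (f3, f4) values is read off the two proportions above.

```python
def interpolate(x0, y0, x1, y1, x):
    """Value at x on the straight line through (x0, y0) and (x1, y1)."""
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


# Missing f3 value: known points (f4=50, f3=33.6) and (f4=32, f3=23.3), target f4=31
f3_missing = interpolate(50, 33.6, 32, 23.3, 31)

# Missing f4 value: known points (f3=23.3, f4=32) and (f3=31.0, f4=26), target f3=25.6
f4_missing = interpolate(23.3, 32, 31.0, 26, 25.6)

print(round(f3_missing, 2), round(f4_missing, 2))
```

Note that in the f3 case the target f4 = 31 lies just outside the range 32 to 50, so the straight line is being extended slightly beyond the two known points.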
B)

i) Redundancy in the features

ii) Data redundancy in same cluster data

C) Normalize

Standardize
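
As an illustration of the two options, the sketch below assumes "Normalize" means min-max scaling to the interval [0, 1] and "Standardize" means z-scores (mean 0, standard deviation 1, using the population standard deviation). Both readings, and the function names, are assumptions; the f4 values from Question 2 A) serve as sample data.

```python
from statistics import mean, pstdev


def min_max_normalize(values):
    """Rescale values linearly so the smallest becomes 0 and the largest 1."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def z_score_standardize(values):
    """Shift and scale values to mean 0 and population standard deviation 1."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]


f4 = [50, 31, 32, 21, 32, 26, 29]
normalized = min_max_normalize(f4)
standardized = z_score_standardize(f4)
```

Normalizing keeps every value inside a fixed range, while standardizing expresses each value as its distance from the mean in standard deviations.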
