Академический Документы
Профессиональный Документы
Культура Документы
CORRELATION
ANALYSIS
Introduction
Correlation a LINEAR association between two random variables
Type
s
Simple Multiple
Partial
100 100
S ymptom
S ymptom
80 80
Index
Index
60 60
40
40
20
20
0
0 50 100 150 200 250 0
0 50 100 150 250
D r u g A (dose i n m g ) 200
Drug B (dose in mg)
Represented by “r”
r lies between +1 to -1
y ) 2
2
Y)2
Shared variability of X and Y variables on the
top
Individual variability of X and Y variables on the
bottom
Problem
Find the value of the correlation coefficient from the following table:
Step 1:Make a chart. Use the given data, and add three
more columns: xy, x2, and y2
Glucos
Subject Age x e Level xy x2 y2
y
1 43 99
2 21 65
3 25 79
4 42 75
5 57 87
6 59 81
• The range of the correlation coefficient is from -1 to 1.
• Our result is 0.5298 or 52.98%, which means the variables have a
moderate positive correlation.
Interpreting Correlation
Coefficient r
strong correlation: r > .70 or r < –.70
moderate correlation: r is between .30
&
.70
or r is between –.30 and –.70
weak correlation: r is between 0 and .
30 or r is between 0 and –.30 .
Spearman’s Rank Correlation
Ranks are not given but grades are given but not
repeated
repeated
where n is the number of data points of the two variables and di is the
difference in the ranks of the ith element of each random variable considered.
students who have free university meals and their CGPA scores.
Pune 14.4 54
Chennai 7.2 64
Delhi 27.5 44
Kanpur 33.8 32
Ahmedabad 38.0 37
Indore 15.9 68
Guwahati 4.9 62
STEP 1
Pune 3 4 -1 1
Chennai 2 6 -4 16
Delhi 5 3 2 4
Kanpur 6 1 5 25
Ahmedab 7 2 5 25
ad
Indore 4 7 -3 9
Guwahati 1 5 -4 16
Σd2 = 96
Interpretation
Represented by r2
The coefficient of determination is a measure of
how well the regression line represents the data