Вы находитесь на странице: 1из 6

COLEGIO DE DAGUPAN

SCHOOL OF PROFESSIONAL STUDIES


DOCTOR OF PHILOSOPHY
Major in Educational Leadership and Management

Course Code: DELM 211


Course Title: ADVANCE STATISTICS
Discussant: MS. EDITHA I. BALLESTEROS
Professor: DR. EUGENIA B. RAMOS

TOPIC/S: Measures of Significant Relationship


 Calculating of the Coefficient of Correlation by the Product Moment
Method
 Characteristics and Uses
 Advantages and Dis Advantages
 Formula/ Equations

Pearson Product-Moment Correlation Coefficient

The Pearson correlation coefficient (also known as Pearson product-moment


correlation coefficient) r is a measure to determine the relationship (instead of
difference) between two quantitative variables (interval/ratio) and the degree to which
the two variables coincide with one another—that is, the extent to which two variables
are linearly related: changes in one variable correspond to changes in another
variable. In fact, a variety of different correlation coefficients (such as phi correlation
coefficient, point-biserial correlation, Spearman’s rho, partial correlation, and part
correlation) have been developed over the years for measuring relationships between
sets of data, and the Pearson correlation coefficient (also referred to Pearson’s r) is the
most common measure of correlation and has been widely used in the sciences as a
measure of the degree.

The Pearson product-moment correlation coefficient (Pearson’s correlation,


for short) is a measure of the strength and direction of association that exists between
two variables measured on at least an interval scale.
A Pearson’s correlation attempts to draw a line of best fit through the data of
two variables, and the Pearson correlation coefficient, r, indicates how far away all
these data points are from this line of best fit (i.e., how well the data points fit this
model/line of best fit).
Note: If one of your two variables is dichotomous you can use a point-biserial
correlation instead, or if you have one or more control variables, you can run a
Pearson's partial correlation.
The formula for Pearson's r is:
Pearson Correlation - Formula
If we want to inspect correlations, we'll have a computer calculate them for us. You'll
rarely (probably never) need the actual formula. However, for the sake of
completeness, a Pearson correlation between variables X and Y is calculated by
rXY=∑ni=1(Xi−X¯¯¯¯)(Yi−Y¯¯¯¯)∑ni=1(Xi−X¯¯¯¯)2−−−−−−−−√∑ni=1(Yi−Y¯¯¯
¯)2−−−−−−−−−−−−√rXY=∑i=1n(Xi−X¯)(Yi−Y¯)∑i=1n(Xi−X¯)2∑i=1n(Yi−Y¯)2

The formula basically comes down to dividing the covariance by the product
of the standard deviations. Since a coefficient is a number divided by some other
number our formula shows why we speak of a correlation coefficient
You can see in the numerator involves finding the Z scores for each of the x and y
coordinates, multiplying those Z scores together, and finally adding them all up. And
then you get to divide by n – 1.
One further note – Generally the two variables play specific roles in a
bivariate study. One variable, x, is the independent variable, while y is the dependent
variable . However, when computing Pearson's r, these roles do not matter. You will
get the same correlation value even if you reverse the two variables.
Pearson’s Product Moment Correlation (r) The Pearson’s product Moment
Correlation coefficient is a measure of the strength and direction of association
that exists between two variables measured on at least an interval scale.
A Pearson’s correlation attempts to draw a line of best fit through the data of
two variables, and the Pearson’s correlation coefficient, r, indicates how far away
all these data points are from this line of best fit.
When the Pearson’s correlation is to be used, one must make necessary
checks to ensure that the Pearson’s correlation is the appropriate statistic. The
way to do this is to ensure the following four assumptions are passed:
1. The two variables must be measured at the interval or ratio scale.
2. There is a linear relationship between the two variables.
3. There should be no significant outliers. Outliers are single data points
within your data that do not follow the usual pattern.
4. The data should be approximately normally distributed.

FORMULA/EQUATIONS

Computing the Pearson’s Given the bivariate set (x1, y1), (x2, y2), . . .
, (xn, yn), the Pearson‟s Product Moment Correlation Coefficient (r) is defined as
where r = Pearson‟s Product Moment Correlation Coefficient
Sxy = Covariance of x and y values (scores)
Sx and Sy = Standard deviations of x and y values (scores) respectively. Given
the above relationship, the Pearson‟s Product Moment Correlation Coefficient (r) can
be written as where r = Pearson‟s Product Moment Correlation Coefficient
N = Number of pairs of values or scores
= Sum of the products of x and y
= Mean of the x values (or x scores)
= Mean of the y values (or y scores)
= Product of the mean values (scores) of x and y
= Sum of squares of x values (or x scores)
= Sum of squares of y values (or y scores)

Types of correlation coefficient formulas.

There are several types of correlation coefficient formulas.


One of the most commonly used formulas in stats is Pearson’s correlation coefficient
formula. If you’re taking a basic stats class, this is the one you’ll probably use:
*Pearson correlation *Pearson correlation coefficient
Two other formulas are commonly used: the sample correlation coefficient and the
population correlation coefficient.

Sample question: Find the value of the correlation coefficient from the following table:
SUBJECT AGE X GLUCOSE LEVEL Y

1 43 99
2 21 65
3 25 79
4 42 75
5 57 87
6 59 81

Step 1:Make a chart. Use the given data, and add three more columns: xy, x2, and y2.

SUBJECT AGE X GLUCOSE LEVEL Y XY X2 Y2

1 43 99
2 21 65
3 25 79
4 42 75
5 57 87
6 59 81

Step 2: Multiply x and y together to fill the xy column. For example, row 1 would be 43 × 99 = 4,257.

SUBJECT AGE X GLUCOSE LEVEL Y XY X2 Y2


1 43 99 4257
2 21 65 1365
3 25 79 1975
4 42 75 3150
5 57 87 4959
6 59 81 4779

Step 3: Take the square of the numbers in the x column, and put the result in the x2 column.

SUBJECT AGE X GLUCOSE LEVEL Y XY X2 Y2

1 43 99 4257 1849
2 21 65 1365 441
3 25 79 1975 625
4 42 75 3150 1764
5 57 87 4959 3249
6 59 81 4779 3481

Step 4: Take the square of the numbers in the y column, and put the result in the y2 column.
SUBJECT AGE X GLUCOSE LEVEL Y XY X2 Y2

1 43 99 4257 1849 9801


2 21 65 1365 441 4225
3 25 79 1975 625 6241
4 42 75 3150 1764 5625
5 57 87 4959 3249 7569
6 59 81 4779 3481 6561
Step 5: Add up all of the numbers in the columns and put the result at the bottom of the column.

The Greek letter sigma (Σ) is a short way of saying “sum of.”
SUBJECT AGE X GLUCOSE LEVEL Y XY X2 Y2
1 43 99 4257 1849 9801
2 21 65 1365 441 4225

3 25 79 1975 625 6241

4 42 75 3150 1764 5625

5 57 87 4959 3249 7569

6 59 81 4779 3481 6561

Σ 247 486 20485 11409 40022

Step 6: Use the following correlation coefficient formula.

The answer is: 2868 / 5413.27 = 0.529809

ADVANTAGE and DISADVANTAGES

Pearson’s product moment correlation co efficient, or Pearson’s r was


developed by Karl Pearson (1948) from a related idea introduced by Sir Francis
Galton in the late 1800’s. In addition to being the first of the correlational measures to
be developed, it is also the most commonly used measure of association. All
subsequent correlation measures have been developed from Pearson’s equation and
are adaptations engineered to control for violations of the assumptions that must be
met in order to use Pearson’s equation (Burns & Grove, 2005; Polit & Beck, 2006).
Pearson’s r measures the strength, direction and probability of the linear association
between two interval or ratio variables. Pearson’s Product Moment Correlation
Coefficient - Pearson’s r Pearson’s r is a measure of the linear relationship between
two interval or ratio variables, and can have a value between -1 and 1. It is the same
measure as the point-biserial correlation; a measure of the relationship between a
dichotomous (yes or no, male or female) and an interval/ratio variable (Cramer,
1998).

The advantage of using Pearson’s r is that it is a simple way to assess the


association between two variables; whether they share variance (covary), if the
relationship is positive or negative, and the degree to which they correlate. The
disadvantages of using Pearson’s r is that it cannot identify relationships that are not
linear, and may show a correlation of zero when the correlation has a relationship
other than a linear one. Additionally the types of variable that can be evaluated are
limited. In addition to Pearson’s r, semi-partial and partial correlation can be
employed in order to estimate the relationship between an outcome and predictor
variable after controlling for the effects of additional predictors in the equation.

REFERENCES:
https://statistics.laerd.com/spss-tutorials/pearsons-product-moment-correlation-using-spss-
statistics.php
https://www.anthonypicciano.com/Relationship.html
https://www.spss-tutorials.com/pearson-correlation-coefficient/

Вам также может понравиться