Академический Документы
Профессиональный Документы
Культура Документы
REGRESSION
Correlation and regression (linear) are the most
commonly used techniques for investigating the
relationship between two quantitative variables.
• The second main use for correlation and regression is to see whether two variables
are associated, without necessarily inferring a cause-and-effect relationship. In this
case, neither variable is determined by the experimenter; both are naturally
variable. If an association is found, the inference is that variation in X may cause
variation in Y, or variation in Y may cause variation in X, or variation in some
other factor may affect both X and Y.
• The third common use of regression (linear) is estimating the value of one variable
corresponding to a particular value of the other variable.
CORRELATION
CORRELATION COEFFICIENT:
A) Pearson Product-Moment Correlation is one of the
measures of correlation which quantifies the strength as
well as direction of such relationship. It is usually
denoted by Greek letter ρ.
CONDITIONS
This coefficient is used if two conditions are satisfied
• the variables are in the interval or ratio scale of measurement
• a linear relationship between them is suspected
POSITIVE AND NEGATIVE CORRELATION
The coefficient (ρ) is computed as the ratio of covariance between the
variables to the product of their standard deviations. This formulation is
advantageous.
ρ (X,Y)=
rs = correlation coefficient,
In general,
• rs > 0 implies positive agreement among ranks
• rs < 0 implies negative agreement (or agreement in the reverse direction)
• rs = 0 implies no agreement
Closer rs is to 1, better is the agreement while rs closer to -1 indicates strong
agreement in the reverse direction.
SIGNIFICANCE OF CORRELATION
(b) The percent of variation in y that is accounted for by the regression is computed as the
coefficient of determination (r2) multiplied by 100. The value of Sse has been computed in
Table
11.1.
Coefficient of determination R2 = 1 – Sse / Syy = 1 – 163.073/315.251=0.483.
The coefficient of correlation (r) = square root of coefficient of determination= (0.483)1/2 = 0.695.
Thus, nearly 66 percent of variation in y is explained by the regression equation. The remaining
34 percent variation is due to unexplained causes.