Академический Документы
Профессиональный Документы
Культура Документы
MANAGEMENT
DOCUMENT ON LINEAR REGRESSION QUESTIONS
Submitted By:
SHALU (MFM/18/923)
(MFM-III)
Submitted To:
1
ACKNOWLEDGEMENT
In preparation of my assignment, I had to take the help and guidance of some respected persons,
who deserve my deepest gratitude. As the completion of this assignment gave us much pleasure, I
would like to show our gratitude to Asst. Professor Mr. Kislay Kashyap, our faculty of National
Institute of Fashion Technology, Patna for giving us good guidelines for our assignment
throughout numerous consultations. I would also like to expand my gratitude to all those who have
directly and indirectly guided us in writing this assignment. I express our warm thanks to our
classmates, juniors and local people have made valuable comments and suggestions on this paper
which gave us an inspiration to improve the quality of the assignment. I would not forget to thank
them for their encouragement and more over for their timely support and guidance till the
completion of our project work.
2
QUESTIONS: -
Use the data in WAGE file to estimate a simple regression explaining monthly salary (wage) in
terms of IQ score (IQ).
(1) Find the average salary and average IQ in the sample. What is the standard deviation of IQ?
3
Average salary is about Rs.957.95 and average IQ is about 101.28. The sample standard
deviation of IQ is about 15.05, which is pretty close to the population value of 15.
(2) Estimate a simple regression model. Use this model to find the predicted increase in wage
for an increase in IQ of 15 points. Does IQ explain most of the variation in wage?
Answer to the question no. (2)
4
Simple Linear regression
Linear regression answers a simple question: Here we Can measure an exact relationship between
one target variables and a set of predictors i.e. Wages and IQ.
The simplest of probabilistic models is the straight-line model:
where
y = Dependent variable (Wages)
x = Independent variable (IQ)
= random error component
= intercept
= Coefficient of x
R-squared is a statistical measure of how close the data are to the fitted regression line. It is also
known as the coefficient of determination.
The more variance that is accounted for by the regression model the closer the data points will fall
to the fitted regression line. Theoretically, if a model could explain 100% of the variance, the fitted
values would always equal the observed values and, therefore, all the data points would fall on the
fitted regression line.
5
Consider the following plot:
The line showed by the left arrow is the regression line that depicts where the predicted values of
Salary lies with respect to IQ along the x-axis. The upper dotted point represents the actual values
of the salary which is the observed y value with respect to IQ. The middle-dotted point represents
the predicted value of salary for an observed value of experience which is denoted by y^.
IQ is not the only factor that affects wages. As shown in Figure 4, even for workers with the same
IQ, there is remarkable variation in wages. Surely, some of this variation is due to work experience,
hours, education, age, tenure, and marital status, etc.
For example, one could think of wages as a function of IQ and work experience:
Wage = f (IQ, Experience)
The longer one spends on a job, the better one gets. If people are paid for their productivity, then
workers with more work experience should be more productive, and therefore, paid more.
ΔWage / ΔExperience > 0
6
From the analysis we can observe that R²= 0.095. It means that variation in IQ explains only 9.5%
of variables in wage, so the level of dependency between the independent variable i.e. IQ and the
the dependent variable i.e. Wage is only 9.5%. Hence, they are not closely related.
7
8
9