Вы находитесь на странице: 1из 53

18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

i LOGIN / REGISTER (HTTPS://ID.ANALYTICSVIDHYA.COM/ACCOUNTS/LOGIN/?

NEXT=HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/2016/12/45-QUESTIONS-TO-TEST-A-DATA-SCIENTIST-ON-REGRESSION-SKILL-TEST-

REGRESSION-SOLUTION/)

(https://www.analyticsvidhya.com/blog/)

(https://analyticsvidhya.com/datahack-summit-2019/?
utm_source=blog&utm_medium=topBanner&utm_campaign=DHS2019)

MACHINE LEARNING (HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/CATEGORY/MACHINE-LEARNING/) Reply


PYTHON (HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/CATEGORY/PYTHON-2/)

R (HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/CATEGORY/R/)
-
45 questions to test a Data Scientist on Regression (Skill
Your Ultimate pathtest
for – Regression
Becoming
Solution) a DATA Scientist!
Download this learning path to start your data
science journey.
ANKIT GUPTA (HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/AUTHOR/FACEBOOK_USER_4/), DECEMBER 19, 2016 LOGIN TO BOOKMA…

Email Id

(https://courses.analyticsvidhya.com/bundles/certified-machine-learning-master-s-program/?
 Download Resource
utm_source=AVBannerbelowtitle&utm_medium=display&utm_campaign=AMLprogram) k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 1/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Introduction

Regression is much more than just linear and logistic regression. It includes many techniques for modeling and
analyzing several variables. This skill test was designed to test your conceptual and practical knowledge of
various regression techniques.

A total of 1845 number of people participated in the test. I am sure they all will agree it was the best skill
assessment test on regression they have come across.

If you are one of those who missed out on this skill test, then you did miss out on the real time test. But here the
questions with detailed solutions, find out how many you could have answered correctly.

(https://datahack.analyticsvidhya.com/contest/skilltest-regression/)

Overall Scores

Below are the distribution of scores, this will help you evaluate your performance: -
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.

Email Id

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 2/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

(http://https://datahack.analyticsvidhya.com/contest/skilltest-regression/lb)

You can assess your performance here (https://datahack.analyticsvidhya.com/contest/skilltest-regression/lb).


Around 530 people participate in the skilltest and the highest score was 38. Here are a few statistics about the
distribution.

Mean Score: 23.15

Median Score: 23

Mode Score: 23

Helpful Resources for Regression -


Your Ultimate
In case you want to revise your knowledge, here are a few resources for brushingpath for
up your Becoming
knowledge on
Regression. a DATA Scientist!
Download this learning path to start your data
Going Deeper into Regression Analysis with Assumptions, Plotsscience
& Solutions
journey.
(https://www.analyticsvidhya.com/blog/2016/07/deeper-regression-analysis-assumptions-plots-solutions/)

5 Questions which can teach you Multiple Regression (with R and Python)
(https://www.analyticsvidhya.com/blog/2015/10/regression-python-beginners/)
Email Id
7 Types of Regression Techniques you should know
(https://www.analyticsvidhya.com/blog/2015/08/comprehensive-guide-regression/)
 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 3/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Simple Guide to Logistic Regression in R (https://www.analyticsvidhya.com/blog/2015/11/beginners-guide-on-


logistic-regression-in-r/)

A Complete Tutorial on Ridge and Lasso Regression in Python


(https://www.analyticsvidhya.com/blog/2016/01/complete-tutorial-ridge-lasso-regression-python/)

Using Platt Scaling and Isotonic Regression to Minimize LogLoss Error in R


(https://www.analyticsvidhya.com/blog/2016/07/platt-scaling-isotonic-regression-minimize-logloss-error/)

Q1. Which of the following step / assumption in regression modeling impacts the trade-off between
under-fitting and over-fitting the most.

A. The polynomial degree

B. Whether we learn the weights by matrix inversion or gradient descent

C. The use of a constant-term

Solution: A

Choosing the right degree of polynomial plays a critical role in fit of regression. If we choose higher degree of
polynomial, chances of overfit increase significantly.

Q2. Suppose you have the following data with one real-value input variable & one real-value output -
variable. What is leave-one out cross validation mean square error in case of linear regression (Y =
Your Ultimate path for Becoming
bX+c)?
a DATA Scientist!
Download this learning path to start your data
science journey.

A. 10/27
Email Id

k
B. 20/27
 Download Resource
C. 50/27

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 4/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

D. 49/27

Solution: D

We need to calculate the residuals for each cross validation point. After fitting the line with 2 points and leaving 1
point for cross validation.

Leave one out cross validation mean square error = (2^2 +(2/3)^2 +1^2) /3 = 49/27

Q3. Which of the following is/ are true about “Maximum Likelihood estimate (MLE)”?

1. MLE may not always exist


2.
3.
MLE always exists
If MLE exist, it (they) may not be unique
-
4. If MLE exist, it (they) must be unique Your Ultimate path for Becoming
a DATA Scientist!
A. 1 and 4
Download this learning path to start your data
science journey.
B. 2 and 3

C. 1 and 3

D. 2 and 4 Email Id

Solution: C
 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 5/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

The MLE may not be a turning point i.e. may not be a point at which the first derivative of the likelihood (and log-
likelihood) function vanishes.

* The MLE may not be unique.

-
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.

Email Id
Q4. Let’s say, a “Linear regression” model perfectly fits the training data (train error is zero). Now, Which

k
of the following statement is true?
 Download Resource
A. You will always have test error zero

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 6/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

B. You can not have test error zero

C. None of the above

Solution: C

Test error may be zero if there no noise in test data. In other words, it will be zero, if the test data is perfect
representative of train data but not always.

Q5. In a linear regression problem, we are using “R-squared” to measure goodness-of-fit. We add a
feature in linear regression model and retrain the same model.

Which of the following option is true?

A. If R Squared increases, this variable is significant.

B. If R Squared decreases, this variable is not significant.

C. Individually R squared cannot tell about variable importance. We can’t say anything about it right now.

D. None of these.

Solution: C

“R squared” individually can’t tell whether a variable is significant or not because each time when we add a
feature, “R squared” can either increase or stay constant. But, it is not true in case of “Adjusted R squared” -
(increases when features found to be significant).
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
Q6. Which one of the statement is true regarding residuals science journey. analysis?
in regression

A. Mean of residuals is always zero

B. Mean of residuals is always less than zero


Email Id

k
C. Mean of residuals is always greater than zero
 Download Resource
D. There is no such rule for residuals.

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 7/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Solution: A

Sum of residual in regression is always zero. It the sum of residuals is zero, the ‘Mean’ will also be zero.

Q7. Which of the one is true about Heteroskedasticity?

A. Linear Regression with varying error terms

B. Linear Regression with constant error terms

C. Linear Regression with zero error terms

D. None of these

Solution: A

The presence of non-constant variance in the error terms results in heteroskedasticity. Generally, non-constant
variance arises because of presence of outliers or extreme leverage values.

You can refer this article (https://www.analyticsvidhya.com/blog/2016/07/deeper-regression-analysis-


assumptions-plots-solutions/) for more detail about regression analysis.

Q8. Which of the following indicates a fairly strong relationship between X and Y?
-
A. Correlation coefficient = 0.9
Your Ultimate path for Becoming
a DATA Scientist!
B. The p-value for the null hypothesis Beta coefficient =0 is 0.0001
Download this learning path to start your data
C. The t-statistic for the null hypothesis Beta coefficient=0 is 30science journey.

D. None of these

Solution: A
Email Id

k
Correlation between variables is 0.9. It signifies that the relationship between variables is fairly strong.
 Download Resource
On the other hand, p-value and t-statistics merely measure how strong is the evidence that there is non zero
association. Even a weak effect can be extremely significant given enough data.

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 8/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Q9. Which of the following assumptions do we make while deriving linear regression parameters?

1. The true relationship between dependent y and predictor x is linear


2. The model errors are statistically independent
3. The errors are normally distributed with a 0 mean and constant standard deviation
4. The predictor x is non-stochastic and is measured error-free

A. 1,2 and 3.

B. 1,3 and 4.

C. 1 and 3.

D. All of above.

Solution: D

When deriving regression parameters, we make all the four assumptions mentioned above. If any of the
assumptions is violated, the model would be misleading.

Q10. To test linear relationship of y(dependent) and x(independent) continuous variables, which of the
following plot best suited?

A. Scatter plot -
Your Ultimate path for Becoming
B. Barchart
a DATA Scientist!
C. Histograms Download this learning path to start your data
science journey.
D. None of these

Solution: A

Email Id
To test the linear relationship between continuous variables Scatter plot is a good option. We can find out how

k
one variable is changing w.r.t. another variable. A scatter plot displays the relationship between two quantitative
variables.  Download Resource

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 9/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Q11. Generally, which of the following method(s) is used for predicting continuous dependent variable?

1. Linear Regression
2. Logistic Regression

A. 1 and 2

B. only 1

C. only 2

D. None of these.

Solution: B

Logistic Regression is used for classification problems. Regression term is misleading here.

Q12. A correlation between age and health of a person found to be -1.09. On the basis of this you would
tell the doctors that:

A. The age is good predictor of health

B. The age is poor predictor of health


-
C. None of these
Your Ultimate path for Becoming
Solution: C a DATA Scientist!
Download this learning path to start your data
science journey.
Correlation coefficient range is between [-1 ,1]. So -1.09 is not possible.

Q13. Which of the following offsets, do we use in case of least square line fit? Suppose horizontal axis is
Email Id
independent variable and vertical axis is dependent variable.

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 10/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

A. Vertical offset

B. Perpendicular offset

C. Both but depend on situation

D. None of above

Solution: A

We always consider residual as vertical offsets. Perpendicular offset are useful in case of PCA.

Q14. Suppose we have generated the data with help of polynomial regression of degree 3 (degree 3 will
perfectly fit this data). Now consider below points and choose the option based on these points. -
1. Simple Linear regression will have high bias and low variance
Your Ultimate path for Becoming
2. Simple Linear regression will have low bias and highavariance
DATA Scientist!
3. Download this learning path to start your data
polynomial of degree 3 will have low bias and high variance
4. science journey.
Polynomial of degree 3 will have low bias and Low variance

A. Only 1

B. 1 and 3
Email Id

k
C. 1 and 4
 Download Resource
D. 2 and 4

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 11/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Solution: C

If we fit higher degree polynomial greater than 3, it will overfit the data because model will become more
complex. If we fit the lower degree polynomial less than 3 which means that we have less complex model so in
this case high bias and low variance. But in case of degree 3 polynomial it will have low bias and low variance.

Q15. Suppose you are training a linear regression model. Now consider these points.

1. Overfitting is more likely if we have less data


2. Overfitting is more likely when the hypothesis space is small

Which of the above statement(s) are correct?

A. Both are False

B. 1 is False and 2 is True

C. 1 is True and 2 is False

D. Both are True

Solution: C

1.With small training dataset, it’s easier to find a hypothesis to fit the training data exactly i.e. overfitting.

2. We can see this from the bias-variance trade-off. When hypothesis space is small, it has higher bias and lower -
variance. So with a small hypothesis space, it’s less likely to find a hypothesis to fit the data exactly i.e.
Your Ultimate path for Becoming
underfitting.
a DATA Scientist!
Download this learning path to start your data
science journey.
Q16. Suppose we fit “Lasso Regression” to a data set, which has 100 features (X1,X2…X100). Now, we
rescale one of these feature by multiplying with 10 (say that feature is X1), and then refit Lasso
regression with the same regularization parameter.

Email Id
Now, which of the following option will be correct?

A. It is more likely for X1 to be excluded from the model  Download Resource k


https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 12/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

B. It is more likely for X1 to be included in the model

C. Can’t say

D. None of these

Solution: B

Big feature values =⇒ smaller coefficients =⇒ less lasso penalty =⇒ more likely to have be kept

Q17. Which of the following is true about “Ridge” or “Lasso” regression methods in case of feature
selection?

A. Ridge regression uses subset selection of features

B. Lasso regression uses subset selection of features

C. Both use subset selection of features

D. None of above

Solution: B

“Ridge regression” will use all predictors in final model whereas “Lasso regression” can be used for feature
selection because coefficient values can be zero. For more detail click here
(https://discuss.analyticsvidhya.com/t/difference-between-ridge-regression-and-lasso-and-its-effect/3000). -
Your Ultimate path for Becoming
a DATA Scientist!
Q18. Which of the following statement(s) can be true post Download
adding a this learning
variable in path to start
a linear your data model?
regression
science journey.
1. R-Squared and Adjusted R-squared both increase

2. R-Squared increases and Adjusted R-squared decreases

Email Id
3. R-Squared decreases and Adjusted R-squared decreases

4. R-Squared decreases and Adjusted R-squared increases


 Download Resource k
A. 1 and 2

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 13/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

B. 1 and 3

C. 2 and 4

D. None of the above

Solution: A

Each time when you add a feature, R squared always either increase or stays constant, but it is not true in case
of Adjusted R squared. If it increases, the feature would be significant.

Q19. The following visualization shows the fit of three different models (in blue line) on same training
data. What can you conclude from these visualizations?

1. The training error in first model is higher when compared to second and third model.
2. The best model for this regression problem is the last (third) model, because it has minimum
training error.
3. The second model is more robust than first and third because it will perform better on unseen data. -
4. The third model is overfitting data as compared to first
YourandUltimate
second model.
path for Becoming
5. All models will perform same because we have not seen the test data.
a DATA Scientist!
Download this learning path to start your data
A. 1 and 3
science journey.

B. 1 and 3

C. 1, 3 and 4
Email Id
D. Only 5

Solution: C  Download Resource k


https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 14/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

The trend of the data looks like a quadratic trend over independent variable X. A higher degree (Right graph)
polynomial might have a very high accuracy on the train population but is expected to fail badly on test dataset.
But if you see in left graph we will have training error maximum because it under-fits the training data.

Q20. Which of the following metrics can be used for evaluating regression models?

1. R Squared

2. Adjusted R Squared

3. F Statistics

4. RMSE / MSE / MAE

A. 2 and 4.

B. 1 and 2.

C. 2, 3 and 4.

D. All of the above.

Solution: D

These (R Squared, Adjusted R Squared, F Statistics , RMSE / MSE / MAE ) are some metrics which you can use
to evaluate your regression model. -
Your Ultimate path for Becoming
a DATA Scientist!
Downloadwith
Q21. We can also compute the coefficient of linear regression this learning
the helppath
oftoan
start your data method
analytical
science
called “Normal Equation”. Which of the following is/are true aboutjourney.
“Normal Equation”?

1. We don’t have to choose the learning rate


2. It becomes slow when number of features is very large
3. No need to iterate
Email Id

k
A. 1 and 2
 Download Resource
B. 1 and 3.

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 15/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

C. 2 and 3.

D. 1,2 and 3.

Solution: D

Instead of gradient descent, Normal Equation can also be used to find coefficients. Refer this article
(http://eli.thegreenplace.net/2014/derivation-of-the-normal-equation-for-linear-regression/) for read more about
normal equation.

Q22. The expected value of Y is a linear function of the X(X1,X2….Xn) variables and regression line is
defined as:

Y = β0 + β1 X1 + β2 X2……+ βn Xn

Which of the following statement(s) are true?

1. If Xi changes by an amount ∆Xi, holding other variables constant, then the expected value of Y
changes by a proportional amount βi ∆Xi, for some constant βi (which in general could be a
positive or negative number).
2. The value of βi is always the same, regardless of values of the other X’s.
3. The total effect of the X’s on the expected value of Y is the sum of their separate effects.

Note: Features are independent of each others(zero interaction).

A. 1 and 2 -
Your Ultimate path for Becoming
B. 1 and 3
a DATA Scientist!
C. 2 and 3 Download this learning path to start your data
science journey.
D. 1,2 and 3

Solution: D

Email Id
1. The expected value of Y is a linear function of the X variables. This means:

k
1. If X i changes by an amount ∆X i , holding other variables fixed, then the expected value of Y
changes by a proportional amount β i ∆X i , for some Download
 constant Resource
β i (which in general could be a
positive or negative number).

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 16/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

2. The value of β i is always the same, regardless of values of the other X’s.
3. The total effect of the X’s on the expected value of Y is the sum of their separate effects.

2. The unexplained variations of Y are independent random variables (in particular, not “auto correlated” if the
variables are time series)
3. They all have the same variance (“homoscedasticity”).
4. They are normally distributed.

Q23. How many coefficients do you need to estimate in a simple linear regression model (One
independent variable)?

A. 1

B. 2

C. Can’t Say

Solution: B

In simple linear regression, there is one independent variable so 2 coefficients (Y=a+bx).

Q24. Below graphs show two fitted regression lines (A & B) on randomly generated data. Now, I want to
find the sum of residuals in both cases A and B.
-
Note:
Your Ultimate path for Becoming
1. Scale is same in both graphs for both axis. a DATA Scientist!
Download
2. X axis is independent variable and Y-axis is dependent this learning path to start your data
variable.
science journey.

Email Id

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 17/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Which of the following statement is true about sum of residuals of A and B?

A) A has higher than B

B) A has lower than B

C) Both have same

D) None of these

Solution: C

Sum of residuals always zero.

Q25. If two variables are correlated, is it necessary that they have a linear relationship?

A. Yes

B. No

Solution: B

It is not necessary. They could have non linear relationship

-
Q26. Correlated variables can have zero correlation coeffficient. True or False?
Your Ultimate path for Becoming
A. True a DATA Scientist!
Download this learning path to start your data
B. False science journey.

Solution: A

Email Id

k
Q27. Suppose I applied a logistic regression model on data and got training accuracy X and testing
accuracy Y. Now I want to add few new features in data. Select option(s)
DownloadwhichResource
are correct in such case.

Note: Consider remaining parameters are same.

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 18/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

1. Training accuracy always decreases.


2. Training accuracy always increases or remain same.
3. Testing accuracy always decreases
4. Testing accuracy always increases or remain same

A. Only 2

B. Only 1

C. Only 3

D. Only 4

Solution: A

Adding more features to model will always increase the training accuracy i.e. low bias. But testing accuracy
increases if feature is found to be significant.

Q28. The graph below represents a regression line predicting Y from X. The values on the graph shows
the residuals for each predictions value. Use this information to compute the SSE.

-
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.

Email Id

k
A. 3.02

 Download Resource
B. 0.75

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 19/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

C. 1.01

D. None of these

Solution: A

SSE is the sum of the squared errors of prediction, so SSE = (-.2)^2 + (.4)^2 + (-.8)^2 + (1.3)^2 + (-.7)^2 = 3.02

Q29. Height and weight are well known to be positively correlated. Ignoring the plot scales (the variables
have been standardized), which of the two scatter plots (plot1, plot2) is more likely to be a plot showing
the values of height (Var1 – X axis) and weight (Var2 – Y axis).

A. Plot2

B. Plot1
-
C. Both Your Ultimate path for Becoming
a DATA Scientist!
D. Can’t say
Download this learning path to start your data
science journey.
Solution: A

Plot 2 is definitely a better representation of the association between height and weight. As individuals get taller,
they take up more volume, which leads to an increase in height, so a positive relationship is expected. The plot
on the right has this positive relationship while the plot on the left Email
showsIda negative relationship.

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 20/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Q30. Suppose the distribution of salaries in a company X has median $35,000, and 25th and 75th
percentiles are $21,000 and $53,000 respectively.

Would a person with Salary $1 be considered an Outlier?

A. Yes

B. No

C. More information is required

D. None of these.

Solution: C

Q31. Which of the following option is true regarding “Regression” and “Correlation” ?

Note: y is dependent variable and x is independent variable.

A. The relationship is symmetric between x and y in both.

B. The relationship is not symmetric between x and y in both.

C. The relationship is not symmetric between x and y in case of correlation but in case of regression it is
symmetric.
-
D. The relationship is symmetric between x and y in case of correlation but in case of regression it is not
symmetric.
Your Ultimate path for Becoming
a DATA Scientist!
Solution: D Download this learning path to start your data
science journey.
1. Correlation is a statistic metric that measures the linear association between two variables. It treats y and x
symmetrically.
2. Regression is setup to predict y from x. The relationship is not symmetric.

Email Id

Download
 and
Q32. Can we calculate the skewness of variables based on mean median?Resource
k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 21/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

A. True

B. False

Solution: B

The skewness is not directly related to the relationship between the mean and median.

Q33. Suppose you have n datasets with two continuous variables (y is dependent variable and x is
independent variable). We have calculated summary statistics on these datasets. All of them give the
following result:

Are all the given datasets same?

A. Yes

B. No
-
Your Ultimate path for Becoming
C. Can’t Say a DATA Scientist!
Download this learning path to start your data
Solutiom: C
science journey.

To answer this question, you should know about Anscombe’s quartet. Refer this link
(https://en.wikipedia.org/wiki/Anscombe's_quartet) to read more about this.

Email Id

Q34. How does number of observations influence overfitting? Choose the correct answer(s).

Note: Rest all parameters are same


 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 22/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

1. In case of fewer observations, it is easy to overfit the data.


2. In case of fewer observations, it is hard to overfit the data.
3. In case of more observations, it is easy to overfit the data.
4. In case of more observations, it is hard to overfit the data.

A. 1 and 4

B. 2 and 3

C. 1 and 3

D. None of theses

Solution: A

In particular, if we have very few observations and it’s small, then our models can rapidly overfits data. Because
we have only a few points and as we’re increasing in our model complexity like the order of the polynomial, it
becomes very easy to hit all of our observations.

On the other hand, if we have lots and lots of observations, even with really, really complex models, it is difficult
to overfit because we have dense observations across our input.

Q35. Suppose you have fitted a complex regression model on a dataset. Now, you are using Ridge
regression with tuning parameter lambda to reduce its complexity. Choose the option(s) below which
describes relationship of bias and variance with lambda.
-
A. In case of very large lambda; bias is low, variance is low
Your Ultimate path for Becoming
B. In case of very large lambda; bias is low, variance is high a DATA Scientist!
Download this learning path to start your data
C. In case of very large lambda; bias is high, variance is low science journey.

D. In case of very large lambda; bias is high, variance is high

Solution: C
Email Id

k
If lambda is very large it means model is less complex. So in this case bias is high and variance in low.
 Download Resource

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 23/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Q36. Suppose you have fitted a complex regression model on a dataset. Now, you are using Ridge
regression with tuning parameter lambda to reduce its complexity. Choose the option(s) below which
describes relationship of bias and variance with lambda.

A. In case of very small lambda; bias is low, variance is low

B. In case of very small lambda; bias is low, variance is high

C. In case of very small lambda; bias is high, variance is low

D. In case of very small lambda; bias is high, variance is high

Solution: B

If lambda is very small it means model is complex. So in this case bias is low and variance is high because
model will overfit the data.

Q37. What is/are true about ridge regression?

1. When lambda is 0, model works like linear regression model


2. When lambda is 0, model doesn’t work like linear regression model
3. When lambda goes to infinity, we get very, very small coefficients approaching 0
4. When lambda goes to infinity, we get very, very large coefficients approaching infinity

A. 1 and 3
-
B. 1 and 4
Your Ultimate path for Becoming
C. 2 and 3 a DATA Scientist!
Download this learning path to start your data
D. 2 and 4 science journey.

Solution: A

Specifically, we can see that when lambda is 0, we get our least square solution. When lambda goes to infinity,
Email Id
we get very, very small coefficients approaching 0.

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 24/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Q38. Out of the three residual plots given below, which of the following represent worse model(s)
compared to others?

Note:

1. All residuals are standardized.


2. The plots are between predicted values Vs. residuals

-
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.

Email Id

A. 1  Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 25/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

B. 2

C. 3

D. 1 and 2

Solution: C

There should not be any relationship between predicted values and residuals. If there exist any relationship
between them means model has not perfectly capture the information in data.

Q39. Which of the following method(s) does not have closed form solution for its coefficients?

A. Ridge regression

B. Lasso

C. Both Ridge and Lasso

D. None of both

Solution: B

The Lasso does not admit a closed-form solution. The L1-penalty makes the solution non-linear. So we need to
approximate the solution.
-
If you want to read more about closed form solutions, refer this link
Your Ultimate
(http://statweb.stanford.edu/~tibs/sta305files/Rudyregularization.pdf).
path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.
Q40. Consider the following dataset

Email Id

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 26/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Which bold point, if removed will have the largest effect on fitted regression line as shown in above
figure(dashed)?

A) a

B) b

C) c
-
Your Ultimate path for Becoming
D) d a DATA Scientist!
Download this learning path to start your data
Solution: D science journey.

Linear regression is sensitive to outliers in the data. Although c is also an outlier in given data space but it is
closed to the regression line(residual is less) so it will not affect much.

Email Id

Q41. In a simple linear regression model (One independent variable), If we change the input variable by 1
unit. How much output variable will change?  Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 27/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

A: By 1

B. No change

C. By intercept

D. By its Slope

Solution: D

Equation for simple linear regression: Y=a+bx. Now if we increase the value of x by 1 then the value of y would
be a+b(x+1) i.e. value of y will get incremented by b.

Q42. Logistic Regression transforms the output probability to be in a range of [0, 1]. Which of the
following function is used by logistic regression to convert the probability in the range between [0,1].

A. Sigmoid

B. Mode

C. Square

D. Probit

Solution: A
-
Sigmoid function is used to convert output probability between [0,1] in logistic regression.
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
Q43: Which of the following statement is true about partialscience journey.
derivative of the cost functions w.r.t weights /
coefficients in linear-regression and logistic-regression?

A. Both will be different

Email Id
B. Both will be same

C. Can’t say  Download Resource k


D. None of these

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 28/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Solution: B

Refer this link (http://feature-space.com/2011/10/28/logistic-cost-function-derivative/)

Q44. Suppose, we are using Logistic regression model for n-class classification problem. In this case,
we can use One-vs-rest method. Choose which of the following option is true regarding this?

A. We need to fit n model in n-class classification problem.

B. We need to fit n-1 models to classify into n classes.

C. We need to fit only 1 model to classify into n classes.

D. None of these.

Solution: A

If there are n classes, then n separate logistic regression has to fit, where the probability of each category is
predicted over the rest of the categories combined.

Take a example of 3-class(-1,0,1) classification. Then need to train 3 logistic regression classifiers.

1. -1 vs 0 and 1
2. 0 vs -1 and 1
3. 1 vs 0 and -1
-
Your Ultimate path for Becoming
Q45. Below are two different logistic models with differentavalues
DATAforScientist!
β0 and β1.
Download this learning path to start your data
science journey.

Email Id

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 29/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Which of the following statement(s) is true about β0 and β1 values of two logistics models (Green,
Black)?

Note: consider Y = β0 + β1*X. Here, β0 is intercept and β1 is coefficient.

A. β1 for Green is greater than Black

B. β1 for Green is lower than Black

C. β1 for both models is same

D. Can’t Say. -
Your Ultimate path for Becoming
Solution: B
a DATA Scientist!
β0 and β1: β0 = 0, β1 = 1 is in X1 color(black) and β0 = 0, β1 =Download thiscolor
−1 is in X4 learning path to start your data
(green)
science journey.

End Notes
Email Id
I hope you enjoyed taking the test and you found the solutions helpful. The test focused on conceptual

k
knowledge of regression and its various techniques.
 Download Resource

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 30/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

We tried to clear all your doubts through this article but if we have missed out on something then let me know in
comments below. If you have any suggestions or improvements you think we should make in the next skilltest, let
us know by dropping your feedback in the comments section.

You can test your skills and knowledge. Check out Live Competitions
(http://datahack.analyticsvidhya.com/contest/all) and compete with best Data Scientists from
all over the world.

You can also read this article on Analytics Vidhya's Android APP

(//play.google.com/store/apps/details?
id=com.analyticsvidhya.android&utm_source=blog_article&utm_campaign=blog&pcampaignid=MKT-Other-
global-all-co-prtnr-py-PartBadge-Mar2515-1)

Share this:

 (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=linkedin&nb=1)

 (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=facebook&nb=1)

 (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=twitter&nb=1)

 (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=pocket&nb=1)

 (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=reddit&nb=1) -
Your Ultimate path for Becoming
a DATA Scientist!
Like this:
Download this learning path to start your data
Loading... science journey.

Related Articles Email Id

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 31/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

(https://www.analyticsvidhya.com/blog/2017/08/skilltest-
(https://www.analyticsvidhya.com/blog/2017/07/30-
(https://www.analyticsvidhya.com/blog/
logistic-regression/) questions-to-test-a-data-scientist- for-skilltest-machine-learning-
30 Questions to test your on-linear-regression/) revealed/)
understanding of Logistic Regression 30 Questions to test a data scientist Solutions for Skilltest Machine
(https://www.analyticsvidhya.com/blo on Linear Regression [Solution: Learning : Revealed
g/2017/08/skilltest-logistic- Skilltest – Linear Regression] (https://www.analyticsvidhya.com/blo
regression/) (https://www.analyticsvidhya.com/blo g/2016/11/solution-for-skilltest-
August 3, 2017 g/2017/07/30-questions-to-test-a- machine-learning-revealed/)
In "Machine Learning" data-scientist-on-linear-regression/) November 20, 2016
July 3, 2017 In "Machine Learning"
In "Linear Regression"

TAGS : LINEAR REGRESSION IN R (HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/TAG/LINEAR-REGRESSION-IN-R/), LOGISTICS

REGRESSION (HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/TAG/LOGISTICS-REGRESSION/), MULTIPLE REGRESSION


(HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/TAG/MULTIPLE-REGRESSION/), MULTIPLE REGRESSION IN R

(HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/TAG/MULTIPLE-REGRESSION-IN-R/), ORDINAL REGRESSION -


(HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/TAG/ORDINAL-REGRESSION/), REGRESSION
Your Ultimate path for Becoming
(HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/TAG/REGRESSION/), REGRESSION QUESTIONS
a DATA Scientist!
(HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/TAG/REGRESSION-QUESTIONS/), SKILLTEST REGRESSION
Download this learning path to start your data
(HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/TAG/SKILLTEST-REGRESSION/), SKILLTEST REGRESSION SOLUTIONS
science journey.
(HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/TAG/SKILLTEST-REGRESSION-SOLUTIONS/)

Email Id

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 32/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

NEXT ARTICLE

Data Pre-Processing: A Crucial Element of Analytics – Driven Embedded Systems


(https://www.analyticsvidhya.com/blog/2016/12/data-pre-processing-a-crucial-element-of-analytics-driven-
embedded-systems/)

h
PREVIOUS ARTICLE

Hadoop developer-2
(https://www.analyticsvidhya.com/blog/2016/12/hadoop-developer-2/)

(https://www.analyticsvidhya.com/blog/author/facebook_user_4/)
Ankit Gupta
(Https://Www.Analyticsvidhya.Com/Blog/Author/Facebook_user_4/)
Ankit is currently working as a data scientist at UBS who has solved complex data mining problems in
many domains. He is eager to learn more about data science and machine learning algorithms.
-
x (https://www.linkedin.com/in/ankit-gupta-84b737ba?trk=nav_responsive_tab_profile)
Your Ultimate path for Becoming
I (https://github.com/anki1909) aGDATA
(ankit.gupta968)
Scientist!
Download this learning path to start your data
science journey.

This article is quite old and you might not get a prompt response from the author. We request you to post
this comment on Analytics Vidhya's Discussion portal (https://discuss.analyticsvidhya.com/)
Email Id to get your
queries resolved

51 COMMENTS  Download Resource k


https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 33/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

RISHABH Reply
December 19, 2016 at 5:26 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119944)

Can you explain Q26 please.

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 19, 2016 at 5:42 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119946)

Hi Rishabh,

Thank you for asking the question.

Think about Pearson correlation between two continuous variables. Pearson correlation will measure only linear
relationships between variables.
Refer this link to get more sense of it.
https://www.analyticsvidhya.com/blog/2015/06/correlation-common-questions/
(https://www.analyticsvidhya.com/blog/2015/06/correlation-common-questions/)

Hope above answer will help you to understand the concepts.

Best!
Ankit Gupta

-
KRISHNA Your Ultimate path for Becoming Reply
a DATA Scientist!
December 19, 2016 at 5:56 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119947) Download this learning path to start your data
science journey.
What is difference between continuous variable and Discrete( Categorical) variable?
How Logistic Regression used for predicting continuous variable? (Que 11)

ANKIT GUPTA Email Id Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 19, 2016 at 6:13 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119950)
 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 34/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Hi Krishna,
If a variable can take any value between its minimum and maximum value, then it is called a continuous variable.
By nature, a lot of things we deal with fall in this category: age, weight, height being some of them.

But in case of categorical variable it can take only few values. For example, a gender can be male or female.

Coming to the point. How can you use logistic regression for continuous variable prediction? Convert the
continuous variables into bin(say 10). By creating the bin of this variable; you actually created some categories.
So now you can use logistic regression.

Hope this answer will help you to understand.


Best!
Ankit Gupta

KRISHNA Reply
December 19, 2016 at 6:19 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119952)

Thanks Ankit for such nice explanation.

SUBBIAH Reply
December 22, 2016 at 2:15 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120198)

What you mean by “minimum and maximum value, then it is called a continuous variable.”? Usually a metric
quantity (such as in your example, that too measurement scale is essential) is continuous variable (Precisely
-
assuming on a uncountable set) Say age measured in years isYour
a continuous
Ultimate variable.
path When
for “maximum”
Becoming is not
known we say age lies in an infinite interval.
a DATA Scientist!
Download this learning path to start your data
science journey.
ANKIT GUPTA Reply
(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 22, 2016 at 5:41 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120209)
Email Id
Hi Subbiah,

Thanks for asking this question. It is really a good question.  Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 35/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Here is the explanation:

A continuous variable can take infinite number of values between any defined interval. It doesn’t have to have
every possible numbers, for example, infinite. Infinite itself is not a real number.

Best!
Ankit Gupta

SUBBIAH Reply
December 22, 2016 at 11:21 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120232)

Plz explain “”can take infinite number of values between any defined interval” What do you mean by defined
interval? I do not understand the terms you are using to define a variable (or range of a variable)

I do not mean infinite as a number. There is a difference in saying finite / infinite interval in real line. For example
[0, 1] (for proportion say) is a finite interval whereas lifetime (usual example for exponential distribution) in years
has a range (0, Inf) which is an infinite interval. But both intervals are uncountable.

Precisely a reader should be able to distinguish between, finite, countable, and uncountable sets to understand a
discrete / continuous variable

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 22, 2016 at 11:55 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120235)
-
Your Ultimate path for Becoming
You can think a defined interval as a close interval of values, for example values between, [7,15] where 7 and 15
a DATA Scientist!
is defined values. Now I can choose any number of values between [7,15] like, 7.1, 7.001, ….. n values where n
Download this learning path to start your data
can be any thing. It can be 1 lac, 10 lac values that is why I am saying infinite values.
science journey.

If n is 1 lac means it is countable set. If n is infinite means it is uncountable set.


Hope this helps!
Regards,
Ankit Gupta Email Id

SUBBIAH  Download Resource k


Reply

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 36/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

December 23, 2016 at 5:31 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-


regression-skill-test-regression-solution/#comment-120292)

“If n is 1 lac means it is countable set. If n is infinite means it is uncountable set.” – Do you mean that with a finite
n (say 1 lac or 10 lac) the closed (or defined) interval [7,15] becomes countable?

KENNETH SINGH Reply


December 19, 2016 at 8:01 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119957)

For Q18, Isn’t Adjusted R Square also supposed to increase or remain constant (though fairly more robust than
R Square)? Haven’t come across Adjusted R Square decrease on adding a new variable. Please explain the
theory behind it.

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 19, 2016 at 8:39 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119962)

Hi Kenneth,

Thank you for reaching out. Please read this discussion which will give answer to your question.

Link = https://discuss.analyticsvidhya.com/t/difference-between-r-square-and-adjusted-r-square/264/2
(https://discuss.analyticsvidhya.com/t/difference-between-r-square-and-adjusted-r-square/264/2)
-
Best!
Your Ultimate path for Becoming
Ankit Gupta
a DATA Scientist!
Download this learning path to start your data
science journey.
KENNETH SINGH Reply
December 19, 2016 at 8:05 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119958)

Please explain Q22: The Beta for a particular variable is supposed to change
Email Id on adding or deleting other
variables right? I always see them change.

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 37/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 19, 2016 at 10:18 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119970)

Number of features is not changing. Only consider the changing in the values of other x’s variables.

KENNETH SINGH Reply


December 19, 2016 at 10:30 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119972)

But this would still lead to change in the Beta under consideration right?

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 19, 2016 at 10:45 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119977)

Lets take the example. Suppose you have equation:


y = b0 + b1*x1 + b2*x2.
Now you want to change the x2 to x2/10. So after doing this the coefficient of x2(b2) will change by b2*10.

Note:
1. b1 will not be affected
2. b2*x2 is remain same as before
-
Your Ultimate path for Becoming
KENNETH SINGH a DATA Scientist! Reply
December 19, 2016 at 10:56 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
Download this learning path to start your data
regression-skill-test-regression-solution/#comment-119981) science journey.

Please correct me if I am not understanding this correctly:


The change in Beta that I am referring to is the Beta that we get while determining this Beta.
Let’s say we have this dataset:
X 1 X2 Y Email Id
127
348
569
 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 38/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

This would give one Beta value for X.

Now, if we change the value of X1, with X2 remaining the same, say:
X 1 X2 Y
10 2 7
11 4 8
569

This would change the Beta for X2, when we model Y = f(X1,X2), right?

KARIM LULU Reply


December 19, 2016 at 11:45 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119990)

I think that you’re right. In that particular case coefficients changed from [0.25; 0.25] to [7.3e-17; 0.5]

KENNETH SINGH Reply


December 19, 2016 at 12:05 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119995)

Yes, that’s what my understanding was in answering this question. But let Ankit give his thoughts, we could be
wrong.

KARIM LULU
December 19, 2016 at 11:50 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
-
Reply

regression-skill-test-regression-solution/#comment-119992) Your Ultimate path for Becoming


a DATA Scientist!
It seems that option B is correct one for Q22. Since it allows any transformation of X’s, not only
Download this learning path to start your data
multiplication/division.
science journey.

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
Email Id
December 19, 2016 at 12:05 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-

k
regression-skill-test-regression-solution/#comment-119994)

 Download Resource
Hi Karim,

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 39/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

It is due to interaction present between x1 and x2. In this question we wanted to check the linear regression
assumptions.

Best!
Ankit Gupta

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 19, 2016 at 12:12 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119996)

Hi Karim & kenneth,

Thanks for noticing it. It was assumed that features are independent of each other.

Best!
Ankit Gupta

KENNETH SINGH Reply


December 19, 2016 at 12:17 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119997)

Satisfying the no Multicollinearity condition (by having very low correlation among the Independent Variables)
would still have effect the Beta coefficient in the way discussed above right?

-
KARIM LULU Your Ultimate path for Becoming Reply
December 19, 2016 at 1:27 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
a DATA Scientist!
regression-skill-test-regression-solution/#comment-120000)
Download this learning path to start your data
science journey.
That’s clear now.

Thank you a lot!

Email Id
KENNETH SINGH Reply
December 19, 2016 at 8:09 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119959)
 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 40/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Please explain Q26. Are we talking about Spearman Correlation here?

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 19, 2016 at 8:50 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119963)

No, It was not specifically asked from Spearman Correlation.

Refer this link: https://www.analyticsvidhya.com/blog/2015/06/correlation-common-questions/


(https://www.analyticsvidhya.com/blog/2015/06/correlation-common-questions/)

Best!
Ankit Gupta

KENNETH SINGH Reply


December 19, 2016 at 9:12 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119965)

I couldn’t find an answer to how correlated variables can have a zero correlation coefficient. Could you explain a
bit on this?

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 19, 2016 at 9:54 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
-
regression-skill-test-regression-solution/#comment-119968) Your Ultimate path for Becoming
a DATA Scientist!
Please read the point “Answer – 1: Correlation vs. Dependency” from the link which I wrote in previous comment.
Download this learning path to start your data
In this point Tavish has clearly written that:
science journey.

A non-dependency between two variable means a zero correlation. However the inverse is not true. A zero
correlation can even have a perfect dependency.

Email Id
KENNETH SINGH

k
Reply
December 19, 2016 at 10:12 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
 Download Resource
regression-skill-test-regression-solution/#comment-119969)

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 41/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Ok. Thank you.

SANDEEP Reply
December 19, 2016 at 10:39 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119976)

Hi Ankit,

Can you please elaborate this point Correlation coefficient is between [-1 ,1]. So -1.09 is not possible. Is it fair to
say in a Medical or a Finical data that if there is a -ve co relation then it means something. For example in a
Clinical research if there is some Diet followed and we a -ve correlation on the Diet and Health then we can say
that as the Protein Diet Decreases Increases risk in health issues and. In Financial Data if we happen to do a
Sales Prediction and we see a -ve co relation between DV and IDV then we can Fairly says that decrease in IDV
and increase a DV

* IDV: Independent Variable


*DV: Dependent Variable

Regards
Sandeep R Diddi

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 19, 2016 at 10:57 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119982)
-
correlation coefficient can not be below -1 and greater than 1. So if correlation
Your Ultimate coefficient
path for is -1Becoming
means variables
have perfect negative correlation. if correlation coefficient is 1 means variables have perfect positive correlation.
a DATA Scientist!
Download this learning path to start your data
science journey.
POUSHALI GOSWAMI Reply
December 20, 2016 at 8:49 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120036)

in question 23, we can have a model as y=ax, without consideringEmail


the intercept
Id ..

ANKIT GUPTA
 Download Resource k
Reply
(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 42/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

December 20, 2016 at 9:22 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-


regression-skill-test-regression-solution/#comment-120040)

Hi Poushali,
You can think simple linear equation as y = ax +c where c = 0.

Hope this will clear your doubts.

Best!
Ankit Gupta

MUKUL Reply
December 20, 2016 at 2:15 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120058)

Hi

Can you please explain Q 30. What all information is required to definitely call an observation as an outlier.

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 22, 2016 at 5:48 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120210)

Hi Mukul,
-
Thanks for reaching out!
Your Ultimate path for Becoming
a DATA Scientist!
Please refer this article: https://www.analyticsvidhya.com/blog/2016/01/guide-data-exploration/
Download this learning path to start your data
(https://www.analyticsvidhya.com/blog/2016/01/guide-data-exploration/)
science journey.

Best!
Ankit Gupta

Email Id
KOMAL Reply

skill-test-regression-solution/#comment-120770)  Download Resource k


January 9, 2017 at 8:52 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 43/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

hi

i want to study regression model in details. can you please suggest any book??
also book regarding analysis in predictive modeling.

thanks and regards,


komal.

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
January 9, 2017 at 9:13 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-
skill-test-regression-solution/#comment-120773)

Hi Komal,

Thanks for reaching out.

You can refer this book “The Elements of Statistical Learning”.

Best!
Ankit Gupta

KOMAL Reply
January 14, 2017 at 5:42 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121006)
-
Thank you.. Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
SOWMYA science journey.
Reply
January 19, 2017 at 11:46 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121277)

Hi,
Email Id
In Q40, amongst data points ‘C’ and “D’ the regression line would vary upon removing point ‘C’ I believe since
right now it looks like outliers “A”, “B” and “C” play some role in directing the regression line in the direction given
in the figure.  Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 44/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

So if one of these key points, specifically “C” , since removing it can direct the regression line towards the outlier
“D” would change the direction of the line.

Your thoughts?

ANKIT GUPTA Reply


(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
January 20, 2017 at 7:24 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121308)

HI Sowmya,

Thanks for reaching out. Since, the residuals of A, B, C are very less as compare to D. So if you remove the
outlier D; It will give you maximum change in direction of regression line.

Regards,
Ankit Gupta

SOWMYA Reply
January 24, 2017 at 7:01 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121503)

Hi Ankit,

The link for Q43 seems broken. Can you provide another link where the answer for the question would be
explained better.
-
Your Ultimate path for Becoming
Thanks.
a DATA Scientist!
Download this learning path to start your data
science journey.
KARTHIK Reply
February 1, 2017 at 4:19 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121946)

Shouldn’t the solution for Q14 be B – 1&3. Email Id


Simple Linear regression will have high bias and low variance
polynomial of degree 3 will have low bias and high variance (instead of low variance)
 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 45/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

ANKIT Reply
February 1, 2017 at 5:05 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121948)

Hi Karthik,

Thanks for asking. 3rd option is incorrect because data is itself generated using 3rd-degree polynomial. It means
that if you fit 3rd-degree polynomial it will perfectly fit the data(Not only train but test also). which means it
satisfies the condition of low bias and low variance.

For more information just read from this link:


https://discuss.analyticsvidhya.com/t/how-does-the-variance-bias-tradeoff-in-ensembles-work/6065/2
(https://discuss.analyticsvidhya.com/t/how-does-the-variance-bias-tradeoff-in-ensembles-work/6065/2)

Regards,
Ankit Gupta

MARAKHI DER Reply


May 28, 2017 at 9:19 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-
skill-test-regression-solution/#comment-129368)

Regarding Q34.

With Fewer observation we can have bias issue. How can we have overfitting issue?
I am not getting the point. Can you please explain it?

-
MARAKHI DER Your Ultimate path for Becoming Reply
a DATA Scientist!
May 28, 2017 at 9:59 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-
skill-test-regression-solution/#comment-129370) Download this learning path to start your data
science journey.
I got the explanation thank you.

BHAGYASHSREE (HTTP://ANALYTICS%20VIDYA) Reply


Email Id
August 23, 2017 at 9:29 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-

k
skill-test-regression-solution/#comment-134968)

Download Resource
which includes mathematical stuffs too.
ankit can i get questions related to all regression and ML techniques

https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 46/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

DATA SCIENCE QUESTIONARE : QUESTION EVERYTHING – ANKITONTOES


(HTTPS://KAVITASBLOG.WORDPRESS.COM/2017/08/24/DATA-SCIENCE-QUESTIONARE-
QUESTION-EVERYTHING/)
August 24, 2017 at 1:51 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-
skill-test-regression-solution/#comment-135008)

[…] https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-ski&#8230
(https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-ski&#8230);
[…]

CRANIMAL (HTTP://CRANIMALS.COM/) Reply


September 15, 2017 at 7:37 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-137186)

I’m very pleased to find this site. I need to to thawnk


you for your time for this wonderful read!! I definitely savored
every part of it and I have you book marked to look at new
stuff in your site.

数据科学家45题自测:回归知识部分(有答案和解析) | 神刀安全网
(HTTP://WWW.SHELLSEC.COM/NEWS/50174.HTML)
November 10, 2017 at 9:57 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-143055)

[…] 文章原标题《45 questions to test a Data Scientist on Regression (Skill test – Regression Solution)》,作
者: ANKIT GUPTA文章为简译,更为详细的内容,请查看原文 […]
-
PRIYANKA SINGH Reply
Your Ultimate path for Becoming
February 20, 2018 at 11:16 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-151505) a DATA Scientist!
Download this learning path to start your data
Should we really use R^2 for evaluating regression models? Q-20
science journey.

REDDI Reply
April 6, 2018 at 12:49 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-
Email Id
skill-test-regression-solution/#comment-152401)

can you please explain Q2 please


 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 47/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

JOIN THE NEXTGEN DATA SCIENCE ECOSYSTEM

Get access to free courses on Analytics Vidhya


Get free downloadable resource from Analytics Vidhya
Save your articles
Participate in hackathons and win prizes

(https://id.analyticsvidhya.com/accounts/login/?
next=https://www.analyticsvidhya.com/blog/? Join Now
utm_source=blog-subscribe&utm_medium=web)

Squarespace Website Builder


Choose one of Squarespace's
beautiful layouts to showcase your
-
products online.
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.

Email Id

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 48/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Squarespace Website Builder


Choose one of Squarespace's
beautiful layouts to showcase your
products online.

POPULAR POSTS

6 Useful Programming Languages for Data Science You Should Learn (that are not R and Python)
(https://www.analyticsvidhya.com/blog/2019/06/6-useful-programming-languages-data-science-r-python/)
24 Ultimate Data Science Projects To Boost Your Knowledge and Skills (& can be accessed freely)
(https://www.analyticsvidhya.com/blog/2018/05/24-ultimate-data-science-projects-to-boost-your-knowledge-
and-skills/)
Commonly used Machine Learning Algorithms (with Python and R Codes)
(https://www.analyticsvidhya.com/blog/2017/09/common-machine-learning-algorithms/)
-
A Complete Python Tutorial to Learn Data Science from Scratch
Your Ultimate path for Becoming
(https://www.analyticsvidhya.com/blog/2016/01/complete-tutorial-learn-data-science-python-scratch-2/)
a DATA Scientist!
7 Regression Techniques you should know! (https://www.analyticsvidhya.com/blog/2015/08/comprehensive-
guide-regression/) Download this learning path to start your data
science journey.
6 Powerful Open Source Machine Learning GitHub Repositories for Data Scientists
(https://www.analyticsvidhya.com/blog/2019/07/6-powerful-open-source-machine-learning-github-repositories-
data-scientists/)
Stock Prices Prediction Using Machine Learning and Deep Learning Techniques (with Python codes)
Email Id
(https://www.analyticsvidhya.com/blog/2018/10/predicting-stock-price-machine-learningnd-deep-learning-
techniques-python/)
Understanding Support Vector Machine algorithm from examples (along
 Downloadwith code)
Resource
(https://www.analyticsvidhya.com/blog/2017/09/understaing-support-vector-machine-example-code/)
k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 49/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

-
Your Ultimate path for Becoming
RECENT POSTS a DATA Scientist!
Download this learning path to start your data
science journey.
Introduction to PyTorch-Transformers: An Incredible Library for State-of-the-Art NLP (with Python code)
(https://www.analyticsvidhya.com/blog/2019/07/pytorch-transformers-nlp-python/)
JULY 18, 2019

How to Get Started with NLP – 6 Unique Methods to Perform Email


Tokenization
Id
(https://www.analyticsvidhya.com/blog/2019/07/how-get-started-nlp-6-unique-ways-perform-
tokenization/)
JULY 18, 2019
 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 50/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

Heroes of Machine Learning – Top Experts and Researchers you should follow
(https://www.analyticsvidhya.com/blog/2019/07/heroes-of-machine-learning-experts-researchers/)
JULY 17, 2019

Popular Machine Learning Applications and Use Cases in our Daily Life
(https://www.analyticsvidhya.com/blog/2019/07/ultimate-list-popular-machine-learning-use-cases/)
JULY 15, 2019

-
Your Ultimate path for Becoming
(http://www.edvancer.in/certified-data-scientist-with-python-course?
a DATA Scientist!
Download this learning path to start your data
science journey.

utm_source=AV&utm_medium=AVads&utm_campaign=AVadsnonfc&utm_content=pythonavad)
Email Id

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 51/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

(https://courses.analyticsvidhya.com/courses/natural-language-

processing-nlp?utm_source=Sticky_banner1&utm_medium=display&utm_campaign=NLPcourse)

(https://datahack.analyticsvidhya.com/contest/amexpert-decipher-

ANALYTICS DATA COMPANIES JOIN OUR COMMUNITY :


VIDHYA SCIENTISTS
Post Jobs
 
About Us Blog (https://www.analyticsvidhya.com/corporate/)
(http://www.analyticsvidhya.com/about-
(https://www.analyticsvidhya.com/blog)
Trainings (https://www.facebook.com/AnalyticsV
(https://twitter.com/ana
women-machine-learning-hackathon/?
me/) Hackathon
utm_source=Sticky_banner1&utm_medium=display&utm_campaign=amexpert)
Our Team
(https://trainings.analyticsvidhya.com)
(https://datahack.analyticsvidhya.com/)
Hiring
46336 21391 -
(https://www.analyticsvidhya.com/about-
Hackathons (https://www.facebook.com/AnalyticsV
(https://twitter.com/ana
Discussions
me/team/) Your Ultimate path for
(https://datahack.analyticsvidhya.com/)
(https://discuss.analyticsvidhya.com/) Followers
Becoming
Followers
Career Apply Jobs a DATA Scientist!
Advertising
(https://www.analyticsvidhya.com/career- (https://www.facebook.com/AnalyticsV
(https://twitter.com/ana
(https://www.analyticsvidhya.com/contact/)
(https://www.analyticsvidhya.com/jobs/)
analytics-vidhya/) Download this learning
 path to start
 your data
Leaderboard Reach Us
Contact Us science journey.
(https://www.analyticsvidhya.com/contact/)
(https://plus.google.com/+Analyticsvidh
(https://datahack.analyticsvidhya.com/users/)
(https://www.analyticsvidhya.com/contact/)
(https://in.linkedin.com
Write for us (https://plus.google.com/+Analyticsvidh
vidhya) 7513
(https://www.analyticsvidhya.com/about- Followers (https://in.linkedin.com
me/write/)
vidhya) Followers
(https://plus.google.com/+Analyticsvidh
Email Id (https://in.linkedin.com
vidhya)

 Download Resource
Subscribe to emailer > k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 52/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test

© Copyright 2013-2019 Analytics Vidhya. Privacy Policy (https://www.analyticsvidhya.com/privacy-policy/) Don't have an account? Sign up (https:/

Terms of Use (https://www.analyticsvidhya.com/terms/)


Refund Policy (https://www.analyticsvidhya.com/refund-policy/)

×
-
(http://play.google.com/store/apps/details?id=com.analyticsvidhya.android)

-
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.

Email Id

 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 53/53

Вам также может понравиться