Академический Документы
Профессиональный Документы
Культура Документы
NEXT=HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/2016/12/45-QUESTIONS-TO-TEST-A-DATA-SCIENTIST-ON-REGRESSION-SKILL-TEST-
REGRESSION-SOLUTION/)
(https://www.analyticsvidhya.com/blog/)
(https://analyticsvidhya.com/datahack-summit-2019/?
utm_source=blog&utm_medium=topBanner&utm_campaign=DHS2019)
R (HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/CATEGORY/R/)
-
45 questions to test a Data Scientist on Regression (Skill
Your Ultimate pathtest
for – Regression
Becoming
Solution) a DATA Scientist!
Download this learning path to start your data
science journey.
ANKIT GUPTA (HTTPS://WWW.ANALYTICSVIDHYA.COM/BLOG/AUTHOR/FACEBOOK_USER_4/), DECEMBER 19, 2016 LOGIN TO BOOKMA…
Email Id
(https://courses.analyticsvidhya.com/bundles/certified-machine-learning-master-s-program/?
Download Resource
utm_source=AVBannerbelowtitle&utm_medium=display&utm_campaign=AMLprogram) k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 1/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Introduction
Regression is much more than just linear and logistic regression. It includes many techniques for modeling and
analyzing several variables. This skill test was designed to test your conceptual and practical knowledge of
various regression techniques.
A total of 1845 number of people participated in the test. I am sure they all will agree it was the best skill
assessment test on regression they have come across.
If you are one of those who missed out on this skill test, then you did miss out on the real time test. But here the
questions with detailed solutions, find out how many you could have answered correctly.
(https://datahack.analyticsvidhya.com/contest/skilltest-regression/)
Overall Scores
Below are the distribution of scores, this will help you evaluate your performance: -
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.
Email Id
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 2/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
(http://https://datahack.analyticsvidhya.com/contest/skilltest-regression/lb)
Median Score: 23
Mode Score: 23
5 Questions which can teach you Multiple Regression (with R and Python)
(https://www.analyticsvidhya.com/blog/2015/10/regression-python-beginners/)
Email Id
7 Types of Regression Techniques you should know
(https://www.analyticsvidhya.com/blog/2015/08/comprehensive-guide-regression/)
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 3/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Q1. Which of the following step / assumption in regression modeling impacts the trade-off between
under-fitting and over-fitting the most.
Solution: A
Choosing the right degree of polynomial plays a critical role in fit of regression. If we choose higher degree of
polynomial, chances of overfit increase significantly.
Q2. Suppose you have the following data with one real-value input variable & one real-value output -
variable. What is leave-one out cross validation mean square error in case of linear regression (Y =
Your Ultimate path for Becoming
bX+c)?
a DATA Scientist!
Download this learning path to start your data
science journey.
A. 10/27
Email Id
k
B. 20/27
Download Resource
C. 50/27
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 4/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
D. 49/27
Solution: D
We need to calculate the residuals for each cross validation point. After fitting the line with 2 points and leaving 1
point for cross validation.
Leave one out cross validation mean square error = (2^2 +(2/3)^2 +1^2) /3 = 49/27
Q3. Which of the following is/ are true about “Maximum Likelihood estimate (MLE)”?
C. 1 and 3
D. 2 and 4 Email Id
Solution: C
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 5/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
The MLE may not be a turning point i.e. may not be a point at which the first derivative of the likelihood (and log-
likelihood) function vanishes.
-
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.
Email Id
Q4. Let’s say, a “Linear regression” model perfectly fits the training data (train error is zero). Now, Which
k
of the following statement is true?
Download Resource
A. You will always have test error zero
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 6/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Solution: C
Test error may be zero if there no noise in test data. In other words, it will be zero, if the test data is perfect
representative of train data but not always.
Q5. In a linear regression problem, we are using “R-squared” to measure goodness-of-fit. We add a
feature in linear regression model and retrain the same model.
C. Individually R squared cannot tell about variable importance. We can’t say anything about it right now.
D. None of these.
Solution: C
“R squared” individually can’t tell whether a variable is significant or not because each time when we add a
feature, “R squared” can either increase or stay constant. But, it is not true in case of “Adjusted R squared” -
(increases when features found to be significant).
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
Q6. Which one of the statement is true regarding residuals science journey. analysis?
in regression
k
C. Mean of residuals is always greater than zero
Download Resource
D. There is no such rule for residuals.
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 7/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Solution: A
Sum of residual in regression is always zero. It the sum of residuals is zero, the ‘Mean’ will also be zero.
D. None of these
Solution: A
The presence of non-constant variance in the error terms results in heteroskedasticity. Generally, non-constant
variance arises because of presence of outliers or extreme leverage values.
Q8. Which of the following indicates a fairly strong relationship between X and Y?
-
A. Correlation coefficient = 0.9
Your Ultimate path for Becoming
a DATA Scientist!
B. The p-value for the null hypothesis Beta coefficient =0 is 0.0001
Download this learning path to start your data
C. The t-statistic for the null hypothesis Beta coefficient=0 is 30science journey.
D. None of these
Solution: A
Email Id
k
Correlation between variables is 0.9. It signifies that the relationship between variables is fairly strong.
Download Resource
On the other hand, p-value and t-statistics merely measure how strong is the evidence that there is non zero
association. Even a weak effect can be extremely significant given enough data.
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 8/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Q9. Which of the following assumptions do we make while deriving linear regression parameters?
A. 1,2 and 3.
B. 1,3 and 4.
C. 1 and 3.
D. All of above.
Solution: D
When deriving regression parameters, we make all the four assumptions mentioned above. If any of the
assumptions is violated, the model would be misleading.
Q10. To test linear relationship of y(dependent) and x(independent) continuous variables, which of the
following plot best suited?
A. Scatter plot -
Your Ultimate path for Becoming
B. Barchart
a DATA Scientist!
C. Histograms Download this learning path to start your data
science journey.
D. None of these
Solution: A
Email Id
To test the linear relationship between continuous variables Scatter plot is a good option. We can find out how
k
one variable is changing w.r.t. another variable. A scatter plot displays the relationship between two quantitative
variables. Download Resource
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 9/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Q11. Generally, which of the following method(s) is used for predicting continuous dependent variable?
1. Linear Regression
2. Logistic Regression
A. 1 and 2
B. only 1
C. only 2
D. None of these.
Solution: B
Logistic Regression is used for classification problems. Regression term is misleading here.
Q12. A correlation between age and health of a person found to be -1.09. On the basis of this you would
tell the doctors that:
Q13. Which of the following offsets, do we use in case of least square line fit? Suppose horizontal axis is
Email Id
independent variable and vertical axis is dependent variable.
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 10/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
A. Vertical offset
B. Perpendicular offset
D. None of above
Solution: A
We always consider residual as vertical offsets. Perpendicular offset are useful in case of PCA.
Q14. Suppose we have generated the data with help of polynomial regression of degree 3 (degree 3 will
perfectly fit this data). Now consider below points and choose the option based on these points. -
1. Simple Linear regression will have high bias and low variance
Your Ultimate path for Becoming
2. Simple Linear regression will have low bias and highavariance
DATA Scientist!
3. Download this learning path to start your data
polynomial of degree 3 will have low bias and high variance
4. science journey.
Polynomial of degree 3 will have low bias and Low variance
A. Only 1
B. 1 and 3
Email Id
k
C. 1 and 4
Download Resource
D. 2 and 4
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 11/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Solution: C
If we fit higher degree polynomial greater than 3, it will overfit the data because model will become more
complex. If we fit the lower degree polynomial less than 3 which means that we have less complex model so in
this case high bias and low variance. But in case of degree 3 polynomial it will have low bias and low variance.
Q15. Suppose you are training a linear regression model. Now consider these points.
Solution: C
1.With small training dataset, it’s easier to find a hypothesis to fit the training data exactly i.e. overfitting.
2. We can see this from the bias-variance trade-off. When hypothesis space is small, it has higher bias and lower -
variance. So with a small hypothesis space, it’s less likely to find a hypothesis to fit the data exactly i.e.
Your Ultimate path for Becoming
underfitting.
a DATA Scientist!
Download this learning path to start your data
science journey.
Q16. Suppose we fit “Lasso Regression” to a data set, which has 100 features (X1,X2…X100). Now, we
rescale one of these feature by multiplying with 10 (say that feature is X1), and then refit Lasso
regression with the same regularization parameter.
Email Id
Now, which of the following option will be correct?
C. Can’t say
D. None of these
Solution: B
Big feature values =⇒ smaller coefficients =⇒ less lasso penalty =⇒ more likely to have be kept
Q17. Which of the following is true about “Ridge” or “Lasso” regression methods in case of feature
selection?
D. None of above
Solution: B
“Ridge regression” will use all predictors in final model whereas “Lasso regression” can be used for feature
selection because coefficient values can be zero. For more detail click here
(https://discuss.analyticsvidhya.com/t/difference-between-ridge-regression-and-lasso-and-its-effect/3000). -
Your Ultimate path for Becoming
a DATA Scientist!
Q18. Which of the following statement(s) can be true post Download
adding a this learning
variable in path to start
a linear your data model?
regression
science journey.
1. R-Squared and Adjusted R-squared both increase
Email Id
3. R-Squared decreases and Adjusted R-squared decreases
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 13/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
B. 1 and 3
C. 2 and 4
Solution: A
Each time when you add a feature, R squared always either increase or stays constant, but it is not true in case
of Adjusted R squared. If it increases, the feature would be significant.
Q19. The following visualization shows the fit of three different models (in blue line) on same training
data. What can you conclude from these visualizations?
1. The training error in first model is higher when compared to second and third model.
2. The best model for this regression problem is the last (third) model, because it has minimum
training error.
3. The second model is more robust than first and third because it will perform better on unseen data. -
4. The third model is overfitting data as compared to first
YourandUltimate
second model.
path for Becoming
5. All models will perform same because we have not seen the test data.
a DATA Scientist!
Download this learning path to start your data
A. 1 and 3
science journey.
B. 1 and 3
C. 1, 3 and 4
Email Id
D. Only 5
The trend of the data looks like a quadratic trend over independent variable X. A higher degree (Right graph)
polynomial might have a very high accuracy on the train population but is expected to fail badly on test dataset.
But if you see in left graph we will have training error maximum because it under-fits the training data.
Q20. Which of the following metrics can be used for evaluating regression models?
1. R Squared
2. Adjusted R Squared
3. F Statistics
A. 2 and 4.
B. 1 and 2.
C. 2, 3 and 4.
Solution: D
These (R Squared, Adjusted R Squared, F Statistics , RMSE / MSE / MAE ) are some metrics which you can use
to evaluate your regression model. -
Your Ultimate path for Becoming
a DATA Scientist!
Downloadwith
Q21. We can also compute the coefficient of linear regression this learning
the helppath
oftoan
start your data method
analytical
science
called “Normal Equation”. Which of the following is/are true aboutjourney.
“Normal Equation”?
k
A. 1 and 2
Download Resource
B. 1 and 3.
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 15/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
C. 2 and 3.
D. 1,2 and 3.
Solution: D
Instead of gradient descent, Normal Equation can also be used to find coefficients. Refer this article
(http://eli.thegreenplace.net/2014/derivation-of-the-normal-equation-for-linear-regression/) for read more about
normal equation.
Q22. The expected value of Y is a linear function of the X(X1,X2….Xn) variables and regression line is
defined as:
Y = β0 + β1 X1 + β2 X2……+ βn Xn
1. If Xi changes by an amount ∆Xi, holding other variables constant, then the expected value of Y
changes by a proportional amount βi ∆Xi, for some constant βi (which in general could be a
positive or negative number).
2. The value of βi is always the same, regardless of values of the other X’s.
3. The total effect of the X’s on the expected value of Y is the sum of their separate effects.
A. 1 and 2 -
Your Ultimate path for Becoming
B. 1 and 3
a DATA Scientist!
C. 2 and 3 Download this learning path to start your data
science journey.
D. 1,2 and 3
Solution: D
Email Id
1. The expected value of Y is a linear function of the X variables. This means:
k
1. If X i changes by an amount ∆X i , holding other variables fixed, then the expected value of Y
changes by a proportional amount β i ∆X i , for some Download
constant Resource
β i (which in general could be a
positive or negative number).
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 16/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
2. The value of β i is always the same, regardless of values of the other X’s.
3. The total effect of the X’s on the expected value of Y is the sum of their separate effects.
2. The unexplained variations of Y are independent random variables (in particular, not “auto correlated” if the
variables are time series)
3. They all have the same variance (“homoscedasticity”).
4. They are normally distributed.
Q23. How many coefficients do you need to estimate in a simple linear regression model (One
independent variable)?
A. 1
B. 2
C. Can’t Say
Solution: B
Q24. Below graphs show two fitted regression lines (A & B) on randomly generated data. Now, I want to
find the sum of residuals in both cases A and B.
-
Note:
Your Ultimate path for Becoming
1. Scale is same in both graphs for both axis. a DATA Scientist!
Download
2. X axis is independent variable and Y-axis is dependent this learning path to start your data
variable.
science journey.
Email Id
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 17/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
D) None of these
Solution: C
Q25. If two variables are correlated, is it necessary that they have a linear relationship?
A. Yes
B. No
Solution: B
-
Q26. Correlated variables can have zero correlation coeffficient. True or False?
Your Ultimate path for Becoming
A. True a DATA Scientist!
Download this learning path to start your data
B. False science journey.
Solution: A
Email Id
k
Q27. Suppose I applied a logistic regression model on data and got training accuracy X and testing
accuracy Y. Now I want to add few new features in data. Select option(s)
DownloadwhichResource
are correct in such case.
Note: Consider remaining parameters are same.
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 18/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
A. Only 2
B. Only 1
C. Only 3
D. Only 4
Solution: A
Adding more features to model will always increase the training accuracy i.e. low bias. But testing accuracy
increases if feature is found to be significant.
Q28. The graph below represents a regression line predicting Y from X. The values on the graph shows
the residuals for each predictions value. Use this information to compute the SSE.
-
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.
Email Id
k
A. 3.02
Download Resource
B. 0.75
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 19/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
C. 1.01
D. None of these
Solution: A
SSE is the sum of the squared errors of prediction, so SSE = (-.2)^2 + (.4)^2 + (-.8)^2 + (1.3)^2 + (-.7)^2 = 3.02
Q29. Height and weight are well known to be positively correlated. Ignoring the plot scales (the variables
have been standardized), which of the two scatter plots (plot1, plot2) is more likely to be a plot showing
the values of height (Var1 – X axis) and weight (Var2 – Y axis).
A. Plot2
B. Plot1
-
C. Both Your Ultimate path for Becoming
a DATA Scientist!
D. Can’t say
Download this learning path to start your data
science journey.
Solution: A
Plot 2 is definitely a better representation of the association between height and weight. As individuals get taller,
they take up more volume, which leads to an increase in height, so a positive relationship is expected. The plot
on the right has this positive relationship while the plot on the left Email
showsIda negative relationship.
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 20/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Q30. Suppose the distribution of salaries in a company X has median $35,000, and 25th and 75th
percentiles are $21,000 and $53,000 respectively.
A. Yes
B. No
D. None of these.
Solution: C
Q31. Which of the following option is true regarding “Regression” and “Correlation” ?
C. The relationship is not symmetric between x and y in case of correlation but in case of regression it is
symmetric.
-
D. The relationship is symmetric between x and y in case of correlation but in case of regression it is not
symmetric.
Your Ultimate path for Becoming
a DATA Scientist!
Solution: D Download this learning path to start your data
science journey.
1. Correlation is a statistic metric that measures the linear association between two variables. It treats y and x
symmetrically.
2. Regression is setup to predict y from x. The relationship is not symmetric.
Email Id
Download
and
Q32. Can we calculate the skewness of variables based on mean median?Resource
k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 21/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
A. True
B. False
Solution: B
The skewness is not directly related to the relationship between the mean and median.
Q33. Suppose you have n datasets with two continuous variables (y is dependent variable and x is
independent variable). We have calculated summary statistics on these datasets. All of them give the
following result:
A. Yes
B. No
-
Your Ultimate path for Becoming
C. Can’t Say a DATA Scientist!
Download this learning path to start your data
Solutiom: C
science journey.
To answer this question, you should know about Anscombe’s quartet. Refer this link
(https://en.wikipedia.org/wiki/Anscombe's_quartet) to read more about this.
Email Id
Q34. How does number of observations influence overfitting? Choose the correct answer(s).
A. 1 and 4
B. 2 and 3
C. 1 and 3
D. None of theses
Solution: A
In particular, if we have very few observations and it’s small, then our models can rapidly overfits data. Because
we have only a few points and as we’re increasing in our model complexity like the order of the polynomial, it
becomes very easy to hit all of our observations.
On the other hand, if we have lots and lots of observations, even with really, really complex models, it is difficult
to overfit because we have dense observations across our input.
Q35. Suppose you have fitted a complex regression model on a dataset. Now, you are using Ridge
regression with tuning parameter lambda to reduce its complexity. Choose the option(s) below which
describes relationship of bias and variance with lambda.
-
A. In case of very large lambda; bias is low, variance is low
Your Ultimate path for Becoming
B. In case of very large lambda; bias is low, variance is high a DATA Scientist!
Download this learning path to start your data
C. In case of very large lambda; bias is high, variance is low science journey.
Solution: C
Email Id
k
If lambda is very large it means model is less complex. So in this case bias is high and variance in low.
Download Resource
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 23/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Q36. Suppose you have fitted a complex regression model on a dataset. Now, you are using Ridge
regression with tuning parameter lambda to reduce its complexity. Choose the option(s) below which
describes relationship of bias and variance with lambda.
Solution: B
If lambda is very small it means model is complex. So in this case bias is low and variance is high because
model will overfit the data.
A. 1 and 3
-
B. 1 and 4
Your Ultimate path for Becoming
C. 2 and 3 a DATA Scientist!
Download this learning path to start your data
D. 2 and 4 science journey.
Solution: A
Specifically, we can see that when lambda is 0, we get our least square solution. When lambda goes to infinity,
Email Id
we get very, very small coefficients approaching 0.
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 24/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Q38. Out of the three residual plots given below, which of the following represent worse model(s)
compared to others?
Note:
-
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.
Email Id
A. 1 Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 25/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
B. 2
C. 3
D. 1 and 2
Solution: C
There should not be any relationship between predicted values and residuals. If there exist any relationship
between them means model has not perfectly capture the information in data.
Q39. Which of the following method(s) does not have closed form solution for its coefficients?
A. Ridge regression
B. Lasso
D. None of both
Solution: B
The Lasso does not admit a closed-form solution. The L1-penalty makes the solution non-linear. So we need to
approximate the solution.
-
If you want to read more about closed form solutions, refer this link
Your Ultimate
(http://statweb.stanford.edu/~tibs/sta305files/Rudyregularization.pdf).
path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.
Q40. Consider the following dataset
Email Id
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 26/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Which bold point, if removed will have the largest effect on fitted regression line as shown in above
figure(dashed)?
A) a
B) b
C) c
-
Your Ultimate path for Becoming
D) d a DATA Scientist!
Download this learning path to start your data
Solution: D science journey.
Linear regression is sensitive to outliers in the data. Although c is also an outlier in given data space but it is
closed to the regression line(residual is less) so it will not affect much.
Email Id
Q41. In a simple linear regression model (One independent variable), If we change the input variable by 1
unit. How much output variable will change? Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 27/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
A: By 1
B. No change
C. By intercept
D. By its Slope
Solution: D
Equation for simple linear regression: Y=a+bx. Now if we increase the value of x by 1 then the value of y would
be a+b(x+1) i.e. value of y will get incremented by b.
Q42. Logistic Regression transforms the output probability to be in a range of [0, 1]. Which of the
following function is used by logistic regression to convert the probability in the range between [0,1].
A. Sigmoid
B. Mode
C. Square
D. Probit
Solution: A
-
Sigmoid function is used to convert output probability between [0,1] in logistic regression.
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
Q43: Which of the following statement is true about partialscience journey.
derivative of the cost functions w.r.t weights /
coefficients in linear-regression and logistic-regression?
Email Id
B. Both will be same
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 28/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Solution: B
Q44. Suppose, we are using Logistic regression model for n-class classification problem. In this case,
we can use One-vs-rest method. Choose which of the following option is true regarding this?
D. None of these.
Solution: A
If there are n classes, then n separate logistic regression has to fit, where the probability of each category is
predicted over the rest of the categories combined.
Take a example of 3-class(-1,0,1) classification. Then need to train 3 logistic regression classifiers.
1. -1 vs 0 and 1
2. 0 vs -1 and 1
3. 1 vs 0 and -1
-
Your Ultimate path for Becoming
Q45. Below are two different logistic models with differentavalues
DATAforScientist!
β0 and β1.
Download this learning path to start your data
science journey.
Email Id
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 29/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Which of the following statement(s) is true about β0 and β1 values of two logistics models (Green,
Black)?
D. Can’t Say. -
Your Ultimate path for Becoming
Solution: B
a DATA Scientist!
β0 and β1: β0 = 0, β1 = 1 is in X1 color(black) and β0 = 0, β1 =Download thiscolor
−1 is in X4 learning path to start your data
(green)
science journey.
End Notes
Email Id
I hope you enjoyed taking the test and you found the solutions helpful. The test focused on conceptual
k
knowledge of regression and its various techniques.
Download Resource
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 30/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
We tried to clear all your doubts through this article but if we have missed out on something then let me know in
comments below. If you have any suggestions or improvements you think we should make in the next skilltest, let
us know by dropping your feedback in the comments section.
You can test your skills and knowledge. Check out Live Competitions
(http://datahack.analyticsvidhya.com/contest/all) and compete with best Data Scientists from
all over the world.
You can also read this article on Analytics Vidhya's Android APP
(//play.google.com/store/apps/details?
id=com.analyticsvidhya.android&utm_source=blog_article&utm_campaign=blog&pcampaignid=MKT-Other-
global-all-co-prtnr-py-PartBadge-Mar2515-1)
Share this:
(https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=linkedin&nb=1)
(https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=facebook&nb=1)
(https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=twitter&nb=1)
(https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=pocket&nb=1)
(https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/?
share=reddit&nb=1) -
Your Ultimate path for Becoming
a DATA Scientist!
Like this:
Download this learning path to start your data
Loading... science journey.
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 31/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
(https://www.analyticsvidhya.com/blog/2017/08/skilltest-
(https://www.analyticsvidhya.com/blog/2017/07/30-
(https://www.analyticsvidhya.com/blog/
logistic-regression/) questions-to-test-a-data-scientist- for-skilltest-machine-learning-
30 Questions to test your on-linear-regression/) revealed/)
understanding of Logistic Regression 30 Questions to test a data scientist Solutions for Skilltest Machine
(https://www.analyticsvidhya.com/blo on Linear Regression [Solution: Learning : Revealed
g/2017/08/skilltest-logistic- Skilltest – Linear Regression] (https://www.analyticsvidhya.com/blo
regression/) (https://www.analyticsvidhya.com/blo g/2016/11/solution-for-skilltest-
August 3, 2017 g/2017/07/30-questions-to-test-a- machine-learning-revealed/)
In "Machine Learning" data-scientist-on-linear-regression/) November 20, 2016
July 3, 2017 In "Machine Learning"
In "Linear Regression"
Email Id
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 32/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
NEXT ARTICLE
h
PREVIOUS ARTICLE
Hadoop developer-2
(https://www.analyticsvidhya.com/blog/2016/12/hadoop-developer-2/)
(https://www.analyticsvidhya.com/blog/author/facebook_user_4/)
Ankit Gupta
(Https://Www.Analyticsvidhya.Com/Blog/Author/Facebook_user_4/)
Ankit is currently working as a data scientist at UBS who has solved complex data mining problems in
many domains. He is eager to learn more about data science and machine learning algorithms.
-
x (https://www.linkedin.com/in/ankit-gupta-84b737ba?trk=nav_responsive_tab_profile)
Your Ultimate path for Becoming
I (https://github.com/anki1909) aGDATA
(ankit.gupta968)
Scientist!
Download this learning path to start your data
science journey.
This article is quite old and you might not get a prompt response from the author. We request you to post
this comment on Analytics Vidhya's Discussion portal (https://discuss.analyticsvidhya.com/)
Email Id to get your
queries resolved
RISHABH Reply
December 19, 2016 at 5:26 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119944)
Hi Rishabh,
Think about Pearson correlation between two continuous variables. Pearson correlation will measure only linear
relationships between variables.
Refer this link to get more sense of it.
https://www.analyticsvidhya.com/blog/2015/06/correlation-common-questions/
(https://www.analyticsvidhya.com/blog/2015/06/correlation-common-questions/)
Best!
Ankit Gupta
-
KRISHNA Your Ultimate path for Becoming Reply
a DATA Scientist!
December 19, 2016 at 5:56 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119947) Download this learning path to start your data
science journey.
What is difference between continuous variable and Discrete( Categorical) variable?
How Logistic Regression used for predicting continuous variable? (Que 11)
Hi Krishna,
If a variable can take any value between its minimum and maximum value, then it is called a continuous variable.
By nature, a lot of things we deal with fall in this category: age, weight, height being some of them.
But in case of categorical variable it can take only few values. For example, a gender can be male or female.
Coming to the point. How can you use logistic regression for continuous variable prediction? Convert the
continuous variables into bin(say 10). By creating the bin of this variable; you actually created some categories.
So now you can use logistic regression.
KRISHNA Reply
December 19, 2016 at 6:19 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119952)
SUBBIAH Reply
December 22, 2016 at 2:15 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120198)
What you mean by “minimum and maximum value, then it is called a continuous variable.”? Usually a metric
quantity (such as in your example, that too measurement scale is essential) is continuous variable (Precisely
-
assuming on a uncountable set) Say age measured in years isYour
a continuous
Ultimate variable.
path When
for “maximum”
Becoming is not
known we say age lies in an infinite interval.
a DATA Scientist!
Download this learning path to start your data
science journey.
ANKIT GUPTA Reply
(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
December 22, 2016 at 5:41 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120209)
Email Id
Hi Subbiah,
Thanks for asking this question. It is really a good question. Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 35/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
A continuous variable can take infinite number of values between any defined interval. It doesn’t have to have
every possible numbers, for example, infinite. Infinite itself is not a real number.
Best!
Ankit Gupta
SUBBIAH Reply
December 22, 2016 at 11:21 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120232)
Plz explain “”can take infinite number of values between any defined interval” What do you mean by defined
interval? I do not understand the terms you are using to define a variable (or range of a variable)
I do not mean infinite as a number. There is a difference in saying finite / infinite interval in real line. For example
[0, 1] (for proportion say) is a finite interval whereas lifetime (usual example for exponential distribution) in years
has a range (0, Inf) which is an infinite interval. But both intervals are uncountable.
Precisely a reader should be able to distinguish between, finite, countable, and uncountable sets to understand a
discrete / continuous variable
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 36/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
“If n is 1 lac means it is countable set. If n is infinite means it is uncountable set.” – Do you mean that with a finite
n (say 1 lac or 10 lac) the closed (or defined) interval [7,15] becomes countable?
For Q18, Isn’t Adjusted R Square also supposed to increase or remain constant (though fairly more robust than
R Square)? Haven’t come across Adjusted R Square decrease on adding a new variable. Please explain the
theory behind it.
Hi Kenneth,
Thank you for reaching out. Please read this discussion which will give answer to your question.
Link = https://discuss.analyticsvidhya.com/t/difference-between-r-square-and-adjusted-r-square/264/2
(https://discuss.analyticsvidhya.com/t/difference-between-r-square-and-adjusted-r-square/264/2)
-
Best!
Your Ultimate path for Becoming
Ankit Gupta
a DATA Scientist!
Download this learning path to start your data
science journey.
KENNETH SINGH Reply
December 19, 2016 at 8:05 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119958)
Please explain Q22: The Beta for a particular variable is supposed to change
Email Id on adding or deleting other
variables right? I always see them change.
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 37/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Number of features is not changing. Only consider the changing in the values of other x’s variables.
But this would still lead to change in the Beta under consideration right?
Note:
1. b1 will not be affected
2. b2*x2 is remain same as before
-
Your Ultimate path for Becoming
KENNETH SINGH a DATA Scientist! Reply
December 19, 2016 at 10:56 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
Download this learning path to start your data
regression-skill-test-regression-solution/#comment-119981) science journey.
Now, if we change the value of X1, with X2 remaining the same, say:
X 1 X2 Y
10 2 7
11 4 8
569
This would change the Beta for X2, when we model Y = f(X1,X2), right?
I think that you’re right. In that particular case coefficients changed from [0.25; 0.25] to [7.3e-17; 0.5]
Yes, that’s what my understanding was in answering this question. But let Ankit give his thoughts, we could be
wrong.
KARIM LULU
December 19, 2016 at 11:50 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
-
Reply
k
regression-skill-test-regression-solution/#comment-119994)
Download Resource
Hi Karim,
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 39/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
It is due to interaction present between x1 and x2. In this question we wanted to check the linear regression
assumptions.
Best!
Ankit Gupta
Thanks for noticing it. It was assumed that features are independent of each other.
Best!
Ankit Gupta
Satisfying the no Multicollinearity condition (by having very low correlation among the Independent Variables)
would still have effect the Beta coefficient in the way discussed above right?
-
KARIM LULU Your Ultimate path for Becoming Reply
December 19, 2016 at 1:27 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
a DATA Scientist!
regression-skill-test-regression-solution/#comment-120000)
Download this learning path to start your data
science journey.
That’s clear now.
Email Id
KENNETH SINGH Reply
December 19, 2016 at 8:09 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119959)
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 40/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Best!
Ankit Gupta
I couldn’t find an answer to how correlated variables can have a zero correlation coefficient. Could you explain a
bit on this?
A non-dependency between two variable means a zero correlation. However the inverse is not true. A zero
correlation can even have a perfect dependency.
Email Id
KENNETH SINGH
k
Reply
December 19, 2016 at 10:12 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
Download Resource
regression-skill-test-regression-solution/#comment-119969)
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 41/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
SANDEEP Reply
December 19, 2016 at 10:39 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-119976)
Hi Ankit,
Can you please elaborate this point Correlation coefficient is between [-1 ,1]. So -1.09 is not possible. Is it fair to
say in a Medical or a Finical data that if there is a -ve co relation then it means something. For example in a
Clinical research if there is some Diet followed and we a -ve correlation on the Diet and Health then we can say
that as the Protein Diet Decreases Increases risk in health issues and. In Financial Data if we happen to do a
Sales Prediction and we see a -ve co relation between DV and IDV then we can Fairly says that decrease in IDV
and increase a DV
Regards
Sandeep R Diddi
ANKIT GUPTA
Download Resource k
Reply
(HTTPS://WWW.FACEBOOK.COM/APP_SCOPED_USER_ID/1107394225967405/)
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 42/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
Hi Poushali,
You can think simple linear equation as y = ax +c where c = 0.
Best!
Ankit Gupta
MUKUL Reply
December 20, 2016 at 2:15 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-120058)
Hi
Can you please explain Q 30. What all information is required to definitely call an observation as an outlier.
Hi Mukul,
-
Thanks for reaching out!
Your Ultimate path for Becoming
a DATA Scientist!
Please refer this article: https://www.analyticsvidhya.com/blog/2016/01/guide-data-exploration/
Download this learning path to start your data
(https://www.analyticsvidhya.com/blog/2016/01/guide-data-exploration/)
science journey.
Best!
Ankit Gupta
Email Id
KOMAL Reply
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 43/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
hi
i want to study regression model in details. can you please suggest any book??
also book regarding analysis in predictive modeling.
Hi Komal,
Best!
Ankit Gupta
KOMAL Reply
January 14, 2017 at 5:42 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121006)
-
Thank you.. Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
SOWMYA science journey.
Reply
January 19, 2017 at 11:46 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121277)
Hi,
Email Id
In Q40, amongst data points ‘C’ and “D’ the regression line would vary upon removing point ‘C’ I believe since
right now it looks like outliers “A”, “B” and “C” play some role in directing the regression line in the direction given
in the figure. Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 44/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
So if one of these key points, specifically “C” , since removing it can direct the regression line towards the outlier
“D” would change the direction of the line.
Your thoughts?
HI Sowmya,
Thanks for reaching out. Since, the residuals of A, B, C are very less as compare to D. So if you remove the
outlier D; It will give you maximum change in direction of regression line.
Regards,
Ankit Gupta
SOWMYA Reply
January 24, 2017 at 7:01 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121503)
Hi Ankit,
The link for Q43 seems broken. Can you provide another link where the answer for the question would be
explained better.
-
Your Ultimate path for Becoming
Thanks.
a DATA Scientist!
Download this learning path to start your data
science journey.
KARTHIK Reply
February 1, 2017 at 4:19 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121946)
ANKIT Reply
February 1, 2017 at 5:05 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-121948)
Hi Karthik,
Thanks for asking. 3rd option is incorrect because data is itself generated using 3rd-degree polynomial. It means
that if you fit 3rd-degree polynomial it will perfectly fit the data(Not only train but test also). which means it
satisfies the condition of low bias and low variance.
Regards,
Ankit Gupta
Regarding Q34.
With Fewer observation we can have bias issue. How can we have overfitting issue?
I am not getting the point. Can you please explain it?
-
MARAKHI DER Your Ultimate path for Becoming Reply
a DATA Scientist!
May 28, 2017 at 9:59 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-
skill-test-regression-solution/#comment-129370) Download this learning path to start your data
science journey.
I got the explanation thank you.
k
skill-test-regression-solution/#comment-134968)
Download Resource
which includes mathematical stuffs too.
ankit can i get questions related to all regression and ML techniques
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 46/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
[…] https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-ski…
(https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-ski…);
[…]
数据科学家45题自测:回归知识部分(有答案和解析) | 神刀安全网
(HTTP://WWW.SHELLSEC.COM/NEWS/50174.HTML)
November 10, 2017 at 9:57 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-143055)
[…] 文章原标题《45 questions to test a Data Scientist on Regression (Skill test – Regression Solution)》,作
者: ANKIT GUPTA文章为简译,更为详细的内容,请查看原文 […]
-
PRIYANKA SINGH Reply
Your Ultimate path for Becoming
February 20, 2018 at 11:16 pm (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-
regression-skill-test-regression-solution/#comment-151505) a DATA Scientist!
Download this learning path to start your data
Should we really use R^2 for evaluating regression models? Q-20
science journey.
REDDI Reply
April 6, 2018 at 12:49 am (https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-
Email Id
skill-test-regression-solution/#comment-152401)
(https://id.analyticsvidhya.com/accounts/login/?
next=https://www.analyticsvidhya.com/blog/? Join Now
utm_source=blog-subscribe&utm_medium=web)
Email Id
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 48/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
POPULAR POSTS
6 Useful Programming Languages for Data Science You Should Learn (that are not R and Python)
(https://www.analyticsvidhya.com/blog/2019/06/6-useful-programming-languages-data-science-r-python/)
24 Ultimate Data Science Projects To Boost Your Knowledge and Skills (& can be accessed freely)
(https://www.analyticsvidhya.com/blog/2018/05/24-ultimate-data-science-projects-to-boost-your-knowledge-
and-skills/)
Commonly used Machine Learning Algorithms (with Python and R Codes)
(https://www.analyticsvidhya.com/blog/2017/09/common-machine-learning-algorithms/)
-
A Complete Python Tutorial to Learn Data Science from Scratch
Your Ultimate path for Becoming
(https://www.analyticsvidhya.com/blog/2016/01/complete-tutorial-learn-data-science-python-scratch-2/)
a DATA Scientist!
7 Regression Techniques you should know! (https://www.analyticsvidhya.com/blog/2015/08/comprehensive-
guide-regression/) Download this learning path to start your data
science journey.
6 Powerful Open Source Machine Learning GitHub Repositories for Data Scientists
(https://www.analyticsvidhya.com/blog/2019/07/6-powerful-open-source-machine-learning-github-repositories-
data-scientists/)
Stock Prices Prediction Using Machine Learning and Deep Learning Techniques (with Python codes)
Email Id
(https://www.analyticsvidhya.com/blog/2018/10/predicting-stock-price-machine-learningnd-deep-learning-
techniques-python/)
Understanding Support Vector Machine algorithm from examples (along
Downloadwith code)
Resource
(https://www.analyticsvidhya.com/blog/2017/09/understaing-support-vector-machine-example-code/)
k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 49/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
-
Your Ultimate path for Becoming
RECENT POSTS a DATA Scientist!
Download this learning path to start your data
science journey.
Introduction to PyTorch-Transformers: An Incredible Library for State-of-the-Art NLP (with Python code)
(https://www.analyticsvidhya.com/blog/2019/07/pytorch-transformers-nlp-python/)
JULY 18, 2019
Heroes of Machine Learning – Top Experts and Researchers you should follow
(https://www.analyticsvidhya.com/blog/2019/07/heroes-of-machine-learning-experts-researchers/)
JULY 17, 2019
Popular Machine Learning Applications and Use Cases in our Daily Life
(https://www.analyticsvidhya.com/blog/2019/07/ultimate-list-popular-machine-learning-use-cases/)
JULY 15, 2019
-
Your Ultimate path for Becoming
(http://www.edvancer.in/certified-data-scientist-with-python-course?
a DATA Scientist!
Download this learning path to start your data
science journey.
utm_source=AV&utm_medium=AVads&utm_campaign=AVadsnonfc&utm_content=pythonavad)
Email Id
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 51/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
(https://courses.analyticsvidhya.com/courses/natural-language-
processing-nlp?utm_source=Sticky_banner1&utm_medium=display&utm_campaign=NLPcourse)
(https://datahack.analyticsvidhya.com/contest/amexpert-decipher-
Download Resource
Subscribe to emailer > k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 52/53
18/07/2019 45 questions to test a Data Scientist on Regression | Skill test
© Copyright 2013-2019 Analytics Vidhya. Privacy Policy (https://www.analyticsvidhya.com/privacy-policy/) Don't have an account? Sign up (https:/
×
-
(http://play.google.com/store/apps/details?id=com.analyticsvidhya.android)
-
Your Ultimate path for Becoming
a DATA Scientist!
Download this learning path to start your data
science journey.
Email Id
Download Resource k
https://www.analyticsvidhya.com/blog/2016/12/45-questions-to-test-a-data-scientist-on-regression-skill-test-regression-solution/ 53/53