2019
Problem Statement - Answer the following questions to the best of your knowledge, using the
concepts taught to you at this level.
1. If the variance of a variable/column is 0, what does it mean? Can we use that variable for our
analysis?
If the variance of a variable/column is zero, every individual observation (data singular) in it is the
same constant value, so there is no fluctuation in that column. A column that never varies cannot
distinguish one observation from another, so there is no point in running any analysis on that variable.
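
The zero-variance check described above can be sketched in a few lines of Python. This is only an illustration: the `constant_columns` helper, the table layout (a dict of column name to list of values) and the example data are assumptions made for the sketch, not part of the assignment.

```python
from statistics import pvariance


def constant_columns(table):
    """Return the names of columns whose population variance is exactly 0."""
    return [name for name, values in table.items() if pvariance(values) == 0]


# Hypothetical data: "country_code" holds the same value in every row.
table = {
    "age": [23, 35, 41, 29],
    "country_code": [91, 91, 91, 91],
}

print(constant_columns(table))  # ['country_code']
```

Columns reported this way are candidates for dropping before analysis. For floating-point data a small tolerance may be safer than an exact comparison with 0.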
2. Calculate mean, median, mode, variance and standard deviation for column A
A = 7,6,7,7,8,5,8,7,7,5,5
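N = 11
Mean = 72/11 ≈ 6.545
Sorted values: 5,5,5,6,7,7,7,7,7,8,8
Median = 7, the 6th (middle) value of the 11 sorted values.
Mode is the value that occurs most frequently; in this case it is 7, which appears 5 times, more often
than any other observation.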
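Variance measures how far a set of numbers is spread out from its mean.
Sum of squared deviations from the mean: ∑(x − mean)² ≈ 12.727
Population variance = 12.727/11 ≈ 1.157, so the population standard deviation ≈ 1.076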
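
The figures for column A can be checked with Python's standard `statistics` module. This is a minimal sketch. The answer does not say whether the population (divide by N) or sample (divide by N − 1) convention is intended, so both are printed; the variable names are chosen only for this illustration.

```python
from statistics import mean, median, mode, pstdev, pvariance, stdev, variance

A = [7, 6, 7, 7, 8, 5, 8, 7, 7, 5, 5]

n = len(A)
mean_a = mean(A)
# Sum of squared deviations from the mean (the numerator of the variance).
sum_sq_dev = sum((x - mean_a) ** 2 for x in A)

print(n)                       # 11
print(round(mean_a, 3))        # 6.545
print(median(A))               # 7
print(mode(A))                 # 7
print(round(sum_sq_dev, 3))    # 12.727
print(round(pvariance(A), 3))  # 1.157  (population: divide by N)
print(round(pstdev(A), 3))     # 1.076
print(round(variance(A), 3))   # 1.273  (sample: divide by N - 1)
print(round(stdev(A), 3))      # 1.128
```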
i.e., G = { g1, g2, g3, g4, g5, g6, g7, g8, g9, g10, g11, g12 }
Let the mean µ = ∑(G)/N, where G is the set of 12 individual scores and N = 12 is the number of scores.
∑(G)/N + 36/N
= ∑(G)/N + 36/12
= µ + 3
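
The result above can be checked numerically. The sketch below assumes that the 36 comes from adding a constant 3 to each of the 12 scores (12 × 3 = 36). The scores themselves are hypothetical, since the derivation holds for any 12 values.

```python
from statistics import mean

# Hypothetical scores; any 12 values would behave the same way.
G = [4, 8, 15, 16, 23, 42, 7, 9, 11, 13, 2, 6]
shift = 3  # assumed constant added to every score

shifted = [g + shift for g in G]

print(sum(shifted) - sum(G))    # 36
print(mean(shifted) - mean(G))  # 3
```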
4. Explain the difference between Data (Singular) and Data (Plural) with examples.
A single observation of a variable/column in a sample or population is termed data (singular), and
multiple observations are termed data (plural).
Data (singular) may be a nominal value, say male or female, or any other single categorical value.
Each individual value of the variables in set 1 and set 2 is data (singular), and the collection of the
five individual values of set 1 is data (plural).
Statistical measures are divided into two parts: 1. descriptive statistics and 2. inferential statistics.
Descriptive statistics provide measures of central tendency (mean, median, mode) and of dispersion
(variance, standard deviation), but to take a decision about the whole population we need inferential
statistics.
For example, to test a person's blood we need only a sample, not all the blood in the body. The
condition under which the sample is taken from an infected person also matters; otherwise the blood
test may give a wrong result. So a correct sample must have the same characteristics as the
population.
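
The point about representative samples can be illustrated with a small simulation. This is a sketch under stated assumptions: the population of 1,000 values, the sample size of 100 and the fixed seed are all made up for the illustration.

```python
import random
from statistics import mean

random.seed(0)  # fixed seed so the sketch is repeatable

# Hypothetical population of 1,000 measurements.
population = list(range(1000))

# A random sample is drawn from the whole population.
random_sample = random.sample(population, 100)
# A biased sample takes only the 100 smallest values.
biased_sample = population[:100]

pop_mean = mean(population)
print("population mean:   ", pop_mean)
print("random sample mean:", mean(random_sample))
print("biased sample mean:", mean(biased_sample))
```

The random sample's mean should land close to the population mean, while the biased sample, which does not share the population's characteristics, gives a badly wrong estimate.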