
Assignment 1-Concepts of Statistics

1. If the variance of a variable/column is 0 then what does it mean? Can we use that variable
for our analysis?

Answer: If the variance of a variable is zero, all the values in that column/variable are
the same; there is no variation in the data. We cannot use that variable for our analysis,
because a column whose values never change carries no information that distinguishes one
observation from another.

Explanation:
The standard deviation is obtained by taking the positive square root of the variance, so a
variance of zero also means a standard deviation of zero.

• The value of standard deviation tells how closely the values of a data set are clustered
around the mean.
• A lower value of standard deviation for a data set indicates that the values of that
data set are spread over a relatively smaller range around the mean.
• A larger value of the standard deviation indicates that the values of that data set are
spread over a relatively larger range around the mean.
• A zero value of the standard deviation indicates that all the values of that data set are
the same. There is no variation in the data, so no analysis can be drawn from it.
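
As a quick check of the point above, the following minimal sketch flags a column whose sample variance is zero. The two lists and the helper name `is_constant` are made up for illustration; they are not part of the assignment data.

```python
import statistics

# Hypothetical columns, invented for this illustration.
constant_column = [4, 4, 4, 4, 4]   # every value is identical
varying_column = [4, 5, 4, 6, 3]    # values differ from one another


def is_constant(values):
    """Return True when the sample variance of the values is exactly zero."""
    return statistics.variance(values) == 0


print(is_constant(constant_column))  # True: nothing to analyse in this column
print(is_constant(varying_column))   # False: this column carries variation
```

A column that passes this check would normally be dropped before analysis, since it cannot help explain differences between observations.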

2. Calculate mean, median, mode, variance and standard deviation for column A.

Answer:

A
7
6
7
7
8
5
8
7
7
5
5
Total=72
• Mean
The mean is the sum of all observed values in the sample divided by the number of observations.

Column A Mean = (7+6+7+7+8+5+8+7+7+5+5)/11 = 72/11 ≈ 6.54545

• Median
The median is the middle value when the data values have been sorted in increasing order.

A (in increasing order)
5
5
5
6
7
7
7
7
7
8
8

Median position = (N+1)/2 th value
= (11+1)/2
= 12/2
= 6th value

Median = 7

• Mode
The mode of a data set is the value with the highest frequency.
Column A Mode = 7 (7 occurs five times)
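
The three measures of central tendency worked out by hand above can be cross-checked with Python's standard `statistics` module. This is only a verification sketch; the variable names are chosen here for illustration.

```python
import statistics

# Column A from the assignment.
column_a = [7, 6, 7, 7, 8, 5, 8, 7, 7, 5, 5]

mean = statistics.mean(column_a)      # sum of values / number of values = 72 / 11
median = statistics.median(column_a)  # middle (6th) value of the sorted list
mode = statistics.mode(column_a)      # most frequent value

print(round(mean, 5), median, mode)   # 6.54545 7 7
```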

• Variance
1. Calculate the mean.
2. Make a table that subtracts the mean from each observed value.
3. Square each of the differences.
4. Add up the squared differences.
5. Divide the total by N-1, where N is the number of items in the sample. The result is the
sample variance.
A      Mean    X - Mean    (X - Mean)^2
7      6.54      0.46        0.2116
6      6.54     -0.54        0.2916
7      6.54      0.46        0.2116
7      6.54      0.46        0.2116
8      6.54      1.46        2.1316
5      6.54     -1.54        2.3716
8      6.54      1.46        2.1316
7      6.54      0.46        0.2116
7      6.54      0.46        0.2116
5      6.54     -1.54        2.3716
5      6.54     -1.54        2.3716
Total = 72                   Total = 12.7276

Variance = 12.7276/(11-1)

= 12.7276/10

Variance = 1.27276

• Standard Deviation: To get the standard deviation, we take the positive square root of the
variance.
√1.27276 ≈ 1.12817
Standard Deviation ≈ 1.12817
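
The five variance steps and the square-root step can be sketched in Python as follows. The sketch uses the unrounded mean (72/11), so its results differ from the hand calculation in the fifth decimal place, because the table rounds the mean to 6.54. Variable names are chosen here for illustration.

```python
import math
import statistics

# Column A from the assignment.
column_a = [7, 6, 7, 7, 8, 5, 8, 7, 7, 5, 5]

n = len(column_a)
mean = sum(column_a) / n                               # step 1: the mean
deviations = [x - mean for x in column_a]              # step 2: X - Mean
squared_deviations = [d ** 2 for d in deviations]      # step 3: square each difference
total = sum(squared_deviations)                        # step 4: add them up
variance = total / (n - 1)                             # step 5: divide by N - 1
std_dev = math.sqrt(variance)                          # positive square root

print(round(variance, 5), round(std_dev, 5))           # 1.27273 1.12815

# The standard library uses the same N - 1 divisor for sample statistics.
assert math.isclose(variance, statistics.variance(column_a))
assert math.isclose(std_dev, statistics.stdev(column_a))
```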

3. In a group of 12 scores, the largest score is increased by 36 points. What effect will this have
on the mean of the scores?

Answer:

In a group of 12 scores, if the largest score is increased by 36 points, the mean of the
scores increases by 36/12 = 3 points.

Explanation:

Let the original mean = X,

number of scores = N = 12,

total of the group's scores = A.

X = A/N = A/12

Increasing the largest score by 36 raises the total to A + 36, so:

New mean = (A + 36)/12

= A/12 + 36/12

= X + 3.

The mean score is increased by 3.
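
The result can be checked numerically. The 12 scores below are invented for illustration (the assignment gives no actual scores); any 12 values would show the same shift of 3.

```python
# Hypothetical group of 12 scores, made up for this check.
scores = [50, 55, 60, 62, 65, 68, 70, 72, 78, 80, 85, 95]

old_mean = sum(scores) / len(scores)

# Raise the largest score by 36 points, leaving the other scores unchanged.
new_scores = scores.copy()
largest_index = new_scores.index(max(new_scores))
new_scores[largest_index] += 36

new_mean = sum(new_scores) / len(new_scores)

print(old_mean, new_mean, new_mean - old_mean)  # 70.0 73.0 3.0
```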

4. Explain the difference between Data (Singular) and Data (Plural) with examples?

Data (Singular)                            Data (Plural)

The value of the variable associated       The set of values collected for the
with one element of a population or        variable from each of the elements
sample. This value may be a number, a      belonging to the sample.
word, or a symbol.

For example: X = 5                         For example: Y = {2, 4, 6, 8}

A single value of a variable is data       The set of values collected for the
(singular).                                variable from each of the elements
                                           belonging to the sample is data
                                           (plural).

5. How does inferential statistics help in making decisions?

A major portion of statistics deals with making decisions, inferences, predictions and
forecasts about a population based on results obtained from a sample. The area of statistics
that deals with such decision-making procedures is referred to as inferential statistics.

We use a representative sample (a sample that contains the characteristics of the different
groups in the population) to draw conclusions or generalize about the population.
Inferential statistics is useful when we cannot access the entire population that we want to
investigate and have only limited data from it, yet still need to draw conclusions about the
population as a whole.

For example, suppose we want to find out whether people prefer Narendra Modi or Rahul Gandhi.
We cannot survey the entire country, so we survey a random sample of people from different
parts of India and then try to draw inferences for the population.
Because we generalize the conclusion to a larger population, our prediction can contain
error, so we need to be careful to select a truly random sample. If the sample is not truly
random, the results can be skewed.

To illustrate, if most of the people we survey are BJP supporters, they are going to name
Narendra Modi, and if we ask Congress supporters, they are going to name Rahul Gandhi.

That is why we need a truly random sample, and this is how inferential statistics helps us
make decisions about a population from limited data.
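
The effect of a non-random sample can be illustrated with a small simulation. Everything in it is an assumption made for illustration: the population size, the two groups, the neutral candidate labels "X" and "Y", and the 90/10 preference split within each group. It is a sketch of the idea, not survey data.

```python
import random

rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Hypothetical population of (group, preferred candidate) pairs.
# Group 1 mostly prefers X, group 2 mostly prefers Y; overall the split is 50/50.
population = (
    [("group_1", "X")] * 4500 + [("group_1", "Y")] * 500
    + [("group_2", "X")] * 500 + [("group_2", "Y")] * 4500
)


def share_for_x(sample):
    """Fraction of the sample that prefers candidate X."""
    return sum(1 for _, choice in sample if choice == "X") / len(sample)


true_share = share_for_x(population)  # 0.5 by construction

# A simple random sample drawn from the whole population.
random_sample = rng.sample(population, 1000)

# A biased sample drawn only from group 1.
group_1_only = [person for person in population if person[0] == "group_1"]
biased_sample = rng.sample(group_1_only, 1000)

print("true share for X:      ", true_share)
print("random-sample estimate:", share_for_x(random_sample))
print("biased-sample estimate:", share_for_x(biased_sample))
```

The random-sample estimate should land close to the true share, while the estimate from the single-group sample should sit near that group's own preference rate, which is the skew described above.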
