Data Analysis

Dr. Majid Majeed Akbar



Institute of Chemical Engineering and
Technology
University of the Punjab, Lahore.
Mean or Average
The mean is a single value intended to
represent a set of data or a distribution
as a whole. It is a more or less central
value around which the observations in
the set of data or distribution tend
to cluster. Such a central value is
also called a measure of central
tendency.
Mean

$\text{Mean} = \frac{\sum X}{n}$

where n is the number of measures in the
series and X stands for a score or other
measure.

Example:
Find the mean of 7, 11, 6, 10, 13, and 20.
Solution:
Mean = $\frac{7 + 11 + 6 + 10 + 13 + 20}{6} = \frac{67}{6} \approx 11.17$
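
The same calculation can be sketched in Python. This is only an illustration: the function name `mean` and the variable `scores` are chosen here and do not come from the slides.

```python
def mean(values):
    """Arithmetic mean: the sum of the measures divided by their count n."""
    if not values:
        raise ValueError("the mean of an empty series is undefined")
    return sum(values) / len(values)


scores = [7, 11, 6, 10, 13, 20]
print(round(mean(scores), 2))  # 67 / 6 rounded to two decimals: 11.17
```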

Median
The median is the numerical value separating the higher
half of a data sample, a population, or a
probability distribution from the lower half.
The median of a finite list of numbers can be
found by arranging all the observations from
lowest value to highest value and picking the
middle one (e.g., the median of {3, 3, 5, 9, 11}
is 5). If there is an even number of
observations, there is no single middle
value; the median is then usually defined to
be the mean of the two middle values (e.g., the
median of {3, 5, 7, 9} is (5 + 7) / 2 = 6).
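
The sort-and-pick-the-middle procedure described above can be sketched as follows. The function name `median` is an illustrative choice, not something the slides define.

```python
def median(values):
    """Middle value of the sorted data; mean of the two middle values if n is even."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("the median of an empty list is undefined")
    mid = n // 2
    if n % 2 == 1:
        # Odd count: a single middle observation exists.
        return ordered[mid]
    # Even count: average the two middle observations.
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median([3, 3, 5, 9, 11]))  # 5
print(median([3, 5, 7, 9]))      # 6.0
```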

Variance
Variance measures how far a set of numbers is
spread out. A variance of zero indicates
that all the values are identical. Variance
is always non-negative: a small variance
indicates that the data tend to be very
close to the mean (expected value) and
hence to each other, while a high
variance indicates that the data are very
spread out around the mean and from
each other.
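
These three cases (zero, small, and large variance) can be illustrated with a short sketch. It assumes the population form of the variance, in which the squared deviations from the mean are divided by n; the function name and the three small data sets are made up for illustration.

```python
def population_variance(values):
    """Mean of the squared deviations from the mean (divides by n)."""
    n = len(values)
    m = sum(values) / n
    return sum((x - m) ** 2 for x in values) / n


identical = [4, 4, 4, 4]   # hypothetical data: all values equal
tight = [9, 10, 10, 11]    # hypothetical data: close to the mean of 10
spread = [1, 5, 15, 19]    # hypothetical data: far from the mean of 10

print(population_variance(identical))  # 0.0
print(population_variance(tight))      # 0.5
print(population_variance(spread))     # 53.0
```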
Deviation
Deviate: to differ from a standard or mean
value.
For example, take the data set
5, 6, 9, 13, 25, 26
The mean is (5 + 6 + 9 + 13 + 25 + 26) / 6 = 84 / 6 = 14.
There can be two types of deviation:
positive and negative. The numbers 25 and
26 show a positive (+ve) deviation, whereas
the numbers 5, 6, 9, and 13 show a negative (-ve)
deviation.
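
As a rough sketch, the deviations for this data set can be computed by subtracting the mean from each value. The helper name `deviations_from_mean` is hypothetical. Note that the deviations always sum to zero, which is why they are squared before being averaged into a variance.

```python
def deviations_from_mean(values):
    """Signed difference between each value and the mean of the set."""
    m = sum(values) / len(values)
    return [x - m for x in values]


data = [5, 6, 9, 13, 25, 26]
devs = deviations_from_mean(data)

print(devs)                                       # [-9.0, -8.0, -5.0, -1.0, 11.0, 12.0]
print([x for x, d in zip(data, devs) if d > 0])   # [25, 26]
print([x for x, d in zip(data, devs) if d < 0])   # [5, 6, 9, 13]
print(sum(devs))                                  # 0.0
```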



Standard Deviation
The standard deviation is the square root of the variance of a set of
observations.
Why is it called the standard deviation?
Answer: The deviation that can predict the
highest and the lowest score of the distribution is
termed the standard deviation.
The formulas for predicting the maximum
and minimum scores are as follows:

Max. Score = M + 3 S.D.
Min. Score = M - 3 S.D.

where M is the mean and S.D. is the standard deviation.
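
The two formulas can be sketched as a small helper. The function name `predicted_score_range` is hypothetical, and the inputs below (M = 394, S.D. = 147) are taken from the worked example at the end of these slides. The bounds are best read as approximate limits: for roughly normally distributed data, nearly all observations fall within three standard deviations of the mean.

```python
def predicted_score_range(mean, sd):
    """Return (min, max) predicted scores as mean -/+ 3 standard deviations."""
    return mean - 3 * sd, mean + 3 * sd


low, high = predicted_score_range(394, 147)
print(low, high)  # -47 835
```

The observed values in that example (170 to 600) lie inside this range, as expected.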

Estimate the Mean, Variance, and
Standard Deviation for the given data:
600, 470, 170, 430, and 300.
Sol:
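Mean = $\frac{600 + 470 + 170 + 430 + 300}{5} = \frac{1970}{5} = 394$

Variance = $\frac{\sum (X - \text{Mean})^2}{n} = \frac{206^2 + 76^2 + (-224)^2 + 36^2 + (-94)^2}{5} = \frac{108{,}520}{5} = 21{,}704$

S.D. = $\sqrt{21{,}704} \approx 147$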
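
These results can be cross-checked with Python's standard `statistics` module. The stated variance of 21,704 corresponds to the population form (`pvariance`, which divides by n). The sample form (`variance`, which divides by n - 1) is shown only for contrast and gives a different value.

```python
from statistics import mean, pstdev, pvariance, variance

data = [600, 470, 170, 430, 300]

m = mean(data)               # 1970 / 5, equals 394
pop_var = pvariance(data)    # divides by n, equals 21704
pop_sd = pstdev(data)        # square root of pop_var, about 147.32
sample_var = variance(data)  # divides by n - 1, equals 27130

print(m, pop_var, round(pop_sd, 2), sample_var)
```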
