Академический Документы
Профессиональный Документы
Культура Документы
Session 1
explain the foundational principles and concepts that are common to all
statistical methods,
identify appropriate uses of the statistical methods, including their strengths
and limitations,
and apply each of the concepts to practical situations in order to make
appropriate conclusions
Company Confidential
Agenda
I. Introduction What is Statistics?
A. Terms and Definitions
a. Types of Data
b. Population and Sample
II. Distributions & Summary Statistics
A. Describing the distribution:
a. Shape
1. Symmetry
2. Skewness
3. Modality
b. Center
1. Average\Mean
2. Median
3. Mode
c. Spread
1. Percentile, Deciles and Quartiles
2. Range
3. Inter-quartile Range
4. Standard Deviation
Company Confidential
What is Statistics?
Company Confidential
Data Types
For the purposes of our training, we will consider three types of data:
Company Confidential
Data Type Conversion
The tenure (in months) of Makati Analysts:
44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15, 49, 66, 49, 21, 110, 23,
50, 48, 50, 47, 45
Ordinal Nominal
Company Confidential
Where your data comes from
There are two possible sources of data.
Company Confidential
Data Distribution
Pattern
Company Confidential
Describing a Distribution
We describe a distribution by its
1. Shape usually described by
Symmetry
Modality
Outliers
2. Center refers to the measure of the
middle or expected value of the
data set
Mean
Median
Mode
3. Spread also called variation,
denotes variability in a distribution
Percentile, Decile, Quartile
Range
Interquartile Range
Standard Deviation and Variance
Company Confidential
Shape: Symmetry
Symmetric
Left and right side of the center are mirror images of each other.
Skewed
Skewed to the Right\Positively Skewed Long tail to the right
Skewed to the Left\Negatively Skewed Long tail to the left
Company Confidential
Shape: Modality
Modality
Refers to the number of peaks in a dataset.
Mode is the most frequent value in a dataset
Company Confidential
Outliers
Outliers
Observations that deviate markedly from the rest of the data
Could result from special causes; may indicate bad data
Company Confidential
Shape of the CSAT Data
CSAT Scores:
80, 70, 83, 68, 100, 61, 67, 86, 89, 75, 75, 79, 40, 77, 53, 74, 86,
71, 60, 62, 64, 80, 73, 68, 69, 83, 72, 79, 71, 82, 86, 100, 100,
80, 88
Company Confidential
Measures of Central Tendency
Mean, Median, Mode
Mean
When used without specification, mean refers to the arithmetic
average of a data set.
Company Confidential
Measures of Central Tendency
Mean, Median, Mode
Median
The median is a different kind of average. It is the value that is greater
than or equal to half of the values in the data set. It is the middle point of
the data set.
Company Confidential
Measures of Central Tendency
Mean, Median, Mode
Mode
Company Confidential
Measures of Central Tendency
Mean, Median, Mode
Mode
Some guiding principles:
If the mean > median, we have a positively-skewed distribution
If the mean < median, we have a negatively-skewed distribution
If the mean median, symmetrical distribution
Company Confidential
Measures of Central Tendency
Mean, Median, Mode
Company Confidential