Академический Документы
Профессиональный Документы
Культура Документы
WEEK 6
MODULE 4:
Section 2
Part I. Mathematics as a Tool
Chapter 4. Data Management
Overview
Data Management is the whole process of dealing with data from the very
beginning of the study with data analysis as the last part of it. It is actually
divided into three phases with phase 1 being the preparation of data entry that
includes review of questionnaire forms, coding, preparation of master sheets or
spread sheets, dummy tables and quality control; phase 2 is the data entry and
the 3rd and last phase as mentioned is the data analysis.
Prepared by Lhalili
GE04 1
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
*slide taken from the 2nd generation training on MMW, Mapua Institute of
Technology (2017)
Study Guide
Learning Outcomes
LO1: Use variety of statistical tools to process and manage numerical data.
LO3: Determine the range of probability values and find the probability of an event.
LO6: Use the methods of linear regression and correlations to predict the value of a variable
given certain conditions.
Prepared by Lhalili
GE04 2
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
PART 1 of Module 4
Statistics has both a plural and singular sense. In its plural sense, statistics
refers to numerical facts that are systematically collected and analyzed. In its singular
sense, statistics refers to the scientific discipline consisting of theory and methods for
processing numerical information that one can use when making decisions in the face
of uncertainty. The recognition of uncertainty and the importance of statistical
activities are likely to be as old as civilization itself. Even before the art of counting
was perfected, there is evidence to suggest that herdsmen were putting notches on
trees to keep track of their cattle. In its plural and singular sense, the term Statistics
refers to quantities computed from numerical information (Philippine Statistical
Association, 2008).
As such, statisticians are involved with methods of data collection, data
summarization, and data analyses, as well as communicating the results of its
analyses.
Areas of Statistics
1. Descriptive Statistics are methods concerned w/ collecting, describing, and
analyzing a set of data without drawing conclusions (or inferences) about a
large group.
2. Descriptive Statistics are methods concerned w/ collecting, describing, and
analyzing a set of data without drawing conclusions (or inferences) about a
large group.
Statistical methods have two broad aims: (a) to describe, and (b) to infer. In
the first case, the main task is that of data organization and presentation (without
drawing conclusions or inferences beyond the data). These tools are called
descriptive statistical methods. In the second case, the task is to generalize results
beyond the data collected provided that the data collected is a part (sample) of a large
set of items (population). In this case, the statistical analysis required is inferential
statistical methods.
Prepared by Lhalili
GE04 3
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
Prepared by Lhalili
GE04 4
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
Examples
Definition of Terms
The statistical population is the set of all possible values of the variable.
Measurement is the process of determining the value or label of the variable based
on what has been observed.
Prepared by Lhalili
GE04 5
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
The building blocks of statistical science are data. They come in diverse
range of formats and each type gives us a unique type of information. Data
represent the measured value of variables.
Types of Variables
The thing you are trying to explain or predict is most often called the dependent
variable (DV). It depends on others, which are called independent variables (IV). An
independent variable can sometimes be thought of as a cause and a dependent
variable as an effect.
Prepared by Lhalili
GE04 6
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
Examples:
1. An experimenter might compare 4 types of antidepressants and seeks to
determine the effect of the antidepressant on relief from depression.
IV: antidepressants
DV: relief from depression
2. Can blueberries slow down aging? A study indicates that antioxidants found
in blueberries may slow down the process of aging. In this study, 19-month-
old rats (equivalent to 60-year-old humans) were fed either their standard
diet or a diet supplemented by either blueberry, strawberry or spinach
powder. After 8 weeks, the rats were given memory and motor skills tests.
Although all supplemented with blueberry powder showed the most notable
improvement.
Variables in General
Examples:
Discrete: Number of apples in a box, Number of Students who are absent, COVID19
cases in the Philippines
Prepared by Lhalili
GE04 7
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
A. DATA COLLECTION
The population of a study is all of the individuals, items or units relevant to the
study. It comprises individuals, groups, organizations, documents, campaigns,
incidents and so on. It is also called the “universe”. Samples are subsets of the
population selected to represent the population.
Example #2: A substitute teacher wants to know how students in the class did on
their last test. The teacher asks the 10 students sitting in the front row to state their
latest test score. He concludes from their report that the class did extremely well.
Levels of Measurement
The measurement process is an integral part of data collection. If the unit of analysis
is an individual person, many characteristics of that person, some visible and other
invisible, can be measured. Visible characteristics include sex, skin color, age,
height, weight, eye color and hair color. Invisible characteristics include intelligence,
prejudice, authoritarianism, alienation, paranoia, love and hate. Measurement is the
assignment of numbers to objects or events according to a predetermined set of
rules. To measure a property means to assign numbers to units as a way of
representing that property.
The kind of analysis that one can perform on the available data critically depends on
its scale of measurement or level of measurement.
1. Nominal (for categorical data) – one simply names or categorizes responses. The
nominal scale is the simplest scale of measurement for variables where a value or
unit of data is assigned to one of at least two distinct and exhaustive categories.
Each category is given some name but no assumptions are made about
relationships among the categories.
Examples of the nominal scale include sex, employment status, race, marital status,
religious affiliation, language spoken at home and race.
2. Ordinal (for ranked data) – allow comparisons of the degree to which two subjects
possess the dependent variable. It also tells when one unit has more of the property
than does another unit. The ordinal scale specifies the relative positive of items with
Prepared by Lhalili
GE04 8
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
respect to a given characteristic. The ordinal scale is like the nominal scale in that it
consists of mutually exclusive and exhaustive categories. Ordinal scaling is very
common in obtaining attitude scaling used in social research.
Examples of the ordinal scale include employee ratings, salary grade, IQ, score in an
authoritarian personality test, score in a personality/beauty test.
3. Interval (for measurement data) – numerical scales in which intervals have the
same interpretation throughout. It also tells us that one unit differs by a certain
amount of the property from another unit. The interval scale possesses the
properties of the nominal and ordinal scale with the additional property of equal
intervals between rank-ordered items. Consider the IQ of a person, we can tell not
only which person ranks higher in IQ but also how much higher he or she ranks with
another (how many units).
Examples of the interval scale include IQ levels, Aptitude Tests and Temperature.
One unit’s difference is the same regardless of their position in the scale.
Note, the interval scale allows addition and subtraction operations, but it does not
possess an absolute zero.
4. Ratio (for measurement data) – an interval scale with the additional property that
its zero position indicates the absence of the quantity being measured. It also tells
us that one unit has so many times as much of the property as does another unit.
The ratio scale possesses an absolute, fixed zero point and allows all arithmetic
operations. The existence of the zero point is the only difference between ratio and
interval measurement.
Examples of the ratio scale include measurements of heights, weights, and ages.
In summary:
Prepared by Lhalili
GE04 9
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
B. DATA PRESENTATION
Big data hardly give information that can be of help in making decisions fast.
Management of modern business or any organization demand fast and accurate
decisions for they can be ruined by other competitors who are knowledgeable in
summarizing and interpreting large mass of data.
Definition of Terms
Raw data are data in their original form just as they were collected.
Constants are quantities that do not change under the same condition.
Variables are quantities that change over time and in some location.
Variates are the actual values of the variable.
Tabular Presentation
Prepared by Lhalili
GE04 10
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
Stem and Leaf Display or a Stem and Leaf Plot is a method of graphically presenting
quantitative data likened to a histogram that helps in the visualization of the shape
of the distribution.
10 20 50 23 21 12 13 15 24 25 26 23
24 25 28 24 56 20 10 32 30 31 13 25
65 45 51 42 35 65 36 28 35 40 60
Solution:
Array of scores
10 10 12 13 13 15 20 20 21 23 23 24
24 24 25 25 25 26 28 28 30 31 32 35
35 36 40 42 45 50 51 56 60 65 65
Prepared by Lhalili
GE04 11
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
1. Range = 65 – 10 = 55
2. 𝑛 = 1 + 3.322 𝐿𝑜𝑔𝑁 = 1 + 3.322 𝐿𝑜𝑔35 = 6.13 ≈ 6
55
3. 𝑖 = 6 = 9.17 ≈ 9
4. lower limit of the lowest class interval: 10 (lowest score)
upper limit of the lowest class interval: 10 + (6 − 1) = 15
Graphical Presentation
1. Line Graph
2. Bar Graph
3. Circle Graph or Pie Graph
4. Pictograph or Picture Graph
5. Scatter Plots or Scatter gram
6. Statistical maps
Prepared by Lhalili
GE04 12
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
Examples
Prepared by Lhalili
GE04 13
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
Prepared by Lhalili
GE04 14
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
Example
The following scores represent the final examination grade for an elementary
statistics course:
22 60 49 32 57 74 52 70 82 36
80 77 81 95 41 65 92 85 55 76
52 10 64 75 78 25 80 98 81 67
41 71 83 54 64 72 88 62 74 43
60 78 89 76 84 48 84 90 15 79
34 67 17 82 69 74 63 80 85 61
Solution:
1. FDT
a. Range = 98 – 10 = 88
b. 𝑛 = 1 + 3.322𝐿𝑜𝑔60 = 6.9~7
88
c. 𝑖 = 7 = 12.57~13
d. 10 + (13 − 1) = 22
Prepared by Lhalili
GE04 15
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
Prepared by Lhalili
GE04 16
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
2.
Prepared by Lhalili
GE04 17
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
Assessment
ASSIGNMENT: The following data give the hours worked last week by employees
of a company.
42 48 42 45 53 34 40 23 21 38
51 40 35 42 31 47 48 34 40 40
16 27 36 39 39 51 41 43 40 36
25 27 57 28 52 45 25 26 39 41
60 52 54 35 27 46 20 22 36 30
25 62 20 52 41 33 65 63 25 26
Date of submission: November 28, 2020 @ 11:59PM via Google Forms to be sent
as an attached file. All answers should be handwritten and submit in the form of
image/s.
Prepared by Lhalili
GE04 18
RIZAL TECHNOLOGICAL UNIVERSITY
Cities of Mandaluyong and Pasig
References
E. M. Adina & R. T. Earnhart. Mathematics in the Modern World Second Generation
Training. Mapua Institute of Technology. 2017
Training on Teaching Basic Statistics for Tertiary Level Teachers Summer 2008
Elementary Statistics: A Handbook of Slide Presentation prepared by Z.V.J. Albacea,
C. E. Reano, R. V. Collado, L. N. Cornia and N. A. Tandang. Institute of Statistics, CAS,
UP Los Baños, Laguna. 2005
Foster, Garett C.; Lane, David; Scott, David; Hebl, Mikki; Guerra, Rudy; Osherson,
Dan; and Zimmer, Heidi, "An Introduction to Psychological Statistics" (2018). Open
Educational Resources Collection.
Prepared by Lhalili
GE04 19