Академический Документы
Профессиональный Документы
Культура Документы
Effectively describe a single variable through measures of dispersion, central tendency and graphical depiction
Explain the nature of association - how it can be measured and the relationship between pairs of variables Understand the dangers of sampling error and the limitations of statistical data and its analysis
Produce univariate and bivariate statistical analysis of any given data set
Produce graphical depictions of any given data set
Edit and transfer computer output to word documents for the production of professional reports
Assessment
Suggested reading
Field, A. (2009). Discovering statistics using SPSS. Sage: London.
To answer research questions To test predictions To understand and explain the world in an efficient way (1 + 1 + 1 + 1 + 1) To build models we can use to make inferences/predictions about the world
Basic Terms
Measurement Data Variable the word that describes what has been measured, e.g. weight, gender, happiness Sample phenomena from which data has been collected Population all possible data if infinite time and access was available Experiment
Measurement
Assignment of a number to something
To establish relationships between variables, researchers must observe the variables and record their observations. This requires that the variables be measured.
There are lots of different types of data each has certain rules attached. For example, a person can only be living or dead, an Olympic medal winners can only be First, Second or Third, but length can be almost number
Data
The measurements obtained in a research study are called the data (plural of datum). The goal of statistics is to help researchers organize and interpret data.
Variables
A variable is a characteristic or condition that can change or take on different values. Most research begins with a general question about the relationship between two variables for a specific group of individuals, e.g. eating breakfast and concentration
Often we are interested in one particular variable (we call this the dependent variable) and how other variables (independent variables) are related to it.
How are crime rates in a city (DV) affected by unemployment rates (IV) What is the relationship between childhood victimisation (IV) and offending (DV)
There is no rule about which should be the DV - it depends on what we are interested in as researchers, but the outcome is usually a criminological one
Dependent = Outcome
Independent = Predictor
Population
The entire group of individuals is called the population. For example, a researcher may be interested in the relation between experience of bullying (variable 1) and academic performance (variable 2) for the population of Year 7 children in Hong Kong.
Sample
Usually populations are so large that a researcher cannot examine the entire group. Therefore, a sample is selected to represent the population in a research study. The goal is to use the results obtained from the sample to help make inferences about the population. It is likely to be impossible to measure these variables for every Year 7 child in Hong Kong, so we obtain a sample of 300 Year 7 children that we think share the same characteristics of the population.
Levels of measurement
There are many ways to measure (assign numbers) variables. The way in which we do this depends upon the characteristics of the variable.
We cannot measure gender on a scale of 1 to 10, there are a limited number of options (categories)
We cannot rank colours, red is not more than blue
We may wish to differentiate between lengths or scores, 250metres is 100m longer than 150m and it is twice as long as 125m
Different levels of measurement require different types of statistics, so it is very important that we can identify the correct level of measurement
Central tendency
Dispersion
The Mean
The mean is the sum of all scores divided by the number of scores.
Mean ( X )
xi
n
x
n
xi
n
The mean sometimes fails to tell us what is typical, e.g. Almost everyone has more than the average number of feet
Blastland, M. & Dilnot, A. (2008). The tiger that isnt. London: Profile Books.
Sometimes we want to know about the middle - Average Joe - Middle England
The middle is not the average/mean, but the case where 50% are higher and 50% are lower. This is the median (50th percentile)
The Median
Median The middle score when scores are ordered 22, 40, 53, 57, 93, 98, 103, 108, 116, 121, 252 (n=11)
n 1 11 1 6th 98 2 2
22, 40, 53, 57, 93, 98, 103, 108, 116, 121 (n=10)
10 1 93 98 5.5th 95.5 2 2
The median is not influenced by large sample values and is a better measure of centrality if the distribution is skewed
It is possible to examine data by eye-ball alone, but it is easier to use graphical representations of data to look for extreme values. We will do this in the next unit
Multimodal
Having several modes
Summary
Three main levels of measurement Nominal Ordinal Interval/Ratio Measures of central tendency (e.g. mean, median) are simple tools for modelling/describing data, but they create a very simplified description of data often this is too simplified. In the next lecture, we will cover this issue.