Академический Документы
Профессиональный Документы
Культура Документы
HEA 604
E-Portfolio- Part 3
1. Example of Descriptive Using the SPSS Dataset YRBS 2011 HS Survey
Q: What percentage of students reported gambling on a sports team, gambled when playing
cards or playing a dice game, played the lotto or scratch off tickets, gambled on the internet,
or bet on game of personal skill such as pool or a video game is the last 12 months?
The level of measurement is Ordinal (1= 0 times, 2=1 or 2 times, 3= 3 to 9 times, 4= 10 to
19 times, 5= 20-39 times, 6=40 or more times), therefore we can use mode or median.
The output we can find the valid percent for each category. The output tells us that there
are 72.8 % students who reported that they did not gambled in the past 12 months, 14.8%
gambled 1 or 2 times, 7.3 % gambled 3 to 9 times, 2.6 % gambled 10 to 19 times, 0.8 %
gambled 20 to 39 times, and 1.5 % gambled 40 or more times.
The number of participants (N= 2,275)
Median=1.00 The middle point in the distribution.
Mode= 1 The most common response, which means 72.8% or 1,656 responded 1
(gambled 0 times)
Skewness= 2.534 - Positive skewed distribution because tail towards positive numbers. The
skewness helps to determine that the variable is not normally distributed. Looking at the
graph we determine the distribution is positively skewed.
To obtain the graph for times gambled, we go to "Analyze"Descriptive
StatisticsFrequencies Charts option. Then select histograms.
Crosstabs
Case Processing Summary
Cases
Valid
N
During the past 12 months,
have you ever been bullied
2351
on school property * What is
your sex
Percent
Missing
N
Percent
Total
N
Percent
99.2%
20
0.8%
2371
100.0%
During the past 12 months, have you ever been bullied on school property * What is your sex
Crosstabulation
What is your sex
Female
During the past 12 months, yes
Count
Total
Male
Total
241
179
420
Expected Count
225.1
194.9
420.0
19.1%
16.4%
17.9%
1019
912
1931
Expected Count
1034.9
896.1
1931.0
80.9%
83.6%
82.1%
1260
1091
2351
Expected Count
1260.0
1091.0
2351.0
100.0%
100.0%
100.0%
Count
Count
Looking at the Crosstabulation table we determine the percent of girls bullied on school
property=19.1% and percent of boys bullied on school property=16.4%
Chi-Square Tests
Value
2.948a
2.766
2.959
df
1
1
1
Pearson Chi-Square
Continuity Correctionb
Likelihood Ratio
Fisher's Exact Test
.094
Linear-by-Linear
2.947
1
.086
Association
N of Valid Cases
2351
a. 0 cells (0.0%) have expected count less than 5. The minimum expected count is 194.90.
b. Computed only for a 2x2 table
.048
The chi-square value for this analysis is 2.948 (under Pearson Chi-Square under value
column)
The P-value is .086 under the Asymp. Sig. (2-sided) column. The value of p is greater than
.05 so we determine there is a not a statistically significant relationship between gender
and being bullied on school property.
The Observed and Expected count are similar (241 vs. 225 for Female and 179 vs. 195 for
Male). This suggest that there is no relationship of either gender being bullied on school
property.
3. Example of One Way ANOVA using APP 1994 Subset for Class R (1) Data set
Q: Are there differences between the size of community unity where students grew up (Q11)
and condom use (Q56)?
To conduct a one-way ANOVA we go to Analyze Compare Means One-way ANOVA
Dependent List condom use and Factor Community
Select Options button and select Homogeneity of Variance
Select post Hoc tab and select Tukey test and Tamhanes test Continue, then Ok.
Independent Variable- Community unity where students grew up (6 groups)
Dependent Variable - Condom Use Ordinal (4 groups)
Note: Although, One Way ANOVA requires Interval or Ratio, I am using as DV= Condom
(Ordinal) for illustration purposes only.
ANOVA
Condom Use (e.g., I use a condom when I have sex; Mean of i1,i3,i17,i21)
Sum of Squares
Between Groups
df
Mean Square
9.742
1.948
Within Groups
1187.142
1645
.722
Total
1196.884
1650
Sig.
2.700
.019
We can observe the mean for each of the communities where students grew up
Confidence Intervals for the means of each group (in the descriptive table) 95% chance that
true population mean for rural is between 2.46 and 2.82, Small town 2.78 and 2.99,
moderately sized city between 2.77 and 2.95, suburb of large town 2.80 and 2.95, Large
urban city 2.90 and 3.07 and other between 2.48 and 3.35.
Since F= 2.700 and P< .05, we want to examine Tahame (bottom numbers . Equal variance
not assumed. We can observe that is significant in particular rural vs. large urban city
category. Overall, we can answer the questions and say that there are differences among
the groups and condom use.
df1
df2
5
1645
Sig.
.771
Upper Bound
.0549
.0688
.0478
-.0609
.3507
.5580
.2278
.2016
.0962
.5647
.5203
SIZED CITY
SMALL TOWN
SUBURB OF LARGE TOWN
LARGE URBAN CITY
OTHER
SUBURB OF RURAL
LARGE TOWN SMALL TOWN
MODERATELY SIZED CITY
LARGE URBAN CITY
OTHER
LARGE
RURAL
URBAN CITY SMALL TOWN
MODERATELY SIZED CITY
SUBURB OF LARGE TOWN
OTHER
OTHER
RURAL
SMALL TOWN
MODERATELY SIZED CITY
SUBURB OF LARGE TOWN
LARGE URBAN CITY
Tamha RURAL
SMALL TOWN
ne
MODERATELY SIZED CITY
SUBURB OF LARGE TOWN
LARGE URBAN CITY
OTHER
SMALL TOWN RURAL
MODERATELY SIZED CITY
SUBURB OF LARGE TOWN
LARGE URBAN CITY
OTHER
MODERATELY RURAL
SIZED CITY
SMALL TOWN
SUBURB OF LARGE TOWN
LARGE URBAN CITY
OTHER
SUBURB OF RURAL
LARGE TOWN SMALL TOWN
MODERATELY SIZED CITY
LARGE URBAN CITY
OTHER
LARGE
RURAL
URBAN CITY SMALL TOWN
MODERATELY SIZED CITY
SUBURB OF LARGE TOWN
OTHER
OTHER
RURAL
SMALL TOWN
MODERATELY SIZED CITY
SUBURB OF LARGE TOWN
LARGE URBAN CITY
*. The mean difference is significant at the 0.05 level.
-.02580
-.01050
-.12552
-.05301
.23627
-.01529
.01050
-.11501
-.04250
.35128*
.09972
.12552
.11501
.07251
.27877
.02721
.05301
.04250
-.07251
-.25156
-.22576
-.23627
-.35128*
-.27877
.25156
.02580
.01529
-.09972
-.02721
.22576
-.02580
-.01050
-.12552
-.05301
.23627
-.01529
.01050
-.11501
-.04250
.35128*
.09972
.12552
.11501
.07251
.27877
.02721
.05301
.04250
-.07251
.07081
.05820
.06194
.20533
.09955
.06531
.05820
.05557
.20350
.10178
.06866
.06194
.05557
.20460
.22064
.20746
.20533
.20350
.20460
.10423
.10106
.09686
.09858
.22657
.10423
.07082
.06469
.06723
.21479
.10106
.07082
.05943
.06219
.21326
.09686
.06469
.05943
.05512
.21131
.09858
.06723
.06219
.05512
.21210
.22657
.21479
.21326
.21131
.21210
.999
1.000
.328
1.000
.166
1.000
1.000
.304
1.000
.008
.695
.328
.304
.999
.805
1.000
1.000
1.000
.999
.227
.338
.218
.008
.980
.227
1.000
1.000
.893
1.000
.338
1.000
1.000
.490
1.000
.218
1.000
1.000
.434
1.000
.008
.893
.490
.434
1.000
.980
1.000
1.000
1.000
1.000
-.2278
-.1766
-.3022
-.6388
-.0478
-.2016
-.1555
-.2736
-.6231
.0609
-.0962
-.0512
-.0435
-.5112
-.3507
-.5647
-.5328
-.5381
-.6562
-.5618
-.5271
-.5260
-.6457
-1.0157
-.0587
-.1825
-.1750
-.2975
-.7438
-.0756
-.2341
-.1851
-.3082
-.7674
-.0535
-.2056
-.1641
-.2768
-.7541
.0568
-.0981
-.0572
-.0468
-.6402
-.4582
-.6894
-.6613
-.6691
-.7852
.1762
.1555
.0512
.5328
.5203
.1710
.1766
.0435
.5381
.6417
.2956
.3022
.2736
.6562
.9083
.6191
.6388
.6231
.5112
.0587
.0756
.0535
-.0568
.4582
.5618
.2341
.2056
.0981
.6894
.5271
.1825
.1641
.0572
.6613
.5260
.1750
.1851
.0468
.6691
.6457
.2975
.3082
.2768
.7852
1.0157
.7438
.7674
.7541
.6402
Homogeneous Subsets
Condom Use (e.g., I use a condom when I have sex; Mean of i1,i3,i17,i21)
Subset for alpha
= 0.05
SIZE OF COMMUNITY
WHERE YOU GREW UP
N
1
Tukey HSDa,b
RURAL
84
2.6379
MODERATELY SIZED
349
2.8637
CITY
SUBURB OF LARGE
547
2.8742
TOWN
SMALL TOWN
245
2.8895
OTHER
18
2.9167
LARGE URBAN CITY
408
2.9892
Sig.
.110
Means for groups in homogeneous subsets are displayed.
a. Uses Harmonic Mean Sample Size = 76.252.
b. The group sizes are unequal. The harmonic mean of the group sizes is
used. Type I error levels are not guaranteed.
The Pearson Correlation coefficient can be used to measure the relationship between two
ordinal and/or two interval/ratio variables (or one ordinal and one interval ratio variable).
In the output, the correlation between these two variables is given in the line that says
"Pearson Correlation." The correlation coefficient here is .083 for the sample of 2,269
The p-value is given in the line that reads "Sig. (2-tailed)." In this case, the p-value is .000
The correlation is positive. If we were to report this in a scientific report, we would state
that the p < .05, we can conclude that there is a statistically significant relationship
between grades in school and hours spent watching TV.
5. Example for Independent T-Test using the dataset APP 1994 subset for class
Q: Are any differences between male and female students about beliefs about physical
outcomes (Q17)?
Gender is Nominal Variable
Physical outcome is Nominal Variable
To run this analysis, you can go to "Analyze" "Compare means" "Independent-samples
t-test." In the "Test Variables" box, put the variable " OutExpPhysical " In the grouping
variable box, put gender. You will then need to select "Define Groups." Here, boys are 1 and
girls are 2. Therefore, we should specify group 1 as 1 and group 2 as 2 and select "continue."
Then select "OK" or "Paste"
In the output, you will see a box called "Group Statistics." In this box, the first row shows
the descriptive statistics for males and the second row shows the descriptive statistics for
females. From the first column ("N"), we can see here that there are 729 males and 1246
females.
The mean belief about physical outcomes for male is 3.34 and for Female is 3.82. The mean
difference between males and females is 0.49. Based on the Levenes Test Equality of
Variance is .153 which is greater than .05, so we look at the top row = equal variance
assumed. The sig (2-tailed) is .000 which is less than .05. The mean difference between
groups is 0.49. We can conclude that there statistically significant difference between the
two groups.
10
6. Example of Dependent T-test (aka paired samples t-test) using SPSS Dataset KIR for Ttest
Q: We need to determine whether there is any statistically significant difference between
the PeerNorms_1 and PeerNorms_2 for Peer norms about sex and substance use (e..g, how
many of your close friends have sex / drink alcohol / smoke cigarettes).
IV= Nominal (Related) comparing T1 and T2
Go to Analyze Compare Means Paired Samples t-test
Under Variable 1 put PeerNorms_1 and for Variable 2 put PeerNorms_2
Descriptive
Paired Samples Statistics
Std. Error
Mean
Pair 1
Std. Deviation
Mean
1.7912
516
.74853
.03295
1.9954
516
.82031
.03611
Time1
Mean =1.80
SD= .75
SE=.033
Time 2
Mean 2.00
SD=.82
SE=.036
The difference between time1 and time 2 mean can be found in the mean box under paired
sample t-test which is = -.20419.
The p value is listed under Sig. (2-tailed) = .000 less than .05 which means that there are
statistically significant differences between PeerNorms_1 and PeerNorms_2. Also, because
the 95% confidence interval for the difference -.27889 to -.12950, does not include a zero.
11
Correlation
516
Sig.
.397
.000
Mean
Peer norms about
Pair 1
sex and substance
use (e..g, how many
of your close friends
have sex / drink
alcohol / smoke
cigarettes) - Time 1
-.20419
- Peer norms about
sex and substance
use (e..g, how many
of your close friends
have sex / drink
alcohol / smoke
cigarettes) - Time 2
Std.
Std.
Interval of the
Deviati
Error
Difference
on
Mean
Lower
Upper
df
Sig. (2-tailed)
515
.86371 .03802
-.27889
12
-.12950
-5.370
.000