Академический Документы
Профессиональный Документы
Культура Документы
Median the formal measure of midpoint, order all observations then use
M = (n+1)/2 which gives the location of M in the list of ordered pairs
Measuring Spread
Standard Deviation s
Variance s is the average of squares of the deviations of the observations from their mean
density curve is an idealization of the actual data, when computing the mean and standard deviation we
use different symbols to distinguish them
mean of a density curve is
SD of a density curve as
the standard deviation controls the spread, the larger the is, the larger the spread
Normal distribution N( ,)
Cumulative proportion for a value x in a distribution are the proportion of observations in the
distribution that are less than or equal to x
Put values into z-scores then use table to find the portion to the left of the value
To find normal proportions by table
draw picture of curve, proportion needed
standardize
use table
To find normal proportion by percentage and table
draw picture of curve, proportion needed
use table
nu-standardize using x = + (table value)
FOCUS ON CHAPTER 3
Scatter plot shows the relationship between two qualitative variables measured on the same
individual
Correlation measures the direction and strength of the linear relationship between two quantitative
variables the symbol is r
Regression Lines summarizes the relationship between two variables, Specifically only when there is
a explanatory variable and a response variable. Often used to predict the Y-value
when given the X-value
The least-squares regression line of y-value on x-value is the line that makes the sum of squares of the
vertical distances of the data points from the line as small as possible
The Equation of the Least-Squares Regression line
y for x
intercept
The smaller the P-value is, the stronger the evidence AGAINST the H provided by the data
The smaller the P-value is, the stronger the evidence FOR the H provided by the data
significance level is from 0 to 1, the lower the significance level is the more likely it is to occur
If the P-value is as small or smaller than , we say that the data are statistically significant at level .
If p0.05 then there is no more than 1 chance in 20 that a sample would give this strong of
evidence by chance when H is actually true
z test for a population Mean
draw an SRS of size n from a Normal population that has unknown mean and known standard
deviation
to test the H that has a specified value
H : =
calculate the one-sample z test statistic
in terms of a variable Z having the standard Normal distribution, the P-value for a test of H against
The numerator measures how far the sample mean deviates from the hypothesized
mean 0. Larger values of the numerator give stronger evidence against H :
= . The denominator is the standard deviation of xx. It measures how much
random variation we expect. There is less variation when the number of observations
n is large. So z gets larger (more significant) when the estimated effect
xx - gets larger or when the number of observations n gets larger. Significance
depends both on the size of the effect we observe and on the size of the sample.
Understanding this fact is essential to understanding significance tests.
The z confidence interval for the mean of a N( , ) will have the margin of error m with sample size
falsely rejecting H is type 1 error, the significance level is the probability of type 1
failing to reject H when it is false is type 2 error, the power , is 1 minus the probability of type 2