Академический Документы
Профессиональный Документы
Культура Документы
DEFINITION:
A table or an equation that relates each outcome of a statistical experiment with its probability of
occurrence is called probability distribution.
Probability the value of random variable falls within a specified range it is called cumultative
probability.
PROBABILITY FUNCTION:
A function which assigns probabilities to the values of a random variable is called probability function.
If these two conditions aren't met, then the function isn't a probability function. There is no
be between 0 and 1.
EXAMPLE:
X 1 2 3 4 5 6 SUM
P(X) ⅙ ⅙ ⅙ ⅙ ⅙ ⅙ 6/6=1
All probability distributions can be classified as discrete probability distributions or as continuous
probability distributions, depending on whether they define probabilities associated with discrete
variables or continuous variables.
Only a finite or countable number of outcomes. Ex: marital status ( single, married, divorced, widowed),
the number of infections an infant develops during his or her first year of life.
Uniform Distribution. Suppose the random variable X can assume k different values. Suppose also that
the P(X = xk) is constant. Then,
Example:
Suppose a die is tossed. What is the probability that the die will land on 5 ?
The term "uniform distribution" is also used to describe the shape of a graph that plots observed values
in a set of data. Graphically, when the observed values in a set of data are equally spread across the
range of the data set, the distribution is also called a uniform distribution. Graphically, a uniform
distribution has no clear peaks.
Another example,
if we roll a fair die, we assume that the probability of obtaining each number is
1
6
If X is the random variable (r.v.) “ the number showing” , the probability
distribution table is
The probability distribution function (p.d.f.) is
1
P(X=x) = 6 , x=1,2,3,4,5,6
The distribution with equal probabilities is called “uniform”
It is also possible to have a continuous uniform distribution where the r.v. can be any number in a given
interval.
The continuous uniform distribution is also called the rectangular distribution.
A continuous distribution can't be illustrated with a histogram, because this would require
an infinite number of bars.
With the uniform distribution, all values over an interval (a, b) are equally likely to occur. As a result, the
graph that illustrates this distribution is a rectangle. The figure shows the uniform distribution defined
over the interval (0, 10).
The uniform distribution defined over the interval (0, 10)
The area of a rectangle equals the base times the height, or in mathematical terms,
The number of success X in n trials of a binomial experiment is called a binomial random variable.
The probability distribution of the random variable X is called a binomial distribution, and is given by the
formula:
P ( X )=nrC p x qn− x
Where
x = 0 , 1 , 2 , …………n
Example:
A coin is tossed 10 times. What is the probability that exactly 6 heads will occur.
E ( X ) =μ=np
The variance of the binomial distribution is
V ( X )=σ 2=npq
σ =√ npq
EXAMPLE:
Find mean , variance and standard deviation for the numbers of sixes that appear
when rolling 30 dice.
1 5
p= ; q=
6 6
MEAN:
E ( X ) =np
30∗1
E ( X )= =5
6
VARIANCE:
V ( X )=npq
30∗1
∗5
6 25
V ( X )= =
6 6
STANDARD DEVIATION:
The standard deviation is the square root of variance = 2.04124(approx.)
POISSON DISTRIBUTION :
The Poisson distribution was developed by the French mathematician Simeon Denis Poisson in
1837.
The Poisson distribution is a discrete probability distribution for the counts of events that occur
randomly in a given interval of time (or space).
2. The probability of a success during a small time interval is proportional to the entire length
of the time interval.
Apart from disjoint time intervals, the Poisson random variable also applies to disjoint regions of
space. The probability distribution of a Poisson random variable X representing the number of
successes occurring in a given time interval or a specified region of space is given by the
Formula:
e−μ μx
P(X) = x!
Where
x =0, 1, 2, 3…
e=2.71828
μ= mean number of successes in the given time interval or region of space
If μ is the average number of successes occurring in a given time interval or region in the Poisson
distribution, then the mean and the variance of the Poisson distribution are both equal to μ.
E(X) = μ
and
V(X) = σ2 = μ
Applications:
the number of deaths by horse kicking in the Prussian army (first application)
birth defects and genetic mutations
rare diseases (like Leukemia, but not AIDS because it is infectious and so not independent) -
especially in legal cases
car accidents
traffic flow and ideal gap distance
number of typing errors on a page
hairs found in McDonald's hamburgers
spread of an endangered animal in Africa
failure of a machine in one month
The number of meteors greater than 1 meter diameter that strike earth in a year
The number of occurrences of the DNA sequence "ACGT" in a gene
The number of patients arriving in an emergency room between 11 and 12 pm
Examples:
1. The average number of homes sold by the Acme Realty Company is 2 homes per
day. What is the probability that exactly 3 homes will be sold tomorrow?
Solution:
2. Suppose you knew that the mean number of calls to a fire station on a weekday is
8. What is the probability that on a given weekday there would be 11 calls? This
problem can be solved using the following formula based on the Poisson
distribution:
Solution:
Thus, the probability for given days is 0.072
Practice Example 1:
Practice Example 2:
A company makes electric motors. The probability an electric motor is defective is 0.01. What is
the probability that a sample of 300 electric motors will contain exactly 5 defective motors?
A cumulative Poisson probability refers to the probability that the Poisson random variable is
greater than some specified lower limit and less than some specified upper limit.
3. Suppose the average number of lions seen on a 1-day safari is 5. What is the
probability that tourists will see fewer than four lions on the next 1-day safari?
Solution:
To solve this problem, we need to find the probability that tourists will see 0, 1, 2, or 3 lions. Thus,
we need to calculate the sum of four probabilities:
If a variable can take on any value between two specified values, it is called a continuous variable.
OR
DESCRIPTION:
Any variable can have two types of values. Either the values can be fix numbers which are also known as
discrete values or a specified range that is known as continuous values. Based on these types of values a
data set is defined as continuous or discrete. In continuous data type, the values can be lying anywhere
within the range that is specified.
For Example:
The time required to drive back from office to home is always continuous.
So the probability distribution over a random variable "Y" where "Y" takes
continuous values is termed as continuous probability distribution.
FORMULA:
Continuous variables does not take discrete values. They do take continuous values such as 1.1, 2.33,
and so on. To find a random value in uniform continuous distribution, the probability of finding a value
pp = 1b−a1b−a
If a probability density function is given as f(x)f(x) then the probability of finding the continuous variable
P(a≤x≤b)=∫baf(x)dxP(a≤x≤b)=∫abf(x)dx
Example:
Word Problems:
Problem 1:
Solution:
b = 88, a = 34b = 88, a = 34
Probability Probability = 188−34188−34 = 154154
Problem 2:
Solution:
P(X≤2)P(X≤2) = ∫21x−2∫12x−2
= x−1−121x−1−112
= −1x21−1x12 = −12−12 - −11−11
= 1212 = 0.50.5
Problem 3:
Solution:
P(2 ≤ P ≤ 4)P(2 ≤ P ≤ 4) = ∫422x−3dx∫242x−3dx
= [2x−3+1−3+1]42[2x−3+1−3+1]24 = [−x−2]42[−x−2]24
= −[116−[116 -14]14] = 316316
Problem 4:
Solution:
E(X)E(X) = 11−5211−52 = 6262 = 33.
APPLICATIONS:
1. Normal distribution;
Standard normal
2. Student’s t distribution
3. Chi-square distribution
NORMAL DISTRIBUTION:
The normal (or Gaussian) distribution is a very common distribution.A normal distribution is
completely defined by specifying its mean (the value of µ) and its variance (the value of σ2). The normal
distribution with mean µ and variance σ2 is written N(µ, σ2). Hence the distribution N(20, 25) has a
mean of 20 and a standard deviation of 5; remember that the second “coordinate” is the variance and
the variance is the square of the standard deviation. In other words,the Normal distribution describes
a special class of such distributions that are symmetric and can be described by two parameters
Changing the values of µ and σ alter the positions and shapes of the distributions
Formula:
1 2 2
F(x) = 2
e−(x−µ) / 2σ
√2σ π
PROPERTIES OF NORMAL DISTRIBTUION:
• Bell-shaped
• The mean, median, and mode are equal and located at the center of the distribution.
• The curve is continuous – i.e., there are no gaps or holes. For each value of X, here is a corresponding
value of Y.
• The curve never touches the x-axis. Theoretically, no matter how far in either direction the curve
extends, it never meets the x-axis, but it gets increasingly closer to the x-axis.
• The total area under the normal distribution curve is equal to 1.00 or 100%.
• P(a < z < b) is the area below the normal curve, above the z-axis between z = a and z = b
APPLICATIONS:
Many classical statistical tests are based on the assumption that the data follow a normal
distribution. This assumption should be tested before applying these tests.
In modeling applications, such as linear and non-linear regression, the error term is often
assumed to follow a normal distribution with fixed location and scale.
The normal distribution is used to find significance levels in many hypothesis tests and
confidence intervals.
EXAMPLES:
The distribution of IQ scores is normal with a mean of 100 and standard deviation of 15.
SOLUTION:
Notice that the horizontal axis shows IQ score and the bell is centered at the mean of 100.
FORMULA:
x−μ
𝑧= σ
Examples:
Solution
Hence,
2. Let the random variable x be the time required to transport the product from
the manufacturing facility in China to the United States. Suppose that x is
normally distributed with a mean of 24 days and a standard deviation of 3
days. What is the probability that a shipment will arrive within 30 days—
P(x less than or equal to 30)?
Solution:
To determine the probability, you must first calculate the Zscore. With the information from the
example, you know that
x = 30
m = 24
s = 3
Substitute this information into the Z score formula, and standardize the value of 30, as shown
below.
STUDENT’S T DISTRIBUTION:
DEFINITION:
According to the central limit theorem, the sampling distribution of a statistic (like a sample mean)
will follow a normal distribution, as long as the sample size is sufficiently large. Therefore, when we
know the standard deviation of the population, we can compute a z-score, and use the normal
distribution to evaluate probabilities with the sample mean.
But sample sizes are sometimes small, and often we do not know the standard deviation of the
population. When either of these problems occur, statisticians rely on the distribution of the t
statistic(also known as the t score), whose values are given by:
t = [ x̅ - μ ] / [ s / sqrt( n ) ]
Where,
x̅ =sample mean
µ=proportional mean
s =standard deviation of sample
n=sample size
t= t distribution
PROPERTIES OF T DISTRIBUTION:
GRAPHS:
FORMULA:
EXAMPLE:
Let T follow a t-distribution with r = 8 df. What is the probability that the absolute value
of T is less than 2.306?
Solution.
The probability calculation is quite similar to a calculation we'd have to make for a normal random
variable. First, rewriting the probability in terms of T instead of the absolute value of T, we get:
P(|T | < 2.306) = P(−2.306 < T < 2.306)
Then, we have to rewrite the probability in terms of cumulative probabilities that we can actually
find, that is:
But the t-table doesn't contain negative t-values, so we'll have to take advantage of the symmetry of
the T distribution. That is:
P(|T | < 2.306) = P(T < 2.306) − P(T > 2.306)
Can you find the necessary t-values on the t-table?
DEFINITION:
In probability theory and statistics, the chi-squared distribution (also chi-square or χ²-
distribution) with k degrees of freedom is the distribution of a sum of the squares of k
independent standard normal random variables.
DESCRIPTION:
EQUATION:
Χ2 = [ ( n - 1 ) * s2 ] / σ2
The distribution of the chi-square statistic is called the chi-square distribution. The chi-square
distribution is defined by the following probability density function:
In the figure below, the red curve shows the distribution of chi-square values computed from all
possible samples of size 3, where degrees of freedom is n - 1 = 3 - 1 = 2. Similarly, the green curve
shows the distribution for samples of size 5 (degrees of freedom equal to 4); and the blue curve, for
samples of size 11 (degrees of freedom equal to 10).
GRAPH
PROPERTIES:
The mean of the distribution is equal to the number of degrees of freedom: μ = v.
The variance is equal to two times the number of degrees of freedom: σ 2 = 2 * v
When the degrees of freedom are greater than or equal to 2, the maximum value for Y
occurs when Χ2 = v - 2.
As the degrees of freedom increase, the chi-square curve approaches a normal distribution.
EXAMPLE:
1. The Acme Battery Company has developed a new cell phone battery. On average, the
battery lasts 60 minutes on a single charge. The standard deviation is 4 minutes.
Suppose the manufacturing department runs a quality control test. They randomly
select 7 batteries. The standard deviation of the selected batteries is 6 minutes. What
would be the chi-square statistic represented by this test?
Solution
To compute the chi-square statistic, we plug these data in the chi-square equation, as shown below.
Problem 2
Let's revisit the problem presented above. The manufacturing department ran a quality control test, using
7 randomly selected batteries. In their test, the standard deviation was 6 minutes, which equated to a chi-
square statistic of 13.5.
Suppose they repeated the test with a new random sample of 7 batteries. What is the probability that the
standard deviation in the new test would be greater than 6 minutes?
Solution
Given the degrees of freedom, we can determine the cumulative probability that the chi-square statistic
will fall between 0 and any positive value. To find the cumulative probability that a chi-square statistic falls
between 0 and 13.5, we enter the degrees of freedom (6) and the chi-square statistic (13.5) into the Chi-
Square Distribution Calculator. The calculator displays the cumulative probability: 0.96.
This tells us that the probability that a standard deviation would be less than or equal to 6 minutes is 0.96.
This means (by the subtraction rule) that the probability that the standard deviation would begreater
than 6 minutes is 1 - 0.96 or .04.
REFERENCES:
https://people.richland.edu/james/lecture/m170/ch06-prb.html
http://stattrek.com/probability-distributions/probability-distribution.aspx
http://www.probabilityformula.org/continous-probability-distribution.html
http://stattrek.com/probability-distributions/discrete-continuous.aspx?tutorial=stat
http://www.math.uah.edu/stat/dist/Continuous.html
http://stattrek.com/probability-distributions/t-distribution.aspx
http://mathworld.wolfram.com/Studentst-Distribution.html
https://en.wikipedia.org/wiki/Student%27s_t-distribution
https://onlinecourses.science.psu.edu/stat414/node/175
https://en.wikipedia.org/wiki/Chi-squared_distribution
http://stattrek.com/probability-distributions/chi-square.aspx
http://www.stats.ox.ac.uk/~marchini/teaching/L5/L5.notes.pdf
http://www.intmath.com/counting-probability/13-poisson-probability-distribution.php
http://stattrek.com/probability-distributions/poisson.aspx
http://onlinestatbook.com/2/probability/poisson.html
http://mathworld.wolfram.com/StandardNormalDistribution.html
file:///C:/Users/KT/Desktop/L6.slides.pdf
http://stattrek.com/probability-distributions/discrete-continuous.aspx?Tutorial=Stat
http://bk.psu.edu/clt/Stat200/Chapter6_Printable.pdf
http://www.itl.nist.gov/div898/handbook/eda/section3/eda3661.htm
http://www3.ul.ie/~mlc/support/Loughborough%20website/chap21/21_2.pdf
http://www.intmath.com/counting-probability/13-poisson-probability-distribution.php