
Developed By Graeme Henchel

To show how to do some common statistical operations using Excel

http://hench-maths.wikispaces.com

Data 12 23 23 56 87 57 56 80 32 52 82 29 90 43 84 18 92 2 58 1 95 54 33 72 83 16 89

Generating Random Numbers

RANDBETWEEN
This function produces a random integer between the values you specify.

For example =RANDBETWEEN(0,100) will produce a random whole (integer) number between 0 and 100.

In order for this function to work you need to have installed the Analysis ToolPak add-in. To do this go to Tools-->Add-Ins and select Analysis ToolPak.

Pressing F9 will recalculate the random value. The sheet will also recalculate automatically every time you make another entry. This can be annoying unless you turn it off. Do this in the Tools-->Options dialogue box: select manual calculation.
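For readers who want to try the same idea outside Excel, here is a minimal Python sketch (not part of the original workbook). The helper name `rand_between` is made up for illustration. It assumes that both end values can be returned, as with =RANDBETWEEN(0,100).

```python
import random


def rand_between(low, high):
    """Return a random integer N with low <= N <= high (both ends included).

    Hypothetical helper that mirrors the behaviour described for RANDBETWEEN.
    """
    return random.randint(low, high)


# Each call plays the role of one recalculation (pressing F9 in Excel).
values = [rand_between(0, 100) for _ in range(5)]
print(values)
```

Running the script again gives a different list, much as the sheet changes on every recalculation.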

Sorting data into Rank order


Unsorted Data 76 34 49 87 38 70 95 4 50 54 42 65 89 79 67 15 4 89 64 63 26 14 54 26 53 72

First highlight the data you want to sort by selecting all the cells.
Then use the DATA-->SORT menu to sort an unordered list of data into an ordered list.

Sorted data 4 4 14 15 26 26 34 38 42 49 50 53 54 54 63 64 65 67 70 72 76 79 87 89 89 95

Data sorted into rank order

The sort function can be used in more advanced ways to sort data into various levels of order. For example, you might have names, sex and age for a group of people, and you can sort this data in various orders depending on what you consider to be the most important or useful way for your purposes.
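The same two ideas, a plain rank-order sort and a sort on more than one level, can be sketched in Python. This is an illustration only, not the workbook's method. The `people` records are invented for the example.

```python
# Unsorted data from the sheet.
unsorted = [76, 34, 49, 87, 38, 70, 95, 4, 50, 54, 42, 65, 89,
            79, 67, 15, 4, 89, 64, 63, 26, 14, 54, 26, 53, 72]

# Rank order: smallest to largest.
ranked = sorted(unsorted)
print(ranked)

# Hypothetical records (name, sex, age), invented for illustration.
people = [("Ann", "F", 31), ("Bob", "M", 25), ("Cy", "M", 31), ("Di", "F", 25)]

# Two-level sort: by sex first, then by age within each sex.
by_sex_then_age = sorted(people, key=lambda person: (person[1], person[2]))
print(by_sex_then_age)
```

Changing the order of the fields in the key changes which level is treated as the most important.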

Sorted data 2 4 4 40 14 15 24 26 26 34 38 42 49 50 50 52 53 54 54 63 63

Mean, Median, Mode, Minimum, Maximum, Range

Mean      36.04762
Median    40
Mode      4
Minimum   2
Maximum   63
Range     61

Use the following functions to find mean, median, mode, maximum, minimum

Mean: =AVERAGE(number1,number2,...) or =AVERAGE(array)

Median: =MEDIAN(number1,number2,...) or =MEDIAN(array)

Mode: =MODE(number1,number2,...) or =MODE(array)

Minimum: =MIN(number1,number2,...) or =MIN(array)

Maximum: =MAX(number1,number2,...) or =MAX(array)

Click on the yellow cells and examine the formulae used for the data set (B4:B24)

To find the Range create a formula based on range=max-min

NOTE: The data does not have to be sorted in rank order for these functions to work. But it is worthwhile doing the sort anyway
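As a cross-check on the values listed for this data set, here is a minimal Python sketch using the standard `statistics` module. It is not part of the workbook. It assumes Python 3.8 or later, where `statistics.mode` returns the first of several equally common values. For this data that is 4, which matches the Mode shown.

```python
import statistics

# The 21 values from the data column (B4:B24), in the order shown on the sheet.
data = [2, 4, 4, 40, 14, 15, 24, 26, 26, 34, 38,
        42, 49, 50, 50, 52, 53, 54, 54, 63, 63]

mean = statistics.mean(data)
median = statistics.median(data)
mode = statistics.mode(data)      # first most-common value (Python 3.8+)
minimum = min(data)
maximum = max(data)
data_range = maximum - minimum    # range = max - min

print(round(mean, 5), median, mode, minimum, maximum, data_range)
```

None of these calls needs the list to be sorted first, which agrees with the note above.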

Sorted data 2 (Minimum) 4 14 15 24 26 26 (Q1) 34 38 42 49 50 (Q2) 50 52 53 54 54 65 67 (Q3) 70 72 76 77 89 95 (Maximum)

Quartiles, Maximum, Minimum

The function =QUARTILE(array,quart) can be used to find quartile and max min of a set of data.

Examine the yellow cells to see the formulae used

Array is the array or cell range of numeric values for which you want the quartile value.
Quart indicates which value to return.

If quart equals   QUARTILE returns
0                 Minimum value
1                 First quartile (25th percentile)
2                 Median value (50th percentile)
3                 Third quartile (75th percentile)
4                 Maximum value

For this data set: Q1 = 26, Q2 = 50, Q3 = 67, Maximum = 95

This information can be used to construct a box plot but Excel does not have an inbuilt graphing function to do this. It can however be constructed using drawing objects.
See the file Box_plot.xls to see how to do this.
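The quartile values for this data set can be reproduced with a short Python sketch. It is an illustration, not the workbook's method. It assumes Python 3.8 or later and uses the "inclusive" method of `statistics.quantiles`, which agrees with the values shown on this sheet.

```python
import statistics

# The 25 values from the sorted data column.
data = [2, 4, 14, 15, 24, 26, 26, 34, 38, 42, 49, 50, 50,
        52, 53, 54, 54, 65, 67, 70, 72, 76, 77, 89, 95]

# n=4 asks for the three cut points that split the data into quarters.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")

# Minimum, Q1, median, Q3, maximum: the five values a box plot is drawn from.
five_number = (min(data), q1, q2, q3, max(data))
print(five_number)
```

The five values correspond to quart = 0, 1, 2, 3 and 4 in the table above.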

Class Interval based Frequency Table


Sorted data 2 4 4 14 15 24 26 26 34 38 42 49 50 50 52 54 54 63 64 80 87 95
Sum of data 927

Bin    Frequency
10     3
20     2
30     3
40     2
50     4
60     3
70     2
80     1
90     1
100    1
Sum    22

A class interval can be determined and the frequency of occurrence of observations within each class interval found. The FREQUENCY function can be used to determine frequencies for given intervals.

Determine the intervals you want to use and create an array, e.g. D4:D13. This is called your bin array.

The frequency function has the following logic: =FREQUENCY(Data_array, Bin_array), where data_array is the data you want to analyse and bin_array is your intervals.

However it needs to be entered as an array formula. This is how you do it.

In cell E4 type =FREQUENCY(B4:B25,D4:D13) then press Enter.
Now select all the cells where you want the frequencies to be placed, i.e. E4:E13.
Next press F2 (this puts the formula in the formula bar).
Then press CTRL+SHIFT+ENTER (this makes an array formula in your selected cells).
Note the array formula is surrounded by braces { }

The sum of a set of cells can be calculated in several ways.

Method 1: use the =SUM(array) formula, e.g. to sum cells B2 to B35 use the formula =SUM(B2:B35)

Method 2: use the AutoSum button. This is on the toolbar next to the hyperlink button. Select the cells you want to sum, click the button and the sum will appear in the cell below your list. If you then look at the formula it looks the same as Method 1 above.

NOTE: An easier method for a frequency table is to use the TOOLS-->DATA ANALYSIS method explained on the freq+histogram sheet
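The counting done by FREQUENCY can be sketched in Python. This is an illustration under stated assumptions, not Excel's implementation. The function name `frequency` is made up. It assumes the bins are in ascending order and that each value is counted in the first bin whose upper limit is greater than or equal to it. This reproduces the table above.

```python
from bisect import bisect_left


def frequency(data, bins):
    """Count how many values fall in each class interval.

    A value x is counted in the first bin b with x <= b. One extra slot
    at the end collects values above the last bin.
    """
    counts = [0] * (len(bins) + 1)
    for x in data:
        counts[bisect_left(bins, x)] += 1
    return counts


data = [2, 4, 4, 14, 15, 24, 26, 26, 34, 38, 42,
        49, 50, 50, 52, 54, 54, 63, 64, 80, 87, 95]
bins = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]

counts = frequency(data, bins)
print(counts)        # one count per bin, plus the overflow slot
print(sum(counts))   # should equal the number of data values
```

The total of the counts equals the number of observations, which is a quick check that no value has been missed.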

Data 2 4 14 15 24 26 26 34 38 42 49 50 50 52 53 54 54 65 67 70 72 76 77 89 95

Bin 10 20 30 40 50 60 70 80 90 100

A frequency table and a Histogram can be constructed using the DATA-ANALYSIS functions.

Assuming you have your data, you need to decide on the class intervals you want to use. These are called your bins. In this example you can see the class intervals going up by 10. You need to put these in the spreadsheet somewhere.

Go to TOOLS-->DATA ANALYSIS and select Histogram. You can see the data ranges that have been put in for this example. The input range is your raw data. The bin range is your class intervals. The output range is just a cell where you want the output to begin to appear. I have put $k$13. Chart output is selected so that a chart is produced as well.

Bin    Frequency
10     2
20     2
30     3
40     2
50     4
60     4
70     3
80     3
90     1
100    1
More   0

[Chart: Histogram. Frequency (vertical axis) against Bin (horizontal axis)]

Data 2 4 14 15 24 26 26 34 38 42 49 50 50 52 53 54 54 65 67 70 72 76 77 89 95

Bin 10 20 30 40 50 60 70 80 90 100

Bin    Frequency   Cumulative %
10     2           8.00%
20     2           16.00%
30     3           28.00%
40     2           36.00%
50     4           52.00%
60     4           68.00%
70     3           80.00%
80     3           92.00%
90     1           96.00%
100    1           100.00%
More   0           100.00%

A Cumulative % table and chart can also be constructed by using the Tools-->Data Analysis-->Histogram function. Proceeding as before, simply check the Cumulative Percentage box to produce the table and graphs as shown.
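The Cumulative % column is a running total of the frequencies divided by the number of observations. A minimal Python sketch of that arithmetic follows. It is an illustration only and uses the frequencies from the histogram output for this data set.

```python
from itertools import accumulate

bins = ["10", "20", "30", "40", "50", "60", "70", "80", "90", "100", "More"]
frequencies = [2, 2, 3, 2, 4, 4, 3, 3, 1, 1, 0]

total = sum(frequencies)  # number of observations

# Running total of the frequencies, expressed as a percentage of the total.
cumulative_pct = [round(100 * running / total, 2)
                  for running in accumulate(frequencies)]

for label, freq, pct in zip(bins, frequencies, cumulative_pct):
    print(f"{label:<5}{freq:<4}{pct:.2f}%")
```

The last entry is always 100%, because by then every observation has been counted.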

[Chart: Histogram. Frequency (left axis) and Cumulative % (right axis) against Bin]

Data 2 4 14 15 24 26 26 34 38 42 49 50 50 52 53 54 54 65 67 70 72 76 77 89 95

Stem   Leaf
0      2 4
10     4 5
20     4 6 6
30     4 8
40     2 9
50     0 0 2 3 4 4
60     5 7
70     0 2 6 7
80     9
90     5

A simple stem and leaf plot can be easily constructed. First sort the data.

Adjusting column widths makes for a neater appearance.
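The construction can also be sketched in Python. This is an illustration, not the workbook's method. The function name `stem_and_leaf` is made up. It assumes, as on the sheet, that the stem is the tens part of each value (0, 10, 20, ...) and the leaf is the units digit.

```python
from collections import defaultdict


def stem_and_leaf(data):
    """Group the units digits (leaves) under the tens part (stems)."""
    rows = defaultdict(list)
    for x in sorted(data):               # sort first so leaves come out in order
        rows[x // 10 * 10].append(x % 10)
    return dict(rows)


data = [2, 4, 14, 15, 24, 26, 26, 34, 38, 42, 49, 50, 50,
        52, 53, 54, 54, 65, 67, 70, 72, 76, 77, 89, 95]

plot = stem_and_leaf(data)
for stem in sorted(plot):
    leaves = " ".join(str(leaf) for leaf in plot[stem])
    print(f"{stem:>3} | {leaves}")
```

The fixed-width stem column plays the same role as adjusting the column widths in the spreadsheet.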

Statistical functions

Function       Description
AVEDEV         Returns the average of the absolute deviations of data points from their mean
AVERAGE        Returns the average of its arguments
AVERAGEA       Returns the average of its arguments, including numbers, text, and logical values
BETADIST       Returns the beta cumulative distribution function
BETAINV        Returns the inverse of the cumulative distribution function for a specified beta distribution
BINOMDIST      Returns the individual term binomial distribution probability
CHIDIST        Returns the one-tailed probability of the chi-squared distribution
CHIINV         Returns the inverse of the one-tailed probability of the chi-squared distribution
CHITEST        Returns the test for independence
CONFIDENCE     Returns the confidence interval for a population mean
CORREL         Returns the correlation coefficient between two data sets
COUNT          Counts how many numbers are in the list of arguments
COUNTA         Counts how many values are in the list of arguments
COUNTBLANK     Counts the number of blank cells within a range
COUNTIF        Counts the number of nonblank cells within a range that meet the given criteria
COVAR          Returns covariance, the average of the products of paired deviations
CRITBINOM      Returns the smallest value for which the cumulative binomial distribution is less than or equal to a criterion value
DEVSQ          Returns the sum of squares of deviations
EXPONDIST      Returns the exponential distribution
FDIST          Returns the F probability distribution
FINV           Returns the inverse of the F probability distribution
FISHER         Returns the Fisher transformation
FISHERINV      Returns the inverse of the Fisher transformation
FORECAST       Returns a value along a linear trend
FREQUENCY      Returns a frequency distribution as a vertical array
FTEST          Returns the result of an F-test
GAMMADIST      Returns the gamma distribution
GAMMAINV       Returns the inverse of the gamma cumulative distribution
GAMMALN        Returns the natural logarithm of the gamma function, Γ(x)
GEOMEAN        Returns the geometric mean
GROWTH         Returns values along an exponential trend
HARMEAN        Returns the harmonic mean
HYPGEOMDIST    Returns the hypergeometric distribution
INTERCEPT      Returns the intercept of the linear regression line
KURT           Returns the kurtosis of a data set
LARGE          Returns the k-th largest value in a data set
LINEST         Returns the parameters of a linear trend
LOGEST         Returns the parameters of an exponential trend
LOGINV         Returns the inverse of the lognormal distribution
LOGNORMDIST    Returns the cumulative lognormal distribution
MAX            Returns the maximum value in a list of arguments
MAXA           Returns the maximum value in a list of arguments, including numbers, text, and logical values
MEDIAN         Returns the median of the given numbers
MIN            Returns the minimum value in a list of arguments
MINA           Returns the smallest value in a list of arguments, including numbers, text, and logical values
MODE           Returns the most common value in a data set
NEGBINOMDIST   Returns the negative binomial distribution
NORMDIST       Returns the normal cumulative distribution
NORMINV        Returns the inverse of the normal cumulative distribution
NORMSDIST      Returns the standard normal cumulative distribution
NORMSINV       Returns the inverse of the standard normal cumulative distribution
PEARSON        Returns the Pearson product moment correlation coefficient
PERCENTILE     Returns the k-th percentile of values in a range
PERCENTRANK    Returns the percentage rank of a value in a data set
PERMUT         Returns the number of permutations for a given number of objects
POISSON        Returns the Poisson distribution
PROB           Returns the probability that values in a range are between two limits
QUARTILE       Returns the quartile of a data set
RANK           Returns the rank of a number in a list of numbers
RSQ            Returns the square of the Pearson product moment correlation coefficient
SKEW           Returns the skewness of a distribution
SLOPE          Returns the slope of the linear regression line
SMALL          Returns the k-th smallest value in a data set
STANDARDIZE    Returns a normalized value
STDEV          Estimates standard deviation based on a sample
STDEVA         Estimates standard deviation based on a sample, including numbers, text, and logical values
STDEVP         Calculates standard deviation based on the entire population
STDEVPA        Calculates standard deviation based on the entire population, including numbers, text, and logical values
STEYX          Returns the standard error of the predicted y-value for each x in the regression
TDIST          Returns the Student's t-distribution
TINV           Returns the inverse of the Student's t-distribution
TREND          Returns values along a linear trend
TRIMMEAN       Returns the mean of the interior of a data set
TTEST          Returns the probability associated with a Student's t-test
VAR            Estimates variance based on a sample
VARA           Estimates variance based on a sample, including numbers, text, and logical values
VARP           Calculates variance based on the entire population
VARPA          Calculates variance based on the entire population, including numbers, text, and logical values
WEIBULL        Returns the Weibull distribution
ZTEST          Returns the one-tailed probability-value of a z-test
