Вы находитесь на странице: 1из 16

Statistics

- Mathematical method of dealing with the collection, presentation, analysis, and interpretation of numerical facts and figures in order to draw conclusions and aid in making decisions on the basis of such analysis.

Statistics
Collection process of obtaining numerical measurements or data. Presentation or tabulation organizing of data into tables, graphs or charts. Analysis process of formulating numerical description from the given data Interpretation drawing of valid conclusions from the analyzed data

Statistics
Ungrouped Data Grouped Data Ungrouped Data

Discrete Variable Data Continuous

Grouped Data

Raw Data data presented in an array. It is a listing of the values as observed. Ordered Data raw data is arranged in either in increasing or decreasing order Frequency Data frequency of occurrence of distinct data values or cells are given Ungrouped Data enumeration of distinct data Grouped Data data grouped into cells or range of values

Frequency is the number of times that a value is observed. 4.1 Frequency Tally Sheet It is a method that determines the frequency value for the observed data or given cell. It provides a good idea of the general shape of the frequency data.

Preparing Frequency Tally Sheets for Ungrouped Data


Relative Frequency

Household sizes of various residences in a barangay in Metro Manila Table 4.1 Frequency Tally Sheet
Household Frequency Frequency Relative Sizes Tally Number Frequency (f)

3 6 8 4 5

6 7 7 7 4

6 5 7 8 9

7 5 9 6 7 5 6 8 9 3 10 10 8 5 4

8 3 4 10 7

6 8 5 6 7

Total(n):

Grouped Data Parameters Cells (k) are also known as class intervals. It is the grouping of successive data values using a specific upper and lower limit. No. of Data (n)z Under 50 50-100 100-250 Over 250 No. of Classes (k) 5-7 6-10 7-12 10-20

(round down)

Cell Width (c) it is the numerical value distance between the limits o any two cell. If possible all intervals should have the same width for easier computation and interpretation.
(round up)

Cell Boundaries the cell or class boundaries are values one-half measurement unit more accurate that the observed data. Therefore, if the observed accuracy is one decimal, the boundaries are written in two decimal values. This is necessary so that no value can be observed exactly on the boundary. Cell Midpoint this is one half of the distance between the cell boundaries. Each midpoint is assumed to be the point of concentration of all values in the cell.

Diameters of the rivet heads in 1/100 of an inch


6.72 6.66 6.72 6.77 6.66 6.64
Class Boundaries (LCB-UCB)

6.62 6.74 6.82 6.76 6.72 6.81


Frequency Tally

6.70 6.73 6.76 6.79 6.78 6.80


Frequency Number (f)

6.70 6.78 6.70 6.72 6.78 6.66

6.62 6.76 6.76 6.76 6.75 6.76

6.67 6.76 6.66 6.68 6.70 6.78


Midpoint ()

Relative Frequency

Total (n):

A. Mean The sample mean, denoted by , is the sum of the sample values divided by the sample size.
Where

MEASURES OF LOCATION (UNGROUPED DATA)

= arithmetic mean = actual values n = number of observations Also, Where; = frequency of each value

Example 1. An article in the Journal of Structural Engineering (Vol. 115, 1989) describes an experiment to test the yield strength of circular tubes with caps welded to the ends. The first yields (in kN) are 96, 96, 102, 102, 102, 104, 104, 108, 126, 126, 128, 128, 140, 156, 160, 160, 164, and 170. Calculate the sample mean.

B. Median Median, represented by , is actually the middle value, which divides the data set into two equal halves. Unlike the mean.
The value of the median is found by ordering the data in increasing size and counting until the middle observation is reached.

If the total number of observations is odd, is the center value. If it is an even value, is the average of the two middle values.

C. Mode Mode, , is another measure of central tendency which occurs at the most observed value of the variable. This is the value of the cell with the highest frequency. There should be at most 2 modes.

MEASURES OF VARIATION Used to describe the variability or scattering of data. This is used to gauge the reliability of averages as well as serve as a basis for control of variability itself.

A. Range The range is computed as the difference between the smallest and the largest value of the observation.

B. Standard Deviation This is the most commonly used measure of dispersion. It gives a numerical value in the units of the variable itself, to the clustering tendency of data.

Where

xi = actual value of the observation n = total number of observations = measure of average fi = frequency of the observation

Вам также может понравиться