Академический Документы
Профессиональный Документы
Культура Документы
INFORMATICS STATISTICS
PRACTICAL LAB
ARRANGED BY:
ANNA HENDRI S.J., S.Kom., M.Cs
Assalammu’alaikum Wr. Wb
Alhamdulillah, author conveys the presence of Allah SWT who has given his blessings and blessings so
that the completeness of this Informatics Statistics practicum manual can be completed. This practicum
manual and theory are the third revised edition. In the preparation of this book, the author experienced
a little difficulty in adjusting to the theoretical lecture RPS. Because not all lecture material can be
practiced in practicum implementation.
However, the authors still try to synchronize practical material with theory so that students do not
experience confusion in conducting analysis and processing data. The applications used are SPSS, MS
Excel and WEKA, but the usage is not too much different. Hopefully this Statistics practicum manual can
be useful and facilitate the practicum activities.
The author hopes that he can receive input, criticism from students, practicum assistants and other
statistical support lecturers for further development of this practical book. Finally, the authors are happy
to read and implement it.
Author
ii
TABLE OF CONTENTS
Preface
iii
PRACTICUM 1
IDENTIFICATION OF DATA TYPES AND SAMPLE TAKING
Time :2.5 Hours
Basic competencies :
After following this practicum, students are able to:
1. Identify the types of data in the real world
2. Provide an explanation of data classification and how data processing is in the SPSS application
a) FILE
For SPSS document file operations that have been made, both for printing repairs and so
on. There are 5 types of data used in SPSS, they are:
5. Database
♠ Recently used data: contains a list of data files that have been opened before.
♠ Recently used file : contains a list of files as a whole that have been done
b) EDIT
To make edits to the SPSS operation both data, and settings / options for the overall SPSS
configuration.
♠ Cut : deletion of a cell / text / object, copied for specific purposes with
the command from the paste menu
♠ Paste : displays a cell / text / object that results from the copy
Or cut command
For setting the capture on the SPSS work screen, and find out the processes that are happening in
the SPSS operation.
♠ Fonts : to set the type, font size in the SPSS data editor
- Outline size : SPSS output sheet font size
- Outline font : font type of SPSS output sheet
♠ Gridlines : set cell lines in the SPSS editor
♠ Value labels : adjust the display in the editor to find out the value label
d) DATA
♠ Define Dates : Defines a time for variables that include hour, date, year, and so
on
♠ Merge files : merge several SPSS document files, which are carried out by
e) TRANSFORM
♠ Count : to find out the amount of a certain data size on a particular line
♠ Recode : to replace the value in a particular variable column, its nature
replace (into the same variable) or change (into different variable) on the new variable
f) ANALYSE
Analyse menu is used to analyse data that we have entered into the variable. This menu is the
most important menu because all data processing and analysis is done using the correlate,
compare means, variable menu.
g) GRAPH
The graph menu is used to create graphics, including bar, line, pie, etc.
h) UTILITIES
The utilities menu is used to find out variable information, file information, etc.
i) ADD-ONS
The add-ons menu is used to give commands to SPSS if you want to use additional applications,
for example using the Amos application, SPSS data entry, text analysis, etc.
j) WINDOWS
The windows menu is used to make switches (switch) from one file to another.
k) HELP
The help menu is used to help users understand SPSS commands if they encounter difficulties.
POINTER : A cursor that indicates the position of the cell that is currently active /
selected.
Practical Work (Post Test) 1
1. Based on the data in Table 1.1, determine the data type, data type, its measurement scale!
Table 1.1. ABC Region Toddler Food Product Data
No Registration Number Product name Product Brands
1 Advanced Formula For Children 6-12
md 860513424001 LACTOGEN 2
Months
2 md 860513439001 NESTLE – Danstart
Advanced Formula for Infants 6-12 Months
2. Look for problems that occur in real life (for example in other cases). Look for and observe
examples of the types of data from the sample that you get through the internet!
For example: UAD Informatics Engineering students who have a work waiting period of less
than
6 months 30%
3. Formulate the problems you face, then detail these problems in the forms of information that
must be presented. Based on the case in step no. 1, the problem formulation is obtained;
6. Make the classification in the form of a table that contains parameters: the type of data and
the type of data and the scale of measurement of the data!
9. For Example; make a tabulation for the case you observed by following the format in Table 2.1.
Table 2.1. Example of the Data Identification Formulation Format
Specified population:
Definition of Population:
20
PRACTICAL LAB 2
FREQUENCY DISTRIBUTION
Indicator :
After following this practicum, students create a frequency distribution table by calculating the
value of the lower edge and making a table
To analyse the size of centring, the size of the location and the size of the deviation (when the
size is included in the statistics description), it can be done with a procedure.
a. Analyse Statistics Frequencies b. Analyse Descriptive
37
Table 2.1 Mobile User Data
No Handphone Price Total User Region
Brand
1 Samsung 3-10 Million 46% Korea
2 Sony Ericson 1-2 Million 5% Amerika
3 BlackBerry 2-10 Million 8% Amerika
4 Iphone 4-20 Million 30% Eropa
5 Oppo 1-5 Million 10% Asia
6 Asus 1-3 Million 3% Taiwan
7 LG 1-2 Million 2% Taiwan
8 Zyrex 500-1 Million 1% Tiongkok
9 Nokia 1-3 Million 2% Amerika
10 Avio 2-5 Million 3% China
- Variable names must start with a letter and cannot end with a period.
- There must not be the same, by not distinguishing lowercase or uppercase letters.
●Type, Width dan Decimal Variable
- The default of each new variable type is numeric, 8 characters wide according to decimal digits
of 2 digits.
- To change the variable type is done by clicking the selection button in the Type column.
- There are 8 types of variables, namely:
a. Numeric: numbers, signs (+) or (-) in front of numbers, decimal indicators
b. Comma: numbers, signs (+) or (-) in front of numbers, decimal indicators, commas as
separators of thousands
c. Dot: numbers, signs (+) or (-) in front of numbers, decimal indicators, periods as separators of
thousands
d. Scientific notation: same as the numeric type, but uses the symbol E for multiples of 10 (eg
120000 = 1.20E + 5)
38
e. Date: displays date or time format data
f. Dollar: give a dollar sign ($), a comma as a separator of thousands and a period as a decimal
g. Custom currency: for currency formats
f. String: usually letters or other characters
3. Saving Data
After the data has been entered, the data needs to be saved for the next analysis. The data
storage steps are as follows:
Click File Menu → Save Data → (Select storage folder), type File Name → Click OK.
39
Practical Work (Post Test) 2
PROCEDUR: Analyse Descriptive Statistics Frequencies
Click menu Analyse Descriptive Statistics Frequencies
Highlight the variable to be analysed then move it to the variable box by clicking the mark “”
Ÿ Click Statistics, check all Percentile Values check boxes
(Note: to determine the Percentile value of 10.25 and so on, it is done by putting a check box in
the percentile box)
Click chart, Select Histogram if you want to display
Click format, mark the ascending value on the order by option to sort the data from the smallest
largest value.
Click OK.
41
Case Example 2: An online store collects customer data to find out the loyalty of its customers.
Customer data is shown in Table 2.3.
Table 2.3. Customer data
Long be Number of transactions each The amount of the purchase
Name customer month Price each month
A. Rifai 29 2 34000
Anis-Yudi 29 3 65000
Anis 29 4 75000
Antin 29 6 55000
Alfi 26 1 90000
Asni 26 1 140000
Dewi 25 2 67000
Dewi 23 3 350000
Dede/ Ahli 22 5 165000
Dian 21 6 120000
Dedi 20 7 67000
Happy 20 3 75000
In 20 4 60000
Indra 20 2 60000
Indra 20 2 165000
Ipul 15 3 90000
Iqbal 14 4 35000
Eki 5 2 75000
From table 2.3 make a frequency distribution table and determine the lower and upper edges!
Case example 3.
Table 2.4 is data on 15 arrivals of Trans Yogyakarta buses taken randomly:
42
79 MT Haryono 22
81 SD Pujokusuman 55
83 SD Pujokusuman 39
75 South Papmi 34
97 Kehutanan 44
99 Kehutanan 36
52
PRACTICUM 3
FREQUENCY DISTRIBUTION (GRAPH AND REACH)
Time : 2.5 Hours
Basic competencies :
Student is able to provide an explanation of the distribution of data distribution with SPSS
Indicator :
After following this practicum, Student is able to make a frequency distribution table by
calculating range values.
77 85 75 76 63 72 81 73 67 86
74 53 76 62 78 88 57 73 80 65
75 71 65 76 85 78 97 67 62 79
71 83 79 60 95 75 61 89 78 96
60 68 74 69 77 94 75 82 78 66
53
2) Make a Frequency distribution table. Steps to make a frequency distribution table:
The number of data = 50, the smallest data value = 53 and the largest data value = 97. then the
range = 97 - 53 = 44. For example, we set the class width = 7, then the number of classes = 44/7 = 7
(rounded). So the class interval is as presented in table 3.1.
Table 3.1. Example of a frequency distribution table
Class Interval fi
52.5 59.5 2
59.5 66.5 9
66.5 73.5 9
73.5 80.5 18
80.5 87.5 6
87.5 94.5 3
94.5 101.5 3
From the table make the graph in the form of Pie, Bar and Histogram
55
15.Resolved a case study of cumulative frequency material using SPSS
16.Analysing the results of data processing from SPSS
Form of Competency Test: students take written examinations and computer practice exams based on
the competency test questions given during the exam
Timing: in the 5th week of the lecture according to the pre-practice schedule in the Basic Computing
laboratory
PRACTICUM 5
MEASURING THE CENTER (MEAN, MEDIAN AND
MODUS)
Time : 2,5 Hours
Basic Competence :
Student can analyse the data for the size of concentration and apply it to EXCEL
Indikator :
for the size of concentration and apply it to After following this practicum,
Praktikan can learn and understand the size of data centering using the Excel
application
Columns A and B contain the data to be processed Columns D and E will be filled with a
weighted average formula.
Figure 5.1. Example calculation of the average with Excel
i) Using the sumproduct, in cell E3 type the formula = SUMPRODUCT (A3: A4, B3: B4) / SUM
(B3: B4)
ii) Or manual formula by selling E4 type the formula = ((A3 * B3) + (A4 * B4)) / SUM (B3: B4)
2. Match the results you get using the Excel program or SPSS.
59
PRACTICUM 6
Practicum of Measure of Center (Quartiles)
Statistical measures are measures that indicate how a data group is centralized and spread. Within the
statistical measure there are three forms of data description size, namely: data centre size, size of data
variability and size of data distribution forms. The size of the data centres that are widely used to
describe data are the mean (mean count), median and mode. The size of the spread of a group of data
to a data centre is called disperse or variation or diversity of data. The commonly used data disperse
sizes are range, variance and standard deviation
1. Quartile
Is a value that divides data groups that have been sorted (ascending) into four equal parts. Quartile
value consists of quartile 1, quartile 2 and quartile 3. The value of quartile 2 of a data group is the
same as the median value.
I. Make a table
Fill in the data in column A in the range B3: B20 following data has been sorted from the smallest
value to the largest value).
ii. Type a formula to calculate quartile values
a. In cell D3 type: = = QUARTILE (B3: B20) Equation ii to calculate the minimum value.
60
b. In cell D4 type = QUARTILE (B3: B20,1) This equation is to find the value of Q1, quartile to 1.
c. In cell D5 type = QUARTILE (B3: B20.2) This equation is to find the value of Q2, second
quartile.
d. In cell D6 type = QUARTILE (B3: B20,3) This equation is to find the value of Q3, 3rd quartile in
cell.
1. Median, median is the middle value of a set of data, the median divides the data into 2 equal parts
and is also known as quartile 2 (K2),
2. Quartile (Quartile). Quartiles are observational values that divide data into 4 equal parts,
sometimes called k1, k2 (median) and k3. Manually the quartile can be determined by first
determining the values of n / 4 -> p, and then we get k1 and k3,
3. Decile (Decile), Deciles are observational values that divide data into 8 equal parts.
4. Percentile (percentile), Percentiles are observational values that divide data into 100 equal parts.
iv. How to write the percentile formula in Excel
To make it easier to implement the percentile function, please create a table like the one below:
1. Column A will be filled with data, in this example the data is in the range A3: A13 It should be
noted that the data must be sorted from small to large.
2. Next do a percentile calculation. To calculate the 100th percentile in cell D3 type the formula =
PERCENTILE (B3: B20,1)
3. To find the 20th percentile in cell D4 type formula = PERCENTILE (B3: B20,0.2)
4. To determine the 80th percentile in cell D5 type PERCENTILE formula (B3: B20,0.8)
62
Practical Tasks (POST TEST) 1
The data of students who made purchases in the canteen for 50 days is known as follows:
16 35 37 42 40 38 33 35 30 35
18 25 25 24 30 25 35 27 29 32
34 50 23 20 56 33 26 21 31 29
45 36 45 41 30 19 23 42 33 28
30 63 47 69 22 40 59 42 33 30
1. Observe the phenomenon in real life and take one type of sample data from two different
populations, each population of 100 samples. Calculate the quartile and percentile values
2. Match the results you get using the Excel program or SPSS.
3. Interpret the results that you have obtained!
Based on the case examples in Practicum 2 (Tables 2.2 to 2.4) determine the quartile value!
PRACTICUM 7
SIZE OF DISTRIBUTION (VARIANCE AND
DEVIATION STANDARDS)
Time : 2.5 hours
Basic competencies : Students can complete data analysis for the size of the spread
Indicator : Students can understand the material size of data dissemination with
using the Excel application
Theory
1. Range
The range or range (r) of a data group is the difference between the maximum value and the
minimum value. By looking at this measure, it can be known roughly about the variation of a data
distribution. This range value is very rough, because it does not consider other values besides the
extreme values.
2. Variance
Variance is the average of the square of the difference or the square of the deviation of all data
values against the average
average count. Variance for sample is denoted by s2, while for population is denoted by σ2
Variance (s) 2 = [Σ (Xi-X)] / (n-1) Actually what is the size of the deviation is the standard deviation
(standard deviation), however the size of the variance this is a square measure of the standard
deviation, so it can also be considered a measure of spread.
3. Standard Deviation
Standard deviation is the square root of variance. Standard deviations are often called standard
deviations. By using the average deviation of the results of observations of the spread already
takes into account all values in the data. However, because in the calculation using absolute
values, the direction of distribution cannot be known, then with the standard deviation this
weakness can be overcome, namely by making a power of value 2, so that negative values become
positive. This standard deviation is the most accurate measure of distribution.
In applying STDEV, the calculation of standard deviation manually uses the following formula:
√(∑( − )2
64 - n-1
Where:
x = data to n
= average x = sample mean value n = the amount of data
4. The coefficient of variance
The coefficient of variation is a measure of variance that can be used to compare data distributions that
have different units. If we compare various variances or two variables that have different units, it cannot
be done by calculating the absolute size of the spread.
Information
a. STDEV assumes that the argument is an example from the population. If your data represents the
entire population, to calculate the standard deviation using STDEVP.
b. Standard deviation is calculated using the "n-1" method.
c. Arguments can be numbers or names, arrays, or references that contain numbers.
d. The logical values and text representations of the numbers that you type directly into the argument
list will be counted.
e. If the argument is an array or reference, only numbers / numbers in the array or reference will be
counted. Blank cells, logical values, text, or error values in arrays or references will be ignored.
f. Arguments that error values or text that cannot be translated into numbers will cause an error.
g. If you want to enter logical values and text representation of numbers in the references as part of the
calculation, use the STDEVA function.
Practical Tasks (Post Test) 1
1. Create a table that contains data (you can use data that are not sequential from small to large values).
2. To calculate the standard deviation, in cell C3 type the following formula: = STDEV (A3: A13).
3. U To calculate variance, in Cell D3 type the following formula: = var (A3: A13).
4. Based on the data below, look for the mean, range, variance and standard deviation using Excel;
34 50 23 20 56 33 26 21 31 29
45 36 47 41 30 19 23 42 33 28
36 63 47 69 22 40 59 40 30 30
Record the results in this box.
1. During 10 repetitions this semester you get a value of 91, 79, 86, 80, 75, 100, 87, 93, 90, and 88.
What is the standard deviation of the test scores? Prove it with SPSS or Excel.
2. Make a case example using sample 45, determine the variance and standard deviation!
PRACTICUM 8
PROBABILISTIC STATISTICS
Time : 2.5 hours
Basic competencies :
Students can complete data analysis for probabilistic statistics
Indicator :
Students can understand probabilistic statistics by using the Weka application
1. PROBABILITY
Probability indicates whether something will happen or not.
For example from 10 scholars, 3 people master cisco, so the opportunity to choose a bachelor who
masters cisco is: p (cisco) = 3/10 = 0.3
2. BAYES THEOREM
with:
Example:
Asih has symptoms of freckles on her face. The doctor suspected that Asih had smallpox:
• probability of appearance of spots on the face, if Asih is exposed to smallpox p (spots2 | smallpox)
= 0.8
• Asih probability of getting smallpox regardless of any symptoms p (smallpox) = 0.4
• probability of appearance of freckles on the face, if Asih has an allergy bintik p (spots | allergies) =
0.3
• Asih probability of being allergic regardless of any symptoms p (allergic) = 0.7
• probability of appearance of spots on the face, if Asih spotty p (spots2 | spotty) = 0.9
• Spotty probabilities regardless of any symptoms p (spotty) = 0.5
then:
• Asih's probability of getting smallpox because there are spots on his face :
• The probability of Asih being allergic because there are spots on his face:
If after testing the hypothesis one or more evidence (facts) or new observations appear:
with:
e = old evidence
E = new evidence or observation
p (H | E, e) = probability of hypothesis H is true if new evidence E emerges from old evidence ep (H | E) =
probability of hypothesis H is true if given evidence E. p (e | E, H) = link between e and E if the
hypothesis is correct
P (e | E) = the link between e and E regardless of any hypothesis.
4. On the top Tab, select classification -> Select Bayes -> star
Basic Compentencies :
Students can solve the problem of Normal distribution
Indicator :
Students can use the SPSS application to analyze the Normal probability distribution theory
Theory
In the business world, a lot of data or information is quantitative, in the sense of containing the value of
a number or a certain number, for example: sales of a product in a month, Consumer Price Index,
factory production, etc.
This is very important, because if it turns out that the data is far from assuming a normal distribution
or close to normal, then the data group cannot be tested for hypotheses for normally distributed
(parametric) data. For groups of data that are not normally distributed, non-parametric statistical tests
will be performed.
The following is the profile of 40 respondents who were asked for their opinions on Roti DUTA
MAKMUR products.
Table 9.1 Bread Duta Makmur Product Data
102
3.00 3.00 1.00 1.00 286.00 30.00
3.00 2.00 3.00 1.00 620.00 31.00
3.00 3.00 4.00 1.00 288.00 32.00
1.00 3.00 2.00 2.00 275.00 28.00
1.00 2.00 2.00 2.00 452.00 33.00
1.00 2.00 3.00 2.00 415.00 34.00
1.00 2.00 4.00 2.00 478.00 35.00
1.00 1.00 4.00 2.00 893.00 32.00
1.00 1.00 2.00 2.00 952.00 28.00
1.00 1.00 1.00 2.00 689.00 32.00
2.00 3.00 3.00 2.00 75.00 30.00
2.00 3.00 4.00 2.00 159.00 35.00
2.00 3.00 1.00 2.00 200.00 29.00
2.00 1.00 3.00 2.00 900.00 34.00
2.00 1.00 2.00 2.00 850.00 30.00
2.00 1.00 4.00 2.00 845.00 32.00
2.00 2.00 3.00 2.00 315.00 33.00
3.00 3.00 4.00 2.00 219.00 34.00
3.00 3.00 4.00 2.00 129.00 32.00
3.00 3.00 2.00 2.00 275.00 30.00
3.00 1.00 4.00 2.00 875.00 35.00
3.00 1.00 4.00 2.00 833.00 37.00
3.00 2.00 4.00 2.00 621.00 32.00
3.00 1.00 2.00 2.00 655.00 35.00
Data Description:
Data for ATTITUDE variable is ordinal data with code:
1 = Like
2 = Enough Like
3 = Dislike
Data for GROUP variables are nominal data with the code:
1 = Rich
2 = Intermediate
3 = Poor
Data for the AGE variable are nominal data with the code:
1 = Children
2 = Teenager
3 = young
4 = Adult
Data for the GENDER variable are nominal data with codes:
1 = male
2 = woman
Data for INCOME and BUY variables are ratio data.
Income is the respondent's income in a month (in thousands of rupiah) either from working alone or
giving parents, while the BUY variable is the frequency of purchasing bread in a month.
Thus, the first line of the data can be read as consumers (respondents) who like the bread products
of AMBASSADOR DUTA, classified as middle income, teenage age, a man, income (pocket money
from parents) on average Rp. 300,000 per month, and in an average month buy 25 rotiproducts,
DUTA MAKMUR, etc..
From the above consumer profile, will be tested whether the INCOME variable is Normal or not?
Steps in SPSS:
From the Analyze menu, select the descriptive statistics submenu, then choose explore ...
Filling:
DEPENDENT LIST or the name of the variable to be tested. As is the case, enter the INCOME
variable.
DISPLAY or output choices to be displayed, which can be either statistical or graphical output (plots).
Because it will only test data normality, select plots.
Note:
Although it can be explained through plots, but testing with statistical tools is still more recommended
for the right decision making.
Reference:
Basic Competencies:
Students can solve hypergeometric-distribution problems
Indicator:
Students can use the SPSS application to analyze hypergeometric distribution theory
From the understanding & properties above can be concluded with the following formula:
112
X : number of class successes (x: 0,1,2,3,4, ... k)
SPSS processing
The following is data processing using SPSS:
1. Activate Variable View
2. Fill in x and PDF in the Name column
3. Fill in the Decimal column with 0 (zero) on x and 5 (five) on PDF
4. Fill in the two Measure columns with Scale
5. Activate Data View
6. Fill in the values x = 0, 1, 2, 3, 4 with the formulation Pdf.hyper (x, N, n, k)
7. On the Menu Bar click Transform >> Compute Variable
2. PT ekarasa, a company engaged in shipping goods, states that shipping goods is said to be good if
shipping 50 packages there are no more than 4 defects. If a random sample of 5 packages of goods from
50 packages are taken, what is the probability:
a. There is one defective package between 2 and 3 defective packages
b. Less than 2 packages are defective
c. Minimum 4 packages that are not defective.
1. Look for a sample data / population and take one type of sample data from two different populations.
Make a calculation using probabilistic hypergeometric using SPSS
2. Match the results you get using the Excel program or SPSS.
3. Interpret the results that you have obtained!
Reference
https://id.scribd.com/doc/179102232/Modul-3-Distribusi-Probabilitas
PRACTICUM 11
One Sample Hypothesis Test
Time: 1.5 hours
Basic Competencies:
Students can solve problems in the Hypothesis Test
Indicator:
Students can use the SPSS application to solve the Hypothesis Test questions for the average
test of one sample.
Theory
Hypotheses can be interpreted as guesses about a thing, or a hypothesis is a temporary answer to a
problem, or also a hypothesis can be interpreted as a temporary conclusion about the relationship of a
variable with one or more other variables. But according to Prof. Dr. S. Nasution definition of a
hypothesis is a tentative statement which is a conjecture about what we are observing in an effort to
understand it.
Function
To test the truth of a theory
Hypothesis test
Therefore the hypothesis needs to be formulated before data collection is done. This hypothesis is
called the Alternative Hypothesis (Ha) or the Work Hypothesis (Hk) or Hı. The work hypothesis or Hı is a
temporary conclusion and the relationship between variables has been learned from theories
related to the problem. For testing Hı, there needs to be a comparison namely the Zero Hypothesis (Ho).
Ho is also called the Statistical Hypothesis, because it is used as a basis for testing. The step or procedure
for determining whether to accept or reject a Statistical Hypothesis (Ho) is called Hypothesis Testing.
Therefore, in Hypothesis testing, drawing conclusions. Regarding population based on sample
information not the population itself, the conclusion could be wrong. In Hypothesis Testing there are
two errors or errors, viz:
The conclusion is right if we accept Ho, because indeed Ho is right, or reject Ho, because indeed Ho is
wrong. If we conclude rejecting Ho even though Ho is correct, then we have made a mistake called error
or type I error (α). (β). If the value of α is reduced, it will become large β. The α value is usually set at
0.05 or 0.01. If α = 0.05, meaning 5 out of every 100 conclusions we will reject Ho, which should be
accepted. Price (1-β) is called a Power of Test or Strength Test.
b. Two-Party Testing
Ho : α = αo
Hı : α # αo
Test an average of one sample
Testing the average of one sample is intended to test the mean or population mean μ equal to a certain
value of μo, versus the alternative hypothesis that the mean or population mean of μ is not the same as
μo. So we will test:
Ho :α = αo opposites Hı : α # αo
Ho is an initial hypothesis.
Experiment
A student conducts research on gallons of pure milk whose contents contain an average of 10 litres.
Random samples have been taken from 10 bottles whose contents have been measured, with the
following results: 10,2 ; 9,7 ; 10,1 ; 10,3 ; 10,1 ; 9,8 ; 9,9 ; 10,4 ; 10,3 ; 9,8. with α = 0,01 Analyze it
manually:
1. hypothesis Ho :α = 10 opposites Hı : α # 10
2. Statistical test t (because α is unknown or n <30).
3. α = 0.01
4. The area of criticism: t < t α/2(n-1) or t > t α/2(n-1).
5. Calculations, from data: average x = 10.06 and sample standard deviations s = 0.2459.
x-μ
t= = 0,772
s/√n
Since t = 0.772 lies between -3,250 and 3,250 concluded to accept Ho, this means that a statement that
the average 10 litres of pure milk content is acceptable.
Write down the results of the analysis below, and whether Ho was accepted?
The average time required per student to enrol in an odd semester at a tertiary institution is 50 minutes
with a standard deviation of 10 minutes. A new registration procedure using a modern machine is being
tried. With this modern machine it is known that 12 students require an average registration time of 42
minutes with a standard deviation of 11.9 minutes. With a confidence level of 0.05, test the hypothesis
that the average value of the modern machine population is less than 50? Assume that the time
population is normally distributed. solve it using the SPSS application to prove it!
PRACTICUM 12
COMPETENCE TEST 2
Time : 1,5 hours
Basic Competencies : Students can solve problems in the Hypothesis Test
Indicator : After getting this material students be able to:
Competency Material:
How to Test Competency: Students answer the questions given both in writing and practice.
PRACTICUM 13
Two Sample Hypothesis Test
Basic Competencies :
Students can solve problems in the Hypothesis Test
Indicator :
Students can use the SPSS application to complete the Hypothesis Test questions for the average
test of two samples.
Theory
For testing the average of two samples there are 2 types of data:
1. Two Paired Samples.
This means that the two samples are mutually exclusive and the number of observations
(repetitions) is the same in each sample.
2. Free / Independent Samples.
In the average test of two paired samples, the number of observations must be the same (n1 = n2),
whereas in the two free samples the number of observations does not have to be the same.
Practical Steps:
1. Enter the data above in Data View, but first we must specify the name and data type in Variable View.
2. Click Analyze Compare Means Paired Samples T-Test
3. For example, we will test the sample at the real level α = 0.05 that the wheel turn on bicycle 1 is
different from bicycle 2. The wheel turnout data (minutes) of the two bicycles is:
1. Enter the data above in Data View, but first we must specify the name and data type in Variable View.
2. Click Analyze Compare Means Independent Samples T-Test
Reference:
Winiarti, S, 2010, Diktat Statistik dasar
PRACTICUM 14
ANOVA TEST (REALIBILITY TEST AND VALIDITY TEST)
Time : 2.5 hours
Basic Competencies :
Students can complete the processing and questionnaire for data test materials with SPSS
Indicator :
Proving the truth of an item. Grains that are said to be valid / true if the item has a
contribution to the value of the measured variable.
Theory
Decision items are valid or fall used two ways, namely comparing the rxy value of the calculated results
(SPSS output) with r in the table and comparing the SPSS output probability value with the probability
value used by researchers (usually using 5% for social research and 1% for exact research). If the value of
rxy ≥ rtable or SPSS output probability ≤ 0.05, then the item is valid. Vice versa if the value of rxy <rtable
or the probability value is greater than 0.05, the item can be said to be invalid.
Example:
Will be conducted a study of the influence of leadership and work motivation on work performance
Before conducting research each instrument is tested first to get a valid and reliable instrument. The
instrument trial was only once and was carried out on 10 respondents.
147
TABLE 13.1. FORM OF QUISIONER
Respondent Form Answer (item)
Num. Form1 Form2 Form3 Form4 Form5 Form6 Form7 Form8 TOTAL
1 3 7 5 7 6 4 6 2 40
2 5 3 6 4 6 5 5 4 38
3 2 6 4 4 8 6 6 3 39
4 8 5 6 5 4 3 7 2 40
5 4 5 6 7 8 5 1 6 42
6 3 6 6 5 6 3 5 2 37
7 6 4 5 7 3 4 6 6 41
8 5 5 5 8 4 4 6 5 42
9 7 6 4 5 6 5 2 1 36
10 4 6 5 4 7 4 3 4 37
4. In the Dialog, items in the left box are entered into the Variables column, in the Correlation
coefficient select Pearson, in the Test of Significance dialog box select One Tailed, then
OK.
5. After that the output will appear as below
6. To make it look neat and make it easier for us to read the table, right click / double click on the
output table, select Pivot, Edit then select pivoting traying. An image will appear below the
Move box in the column to the layer and the statistics box in row to column.
150
7. Looks neater results, remember the second variable to be cited is TOTAL.
a. Listwise N=10
To analyze the validity test, one-tailed test of significance is used. And from the results of these
calculations we get the following interpretation,
Probability between Forms (item)1 and total items are 0,482 which mean p >0,05.
Probability between Forms (item) 2 and total items are 0,243 which mean p>0,05.
Probability between Forms (item) 3 and total items are 0,256 which mean p >0,05.
Probability between Forms (item) 4 and total items are 0,04 which mean p < 0,05.
Probability between Forms (item) 5 and total items are 0,205 which mean p >0,05.
Probability between Forms (item) 6 and total items are 0,464 which mean p > 0,05.
Probability between Forms (item) 7 and total items are 0,245 which mean p >0,05.
Probability between Forms (item) 8 and total items are 0,017 which mean p <0,05.
Reliability
The main purpose of reliability testing is to determine the consistency or regularity of the
measurement results of an instrument if the instrument is used again as a measurement tool for
an object or respondent (Triton PB, 2005).
To test the reliability of a questionnaire the Alpha-Cronbach method is used. The standard used
in determining whether or not a research questionnaire is generally a comparison between the
value of r count with r table at 95% confidence level or 5% significance level. Testing reliability
with this Alpha Cronbach method, then the calculated r value is represented by the Alpha value.
According to Santoso (2001: 227), if the alpha count is greater than r table and the alpha count is
positive, then a questionnaire can be called reliable. Cronbach's Alpha formula:
If the reliability coefficient has been calculated, then to determine the closeness of the
relationship can be used Guilford (1956) criteria, i.e.:
Example
Based on data on Validity Test:
1. Select the Analyze Menu, sort down select the Scale menu, then click the Reliability Analysis
menu...,
2. 2. A dialog box appears, enter the Item into the Item Box, then press OK
The respondents examined in the questionnaire trial were 10 (N = 10) and none of the data
was excluded or excluded from the analysis. The Cronbach Alpha value is - 0.217 with a total of 8
questions. The r table value for two-sided testing at a 95% confidence level or 5% significance (p =
0.05) can be sought based on the number of respondents. Because the Alpha Cronbach value = -
0.217 (minus value), the questionnaire tested proved unreliable.
Exercise:
1. Look for Validity and Reliability of Job Performance (Y), from the results of tabulated data for
Work Performance Variables as follows:
15 4 4 4 1 5 4 1 3 3 5 34
16 4 4 5 3 5 5 4 5 5 4 44
17 4 5 5 3 5 5 5 3 4 5 44
18 3 4 4 4 4 3 4 4 4 4 38
19 4 4 2 4 4 2 4 4 2 1 31
20 4 3 4 3 4 3 4 4 4 4 37
2. Look for Validity and Reliability of work motivation, from the results of tabulating data for Work
motivation Variables as follows:
Based on table 13.2, do the data processing and analysis of the results.