Вы находитесь на странице: 1из 336

Contacts

Minitab Inc. Minitab Ltd. Minitab SARL


Quality Plaza Unit E1, Brandon Court 1 Cité Paradis
1829 Pine Hall Road Progress Way 75010 Paris
State College, PA 16801-3008 Coventry CV3 2TE FRANCE
USA UNITED KINGDOM

Training Training Training


Phone:+1 814.238.3280 x236 Phone:+44 (0)24 7643 7550 Phone:+33 (0)1 55 33 12 36
Fax: +1 814.238.4383 Fax: +44 (0)24 7665 2888 Fax: +33 (0)1 55 33 12 39
e-mail:training@minitab.com e-mail:training@minitab.co.uk e-mail:formation@minitab.fr
www.minitab.com/training www.minitab.co.uk/training www.minitab.fr/formation

Customer Support Customer Support Customer Support


Phone:+1 814.231.2682 Phone:+44 (0)24 7643 7507 Phone:+33 (0)1 55 33 12 64
www.minitab.com/support; www.minitab.co.uk/support; www.minitab.fr/support;
customer.minitab.com to log a question customer.minitab.com to log a question customer.minitab.com to log a question

Copyright © 2003 Minitab Inc.

MINITAB® and the MINITAB logo are registered trademarks of Minitab Inc. Process Capability Sixpack™, ReportPad™, and StatGuide™ are
trademarks of Minitab Inc. Microsoft® Access and Microsoft® Excel are registered trademarks of Microsoft Corporation in the United States
and/or other countries. All other marks referenced remain the property of their respective owners.
MINITAB Release 14 Training Courses
Course Name Prerequisites Length
Introduction to MINITAB 1 day
Manufacturing:
Basic Statistics Introduction to MINITAB 1 day
Statistical Quality Analysis Introduction to MINITAB, Basic Statistics 1 day
Factorial Designs Introduction to MINITAB, Basic Statistics 1 day
Mixture Designs Introduction to MINITAB, Basic Statistics, Factorial Designs 1 day
Response Surface Designs Introduction to MINITAB, Basic Statistics, Factorial Designs 1 day
Reliability Analysis Introduction to MINITAB, Basic Statistics 2 days
DOE in Practice Introduction to MINITAB, Basic Statistics, Factorial Designs 1 day
Service Quality:
Basic Statistics Introduction to MINITAB 1 day
Statistical Quality Analysis Introduction to MINITAB, Basic Statistics 1 day
Advanced Introduction to MINITAB, Basic Statistics 1 day
1
Overview of MINITAB

Objectives
• Understand the MINITAB file structure.
• Navigate the MINITAB environment using the Project Manager.
• Learn to use MINITAB windows, menus, toolbars, and StatGuide.
• Perform basic graphical and statistical analysis.

Overview of Minitab Copyright Minitab Inc. 2002. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-1
Contents

Contents
Examples Purpose Page
Example 1 Navigate through MINITAB using the Project Manager 1-8
Coffee Consumption toolbar.
Example 2 Generate graphs and statistical output, create a graph 1-20
Analyzing Sales Revenue layout, and create a report.
Shortcut Keys Description of MINITAB’s shortcut keys. 1-39
Graph Editing Tools Description of MINITAB’s graph editing tools. 1-40

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-2
What is MINITAB?

What is MINITAB?
Minitab Inc. is a leader in delivering statistical software and Additional products
services for quality improvement, education, and research.
We provide data analysis tools that are accurate, reliable, and To further serve the quality market, we developed Minitab
easy to use. Our fundamental objective is to develop lasting Quality Companion, which integrates the soft tools needed
relationships with customers, based on mutual satisfaction for any process improvement—such as brainstorming,
and trust. consensus building, and process mapping—into one easy-to-
use software product.
History Because Minitab Inc. serves customers around the world, we
MINITAB was originally developed in 1972 at The also offer versions of MINITAB in other languages, including
Pennsylvania State University to help professors teach French, Japanese, and Korean.
statistics using computers so students could focus on learning Minitab Inc. operates offices in State College, Pennsylvania;
statistical concepts rather than on performing manual Coventry, England; and Paris, France, and has distributors
calculations. MINITAB is now used at 4,000 colleges around the world.
worldwide and has become the tool of choice for companies
focused on quality improvement.

Minitab World Headquarters


State College, PA

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-3
What is MINITAB?
Opening a new project in MINITAB
Menu bar
Toolbars

Session window

Data window

Project Manager window


(minimized)

By default, MINITAB opens with the view above, which MINITAB’s menus and toolbars are located above the Session
shows the project environmen. Note that the Session window window and provide access to all of MINITAB functions. You
is the top window and the Data window (an empty worksheet can customize the toolbars and menus so they contain the
titled Worksheet1) is on the bottom. The Project Manager is functions and features you use most often.
minimized, below the Data window.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-4
What is MINITAB?
Projects and worksheets
In a project, you can manipulate data, perform analyses, and Project windows
generate graphs. Projects contain one or more worksheets.
Project (.MPJ) files store: Toolbar Shortcut
Window Use to... shortcut key
• Worksheets • Dialog box settings Project Manage worksheets, c+I
• Graphs • Window layout Manager output, graphs, and
related documents
• Session window output • Preferences
Session View statistical output and c+M
• Session command enter session commands
history
Data View and edit worksheets c+D
Worksheet (.MTW) files store:
Graph View and edit graphs
• Columns • Design objects
• Constants • Column descriptions
• Matrices • Worksheet descriptions Closing a Data or Graph window removes the associated
worksheet or graph from your project. You cannot close the
Save your work as a project file to keep all of your data, Project Manager or Session window, although they can be
graphs, dialog box settings, and preferences together. Save minimized.
your work as a worksheet file to save only the data. A
Note By default, the Project Manager is minimized in the lower left corner
worksheet file can be used in several projects and sent of your screen.
electronically without sending an entire project.
Note Release 14 projects cannot be opened in previous versions of
MINITAB. However, you can save projects and graphs in Release 13
format.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-5
What is MINITAB?
Navigating in MINITAB
You can navigate the MINITAB environment using several Project Manager
methods.
1 Choose Window ➤ Project Manager, press c+I, or click
the icon on the standard toolbar.
Project Manager toolbar
A specialized toolbar allows direct access to elements of the The Project Manager window becomes active. Below is the
project. Many users find this method to be the easiest way to folder view.
navigate MINITAB project files. The Project Manager toolbar
is discussed on the next page.

Window menu
The Window menu lists each element of the project.
Selecting an item makes it the active window in the project.

Shortcut keys
A list of shortcut keys is on page 1-39.

Folder view
This view (example at right) shows all elements of the
project. The left pane of the folder view displays the folders;
the right pane displays the contents of the active folder. This
view is helpful when learning how MINITAB projects are
structured but, in practice, you may prefer to navigate a
project with the Project Manager toolbar or the Window
menu.
Note By default, a new MINITAB project starts with one empty worksheet.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-6
What is MINITAB?
Using the Project Manager
The Project Manager contains folders for accessing and Project Manager toolbar buttons
managing the elements of your MINITAB project:
Click... To…
Folder Use to…
Open the Session folder.
Session Access and manage Session window output
and graphs Open the Worksheet folder.
Worksheet Access all worksheets in the project Open the Graphs folder.
Graphs Access and manage graphs Open the Show Info folder.
Show Info View worksheet information not displayed in Open the History folder.
the Data window, such as column descriptions,
constants, matrices, and DOE objects Open the ReportPad.
History Access command language from analyses to Show the Related Documents folder.
quickly repeat complex tasks
ReportPad Create and store reports Note To change the size of the Project Manager window, click and drag
the divider.
Related Store links to MINITAB files, non-MINITAB files, By clicking the toolbar button again, MINITAB displays the Project
Documents and URLs Manager in folder view.

Project Manager toolbar


Use the detachable toolbar to open Project Manager folders
and to tile the Data, Session, and Graph windows alongside
the Project Manager.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-7
Overview of MINITAB

Overview of MINITAB
Example 1 Coffee Consumption

Problem Data set


This example introduces the MINITAB environment and COFFEE.MPJ
demonstrates navigation tools.

Data collection
The project includes four worksheets which contain data on
coffee consumption for selected countries, coffee-drinking
habits in the United States, and a designed experiment
involving coffee.

Tools
• Project Manager

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-8
Overview of MINITAB
Opening a project
Open the coffee project and examine the graphical and Open Project
statistical results.
1 Choose File ➤ Open Project.
Tip You can also use the toolbar button to open a project.
2 Choose COFFEE.MPJ.
3 Click Open.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-9
Overview of MINITAB
Interpreting your results

A MINITAB project always opens exactly as it was last saved. What's next
In this case, the project was saved with the Graphs folder
open and the graphs tiled, as shown above. The project Navigate the coffee project using the Project Manager
contains not only data, but also includes graphs and text toolbar.
output that were generated using the data in the project.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-10
Overview of MINITAB
Accessing worksheets in a project
MINITAB stores all data in worksheets. Examine the Show Worksheets Folder
worksheets in the coffee project. The second button from the
1 Click on the Project Manager toolbar. (If you click the
left in the Project Manager toolbar displays the worksheet
button again, the Project Manager appears in folder view.)
names. When you hover the cursor over each button, a tool
tip appears with the button’s name.

In the example to the right, each folder in the left pane


represents a worksheet. MINITAB displays the active
worksheet in the right pane.

Current worksheet
The active or current worksheet is the one MINITAB uses for
analysis. In the Project Manager, MINITAB indicates a
worksheet is active by turning the folder green. In this
example, the Consumption.MTW worksheet is the active
worksheet.
To make a different worksheet active, double-click the folder
icon next to the worksheet name.
Note MINITAB displays the name of the current worksheet in the status
bar.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-11
Overview of MINITAB
Worksheets
Columns Maximize a worksheet
Each MINITAB worksheet can contain up to 4,000 columns. 1 Click in the upper right corner of the worksheet to
Each column is identified by a number. You can name maximize CONSUMPTION.MTW.
columns up to 31 characters, including spaces.
The letter after the column number indicates the data type:
• D = date/time
• T = text (alphanumeric)
If no letter appears, the data are numeric.
Note MINITAB determines the data type of each column by the first value
entered.

Data entry arrow


The arrow in the upper left corner of each worksheet denotes
the entry direction:
• When entering data with the arrow in the vertical position,
pressing the E key moves the cursor down to the
next row in the column.
• With the arrow in the horizontal position, pressing the
E key moves the cursor across to the next column in
that row.
To change entry direction, click the data entry arrow.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-12
Overview of MINITAB
Descriptions
Note the red triangle in the upper left corner of the Description
Consumption worksheet. This triangle indicates an available
1 Hover the cursor over the red triangle in the upper left corner
worksheet description. Hover the cursor over the red triangle
of the worksheet to view its description.
to view the description.
You can add descriptions for projects, worksheets, and all
columns in a worksheet.

Project descriptions
To add or view an existing project description, choose File ➤
Project Description. Project descriptions are not represented
by red triangles.

Worksheet descriptions
To add or edit a worksheet description, click anywhere in the
worksheet to make it active, then choose Editor ➤
Worksheet ➤ Description. You can also double-click the
red triangle to view or edit an existing description.

Column descriptions
To add or edit a column description, highlight the column
and choose Editor ➤ Column ➤ Description or right-click
and choose Column ➤ Description.You can also double-
click the red triangle to view or edit an existing description.

What's next
Examine the Show Info folder.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-13
Overview of MINITAB
Show Info folder
The Show Info folder displays the following summary Show Info
information for a worksheet:
1 Click on the Project Manager toolbar.
• Column name
• Column number (Id)
• Number of observations (Count)
• Missing data count
• Data type
• Description
This view provides a fast way to examine column lengths,
missing data counts, and descriptions. You can view the
columns folder for the active worksheet by clicking the Show
Info button on the Project Manager toolbar. Double-click a
column to display that column in the Data window.

What's next
Examine results in the Session folder.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-14
Overview of MINITAB
Session folder
The Session folder allows you to access both text and Show Session Folder
graphical output. To view the Session folder, click the Show
1 Click on the Project Manager toolbar.
Session Folder button on the Project Manager toolbar.
2 In the Project Manager window, double-click Response
To view output or a graph, double-click the icon of the Surface Regression.
desired output item in the Session folder window. Notice that
graphs are represented with graph icons and text output with
document icons. MINITAB displays the output or graph in the
window to the right of the Project Manager.

What's next
Examine graphs in the Show Graphs folder.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-15
Overview of MINITAB
Graphs folder
The Graphs folder allows you to access and manage the Show Graphs Folder
graphs in your project. To view the Graph folder, click the
1 Click on the Project Manager toolbar.
Show Graphs Folder button on the Project Manager toolbar.
2 In the Project Manager window, double-click Contour Plot of
Display a graph Color vs. Time, Temp.
To display a graph, double-click the graph title in the Project
Manager window. MINITAB displays the graph in the window
on the right.

Tile graphs
To tile graphs, highlight the graphs, then right-click and
choose Tile. (To highlight consecutively listed graphs, hold
the s key when selecting the graphs; to highlight graphs
that are not consecutively listed, hold the c key.)

What's next
View the ReportPad.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-16
Overview of MINITAB
ReportPad
The ReportPad is a folder within the project that can store Show ReportPad
graphs, Session window output, and text in an editable
1 Click on the Project Manager toolbar.
format. After you have analyzed your data, you can save your
results to the ReportPad.

Edit reports
Basic editing capabilities such as changing fonts and
alignment are found in the Editor menu.

Export to a word processor


Exporting results from the ReportPad to a word processor is
discussed on page 1-37.

What's next
View related documents.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-17
Overview of MINITAB
Related Documents folder
You can store links to files or web pages in the Related Show Related Documents
Documents folder. You can link to word processing
documents, spreadsheets, presentations, or any Internet 1. Click on the Project Manager toolbar.
address (uniform resource locator, or URL).

File links
The files are not embedded in the MINITAB project; therefore,
to access the information, you must have access to the drive
or URL from your computer.
To add a link to a file or a Web site, click the Related
Documents button on the Project Manager toolbar. Right-
click the Related Documents folder in the Project Manager,
then choose Add Link. You can either type the path to the
link or you can browse for the file and let MINITAB complete
the path. To edit an existing link, right-click the link in the
Related Documents folder, then choose Edit Link.

Descriptions
You can include a description of each document by typing
text in the Description text box under the path. To add the
description after you have already added the link, right-click
the document link, then choose Edit Link.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-18
Overview of MINITAB
Final considerations

Summary and conclusions Additional considerations


• You can save data in a single worksheet or as part of a Other elements of the project, such as the History folder and
project file. menu bars, are discussed later in the course and in other
• You can include multiple worksheets in a project, along courses.
with statistical results, graphs, and other information.
• The Project Manager toolbar provides a simple and easy
way to navigate MINITAB projects.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-19
Overview of MINITAB

Overview of MINITAB
Example 2 Analyzing Sales Revenue

Problem Data set


An analyst for a stationery company is preparing a sales CARDSALES.MPJ
report for a new line of greeting cards that the company
intends to distribute nationally. In a pilot launch, the
company sold the cards at a variety of stores in New York Variable Description
and Florida for three months. Date Date
FL Sales Revenue from Florida stores
Data collection
NY Sales Revenue from New York stores
The analyst collected daily-sales amounts for the two
locations during the three-month test period and stored them Month Month
in a MINITAB project.

Tools
• Dotplot
• Graphical Summary
• Display Descriptive Statistics

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-20
Overview of MINITAB
Opening a project
Open the MINITAB project CARDSALES.MPJ. Open project
1 Choose File ➤ Open Project.
Viewing the worksheet
2 Highlight CARDSALES.MPJ.
Worksheets contain data arranged in columns and rows.
3 Click Open.
Columns
Each column in a MINITAB worksheet represents a variable.
The column name is the variable name. You select columns
for analysis in the appropriate dialog box.
A column can contain one of three types of data: numeric,
text (-T), and date/time (-D). Columns with numeric data do
not have a letter designation.
In this example, FL Sales and NY Sales are numeric
columns, Date is a date/time variable, and Month is a text
column.
Note To change the data type for a variable, choose Data ➤ Change
Data Type.

Rows
In this example, each row of data represents an observation
(case).

What's next
Create dotplots of the sales data.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-21
Overview of MINITAB
Creating dotplots
Graphs are an important preliminary step in data analysis. Dotplots
They enable you to examine the data, and identify patterns
1 Choose Graph ➤ Dotplot.
and potential problems.
2 Under Multiple Y's, choose Simple as shown in the dialog
Dotplots box below. Click OK.
Use dotplots to examine the shape and spread of sample data.
Begin the analysis by examining a dotplot of the sales data
for each location.

3 In Graph variables, enter FL Sales and NY Sales by double-


clicking each variable, or by highlighting them and clicking
Select.
4 Click OK.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-22
Overview of MINITAB
Interpreting your results
The graph shows the sales data during the three-month period
for both locations. It appears that, on average, New York
experienced higher sales than Florida. Also, an unusually low
value for New York sales appears on the far left side of the
plot.

Correcting the outlier


Hover the cursor over the New York sales value of 0. The
data tip identifies the value as row 20 in the worksheet. After
checking with the person who entered the data, you discover
that the sales information for this date is missing. Instead of
entering 0, you should enter an asterisk (∗) to indicate that
the value is missing.

What's next
Change observation 20 and update the graph.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-23
Overview of MINITAB
Removing incorrect values
The sales data on 1/20/00 for the New York location should Show Info
be recorded as missing, not as a zero.
1 Click in the Project Manager toolbar.
Backspace 2 In the NY Sales column, highlight row 20 as shown below.
When you must preserve the association among data in
different columns, always use the N key to clear
the contents of a cell. Notice that, after clearing the cell, the
length of the NY Sales column is still 91 observations with 1
missing value and the association among the data in the
columns is maintained.
Tip You can also clear cells using the eraser icon on the toolbar, or 3 Press the N key, then press e.
by right clicking in the cell and choosing Clear Cells.

Delete
The D key removes the cell from the worksheet and the
remaining cells in the column shift up. If the data in the
column are associated with data in other columns, this
approach corrupts the association.

Missing value symbol (∗)


MINITAB inputs an asterisk in the cell to indicate data are
missing for numeric or date/time data; for text data, MINITAB
leaves the cell blank. Most commands exclude from analysis
all rows with a missing value. You can display the number of
missing values for a variable using Display Descriptive
Statistics.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-24
Overview of MINITAB
Updating a graph
The icon in a Graph window’s upper left corner indicates the Update graph
current status of the graph. When data in the associated
1 To choose the dotplot, click in the Project Manager
worksheet are modified, the icon turns yellow. The table
toolbar. Double-click the dotplot.
below outlines the meaning of the graph icon colors.
2 Choose Editor ➤ Update ➤ Update Graph Now, or right-
Icon color Indicates the Because... click the graph and choose Update Graph Now, as shown
graph is...
below.
Green plus Current You manually updated the
graph, have specified
MINITAB to automatically
update, or haven't changed
the graph data.
Yellow minus Not current, MINITAB is waiting for you to
but data have manually update the graph.
changed
Red “X” On hold (only You are changing the data
occurs when and some condition exists
the graph is that prevents the update.
set to update (Column lengths may be
automatically) unequal or data values
unacceptable.) MINITAB
waits for the condition to be
resolved.
White “X” Not current, The graph is not a core
and is graph, or cannot otherwise
ineligible for be updated.
updating

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-25
Overview of MINITAB
Interpreting your results
The graph updates to reflect the modified data. In the graph
on the right, the x-axis automatically adjusts to represent the
current data and the icon in the upper left corner is now a
green plus.
In general, New York sales appear higher than Florida sales.
Graph update choices are:
Update Graph Now Updates an individual graph
for current data changes
Update Graph Automatically Updates the selected graph
and all other graphs
created with the same
dialog or command
whenever the worksheet
changes
Update All Graphs Now (under Updates all open graphs for
Editor ➤ Update or Window current data changes
menu)

What's next
Display a graphical summary for each sales location.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-26
Overview of MINITAB
Displaying a graphical summary
The graphical summary provides graphical representations of Graphical Summary
the data and some descriptive statistics. The summary
1 Choose Stat ➤ Basic Statistics ➤ Graphical Summary.
components are:
• Histogram with normal curve—Shows the distribution of 2 Complete the dialog box as shown below.
the data overlaid with a normal curve
• Boxplot—Summarizes information about the shape,
spread, and center of the data
• Confidence intervals—Provide interval estimates of the
population parameters of the data
• Anderson-Darling normality test—Tests whether the data
follow a normal distribution
• Summary statistics—Describe important features of the
data, such as the mean and standard deviation

3 Click OK.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-27
Overview of MINITAB
Interpreting your results
The graphical summaries for New York and Florida sales are
shown on the right.

Histogram
The histograms on each summary appear approximately bell-
shaped, indicating the data are normally distributed. You can
use the Anderson-Darling test to confirm that the data are
normal. This test is discussed in detail in the MINITAB Basic
Statistics course.

Confidence intervals
Confidence intervals are estimates of the true population
parameter, based on a sample. Parameters, such as the
population mean, median, and standard deviation are
commonly estimated, as shown in this output. The intervals
provide a likely range of values for a population parameter,
given a certain confidence level. In this example, the
confidence level is 95%.

Quartiles
Examine the minimum, maximum, and quartile values to
understand the spread of data. This example shows that 50%
of the data fall between the first and third quartiles.

What's next
Display descriptive statistics.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-28
Overview of MINITAB
Displaying descriptive statistics
You can display descriptive statistics and graphs for any Display Descriptive Statistics
column in the Session window. If your data have groups, you
1 Choose Stat ➤ Basic Statistics ➤ Display Descriptive
can obtain statistics for subsets of data within a column by
Statistics.
choosing a by-group (or “By”) variable.
2 Do one of the following:
Variables • Double-click FL Sales and NY Sales in the list box.
Choose the columns you want to analyze. • Click FL Sales, then click Select. Repeat for NY Sales.
3 Click Statistics.
By variables (optional)
4 Uncheck SE of mean, as shown below.
To display descriptive statistics separately for groups within
your data, enter the column containing the By variable.
Note The column length of a By variable must equal the column length of
the variable.

Statistics
Choose the statistics you want to display.

Graphs
Choose from the following graphs: histogram of data,
histogram with normal curve, individual value plot, and
boxplot. 5 Click OK in each dialog box.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-29
Overview of MINITAB
Interpreting your results
The descriptive statistics for Florida and New York sales Descriptive Statistics: FL Sales, NY Sales
include: Variable N N* Mean StDev Minimum Q1 Median Q3 Maximum
FL Sales 91 0 1513.8 46.5 1420.0 1483.0 1514.0 1546.0 1640.0
Statistic Description NY Sales 90 1 1690.1 48.7 1587.0 1655.0 1687.0 1722.0 1813.0

N Number of observations; count


N* Number of missing observations
Mean Average; measure of central tendency
StDev Standard deviation: a measure of variability or
spread in the data
Minimum Lowest data value
Q1 First quartile (25th percentile)
Median Midpoint (50th percentile)
Q3 Third quartile (75th percentile)
Maximum Highest data value

What's next
Access the MINITAB StatGuide.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-30
Overview of MINITAB
Accessing the StatGuide
The StatGuide provides statistical guidance for interpreting StatGuide
the results of your analysis in MINITAB. It contains links to
1 Right-click the Session window output of descriptive
definitions and interpretations of sample output.
statistics.
The StatGuide contains two windows: 2 Choose StatGuide.
• The MiniGuide window shows a list of topics available
3 In the MiniGuide window, click Standard Deviation, as
for an area and sample output, where appropriate.
shown below.
• The main StatGuide window contains general
explanations of statistical procedures and specific
interpretations of the output in the MiniGuide.
You can access the StatGuide several ways:
1 Put your cursor in the Session window next to the output
that corresponds to the StatGuide topic you want to
review or make a Graph window active. Then, do one of
the following:
• Right-click the output in the Session window or the
Graph window, then choose StatGuide.
• Click the StatGuide button on the toolbar.
• Press s+1.
2 In the Project Manager Session folder, right-click the
output or graph, then choose StatGuide.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-31
Overview of MINITAB
Interpreting your results
MiniGuide window
The MiniGuide displays sample output and links to
definitions of terms.

StatGuide window
The StatGuide displays definitions of statistics,
interpretations of sample output, and in-depth topics.
Tip To see a complete list of topics for an area, click next to
the title in the StatGuide window.

Tip Use the arrow keys to browse through related topics.

What’s next
Create a layout of the graphs in the Graphs folder.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-32
Overview of MINITAB
Creating a multiple graph layout
Use the graph layout tool to place multiple graphs on the Graph Layout
same page.
1 Click , then double-click the dotplot in the Project
Manager. Then, click the graph to make it the active window.
Rows
2 Choose Editor ➤ Layout Tool.
Choose the number of rows for the layout.
3 Double-click Summary for FL to place the graph in the layout
Columns window.

Choose the number of columns for the layout.

Adding graphs to the layout


Select the graph you want to add, then click the cell in which
you want to place the graph. Then, click the right arrow
button to place the graph in the layout. The left arrow
removes a graph from the layout. You can also double-
click a graph to move it to the layout. When a graph is in the
layout, you can click and drag it to a different cell.

4 In the layout window, click the bottom right square. Then,


double-click Summary for NY.
5 Click Finish.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-33
Overview of MINITAB
Interpreting your results
Editing graphs in the layout
You can edit individual graphs in the layout. Double-click a
graph to access the editing tools.

Updating graphs in a layout


If the worksheet data change for graphs in a multiple layout,
MINITAB cannot update the layout. You must update the
individual graph and replace the outdated version with the
new one in the layout.

What's next
Create a summary report using the ReportPad.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-34
Overview of MINITAB
Creating reports
ReportPad Append to Report
ReportPad is a folder within the Project Manager that can 1 Click to view the graphs in the project.
store graphs, Session window output, and text in an editable 2 In the Project Manager, right-click the Layout. Choose
format. You can add items to a report from the session folder, Append to Report.
or from individual graphs and Session window output.
3 Click to view the Session folder.
To add a graph to the ReportPad, you can either:
4 In the Project Manager, right-click Descriptive Statistics: FL
• Right-click the graph title in the Project Manager, then Sales, NY Sales. Choose Append to Report, as shown
choose Append Graph to Report. below.
• Right-click an active graph, then choose Append Graph to
Report.
To add Session window output to the ReportPad, you can
either:
• Right-click the output title in the Project Manager, then
choose Append to Report.
• Right-click in the Session window output, then choose
Append Section to Report.

5 To view the ReportPad, click on the Project Manager


toolbar.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-35
Overview of MINITAB
Interpreting your results
The ReportPad displays the graphs and output that you
appended to the report. You can move items in the ReportPad
by highlighting and dragging them to the desired location.
You can also type text in the ReportPad.

Editing the report


To edit the report, use the tools in the Editor menu. You can
edit:
• Text alignment
• Text format (bold/italic/underline)
• Fonts and color

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-36
Overview of MINITAB
Creating reports in other applications
Word processing programs usually have more formatting Copy and Paste into Word Processor
options than ReportPad. To create a word processor report
1 Open a new or existing Microsoft Word document.
from MINITAB output, you can:
• Copy or move the contents of ReportPad to a word For graphs:
processor. 2 Right-click a graph in MINITAB and choose Copy Graph.
• Copy and paste Session window output and graphs from
3 Place the cursor in the Microsoft Word document and press
MINITAB directly into a word processor.
c+V.
Note To paste text and graphs in versions of Microsoft Word earlier than
Word 2000, use Paste Special, then uncheck Float over text. For Session window output:
You can also save a report in HTML format, for use on the 4 Highlight a block of text in the Session window output, right-
Internet. click in the output, then choose Copy.
5 Place the cursor in the Microsoft Word document and press
Copy to Word Processor c+V.
1 Right-click the ReportPad folder in the Project Manager, then
choose Copy to Word Processor. Save a report in HTML format
Caution Choosing Move to Word Processor deletes the content from 1 Right-click the ReportPad folder in the left pane of the Project
ReportPad at the same time it is copied to the word processor. Manager window.

2 In Save in, enter a drive (for example, A, C, or D), then 2 Choose Save Report As.
choose the appropriate folder. 3 In Save in, enter a drive (for example, A, C, or D), then
3 In File Name, enter a name for the new report. choose the appropriate folder.

4 Click Save. 4 In File name, enter a file name.

Note When you save to the hard drive, MINITAB saves the report as an .rtf 5 In Save as type, choose Web Page (*.htm; *.html).
file and automatically opens it in the default word processor for your
computer. 6 Click Save.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-37
Overview of MINITAB
Final considerations

Summary and conclusions

Project Use the Project Manager to manage and access


Manager various elements of a project through the
Session, Worksheets, History, Graphs, Info,
ReportPad, and Related Documents folders.
Graphs Graphs help you visualize data. You can update
MINITAB graphs to reflect changes in the source
data. You can display multiple graphs on the
same page using the Layout tool.
Session Displays output from your analyses in MINITAB.
Window
StatGuide Use the StatGuide to access statistical guidance
for any graphical or text output.
ReportPad Use the ReportPad to generate reports within
MINITAB. You can easily export the report to a
word processor or save it as an .rtf or .html file.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-38
Shortcut Keys

Shortcut Keys
Help 1 Data Window Dialog Boxes
StatGuide s+1 Next row/column c+e Edit last dialog box c+E
Go to … c+G Reset defaults 3
Files 1 screen down, up, left, or right c+arrow key Move forward one item t
New worksheet or project c+N Beginning of worksheet c+h Move backward one item s+t
Open project c+O End of worksheet c+j Move anywhere in a dialog box a+underlined letter
Save project c+S Format numeric column c+Y Choose Cancel q
Print window c+P Enter cell edit mode a+e Open a drop-down list 4 or a+Z

MINITAB Windows Graphics Menus


Command Line Editor c+L Close active graph window c+4 Open or choose a menu item a+underlined letter
Session c+M Move selected object(s) … Open the shortcut menu s+0
Data c+D 1 pixel arrow key
Project Manager c+I 5 pixels s+arrow key Canceling Operations
Switch between windows c+6 10 pixels c+arrow key Cancel a %macro, Exec, or graph c+k
50 pixels s+c+arrow key
Session Window Close active graph window c+4 Program Manager
Submit command e (default) Edit selected graph object c+T Show Session folder c+a+M
Add line break c+e (default) Duplicate graph 3 Show Worksheet folder c+a+D
Next command 2 Show Graphs folder c+a+G
Previous command a+2 Editing (cells or text) Show Info c+a+I
Apply I/O font a+1 Copy c+C* Show History c+a+H
Apply Title font a+2 Cut c+X* Show ReportPad c+a+R
Apply Comment font a+3 Paste/Insert c+V* Show Related Documents c+a+L
Select an area s+arrow key*
Select All c+A

*These shortcuts also work inside dialog boxes

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-39
Graph Editing Tools

Graph Editing Tools


Graph editing toolbar Graph annotation toolbar

Toolbar item Use to... Toolbar item Use to...


Select a graph item for editing. Put the active graph in select mode or deselect
a selected object.
Access the editing dialog box for the selected
item. Add text to a graph
Put the active graph in select mode. Draw rectangles and squares on a graph.
Put the active graph in brush mode Draw ellipses and circles on a graph.
Put the active graph in crosshairs mode. Draw lines on a graph.
Put the active DOE contour plot in plant flag Add marker symbols on a graph.
mode.
Draw a polyline (a series of connected lines)
Select a graph item to add. Depending on the on a graph.
item, MINITAB may prompt you for more
Draw polygons on a graph.
information.
Delete the selected item.
Make the graph appear larger or smaller with
the zoom option.

Overview of Minitab Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1-40
2
Data Entry and Basic Charts

Objectives
• Enter data into a MINITAB worksheet and learn how to use the Autofill feature.
• Create a Pareto chart and bar charts.
• Update a graph after changing data in the worksheet.
• Learn how to edit graphs after they have been created.

Data Entry and Basic Charts Copyright Minitab Inc. 2002. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-1
Contents

Contents
Examples and Exercises Purpose Page
Example 1 Enter data into a MINITAB worksheet and create a Pareto 2-3
Invoice Mistakes chart to summarize frequency data.
Example 2 Enter data into a MINITAB worksheet using Autofill, create 2-8
Surface Defects on Brass Fittings a bar chart, and update graph.
Example 3 Enter data into a MINITAB worksheet using Autofill, create 2-16
Sales Revenue a bar chart with reference lines, and update graph.
Exercise A Create a bar chart with reference lines to analyze data. 2-29
Solar Cell Manufacturer
Exercise B Create a Pareto chart to summarize frequency data. 2-30
Quality Assurance Testing
Exercise C Create a bar chart to analyze data. 2-31
Travel Expenses

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-2
Data Entry and Basic Charts

Data Entry and Basic Charts


Example 1 Invoice Mistakes

Problem Data set


A quality team notices a high frequency of mistakes on None
shipping invoices. These mistakes result in canceled orders,
customer dissatisfaction, and excessive rework. To gain
approval for a quality initiative, the team wants to show Variable Description
upper management a graphical summary of the types of Mistake Type of invoice mistake
mistakes and frequency of occurrences.
Frequency Frequency of each type of invoice
mistake
Data collection
The quality team records mistakes reported by customers for
six months.

Tools
• Pareto chart

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-3
Data Entry and Basic Charts
Entering data
Because the data are not currently stored in a computer New MINITAB Project
database or software application, you must enter the data
1 Choose File ➤ New ➤ Minitab Project or press c+N.
directly into MINITAB.
Click OK.
Tip The arrow in the upper left corner of the Data window controls the
direction the cursor moves when you press e. You can click the 2 Enter the column names and data as shown below.
data-entry arrow to change its direction, or use Editor ➤
Worksheet ➤ Change Entry Direction. When the arrow points
down, pressing e moves the active cell to the next cell in the
column; when the arrow points to the right, pressing e moves
the active cell to the cell in the next column in the row.

What's next
Analyze the data using a Pareto chart.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-4
Data Entry and Basic Charts
Creating Pareto charts
One appropriate way to graphically summarize frequency Pareto Chart
data is with a Pareto chart.
1 Choose Stat ➤ Quality Tools ➤ Pareto Chart.
Pareto charts are a type of bar chart where the bars are 2 Choose Chart defects table.
arranged in order of descending frequency. By ordering the
bars from largest to smallest, a Pareto chart can help you 3 Complete the dialog box as shown below.
determine where most problems occur. A cumulative
percentage line helps you judge the added contribution of
each category.
Note You can also use Graph ➤ Bar Chart to display frequency data.
However, in this case, a Pareto chart provides more information
than a bar chart.

4 Click OK.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-5
Data Entry and Basic Charts
Interpreting your results
The bars on the chart indicate the number of mistakes for
each category (left y-axis scale) and percentages of total
mistakes (right y-axis scale). The line above the bars
indicates cumulative percentages.
The table at the bottom displays the following statistics for
each category:
• Count of mistakes
• Percentage of total mistakes
• Cumulative percentage of total mistakes
The Pareto chart indicates that:
• Pricing mistakes (48) account for the largest number of
errors (39.7% of total).
• Pricing, product, and quantity mistakes account for over
80% (87.6%) of all invoice errors.
Recall that these mistakes are based on customer complaints
and may not represent all shipping invoice errors. For
example, customers may not complain when a mistake
occurs in their favor, such as when they receive more product
than they ordered.
Note By default, MINITAB generates bars until the cumulative percent
reaches 95, and then groups the remaining categories into a bar
named Others. You can change the cut-off percent for the
Others category.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-6
Data Entry and Basic Charts
Final considerations

Summary and conclusions Additional considerations


• The Pareto chart indicates that most of the mistakes A Pareto chart of counts may not provide all the information
involved pricing, product, or quantity errors. Initially, the you need to make a decision. For example, an infrequently
quality team should focus its attention on these three occurring defect may be very costly. If you assign costs to the
shipping invoice mistakes. defects, you can create a weighted Pareto chart to help
• Eliminating or reducing these mistakes may increase identify less common, but more costly, defects. Defects can
customer satisfaction, but it may not eliminate all of the also be weighted by severity, person-hours to fix, or other
shipping invoice errors. factors. Weighted Pareto charts are discussed in the
Statistical Quality Analysis training course.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-7
Data Entry and Basic Charts

Data Entry and Basic Charts


Example 2 Surface Defects on Brass Fittings

Problem Data set


A process engineer is concerned about the high occurrence of None
surface defects found on brass fittings. She wants to
determine whether the frequency of defects differs by
production line. In addition, she believes the speed of the Variable Description
production lines may affect the frequency of defects. Line Manufacturing production line (A, B, C, D)
Speed Speed of manufacturing line (Low, Medium, High)
Data collection
Defects Number of observed surface defects
The engineer designs an experiment that runs 4 different
lines (A, B, C, and D) at 3 different production speeds (low,
medium, and high). From each of the 12 combinations of
production line and speed, a technician takes 450 brass
fittings, inspects them, and records the number of defects. A
single fitting can have more than 1 surface defect.

Tools
• Autofill
• Bar Chart
• Updating a graph

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-8
Data Entry and Basic Charts
Entering data
Because the data are not currently stored in a computer Using Autofill
database or software application, you must enter the data
1 Choose File ➤ New ➤ Minitab Project or press c+N.
directly into MINITAB.
Click OK.
MINITAB provides several options for entering repeated data, 2 Enter the column names and data as shown in the worksheet
including Copy and Paste, Autofill, or Make Patterned Data. below on the left.
In this example, use Autofill to enter the data in the Speed
column. 3 Select the first three cells in the Speed column. Place the
cursor over the Autofill handle in the bottom right corner of
Note The data are also available in SURFACEDEFECTS.MPJ.
the third cell until the cross symbol (+) appears.
4 Click and drag to row 12. MINITAB automatically fills in the
What's next data, repeating the Low, Medium, and High sequence, as
Analyze the data using a bar chart. shown in the worksheet on the right.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-9
Data Entry and Basic Charts
Creating bar charts
A bar chart is a useful way to graphically summarize Bar Chart
frequency data.
1 Choose Graph ➤ Bar Chart.
MINITAB provides many types of bar charts. In this example, 2 In Bars represent, choose Values from a table.
the data are in summary form, so the bars represent values
from a table. Because the data include only one response 3 Under One column of values, choose Cluster. Click OK.
variable in a single column, choose One column of values. To
4 Complete the dialog box as shown below.
display bars side-by-side for each combination of the factors
Line and Speed, choose Cluster.

5 Click OK.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-10
Data Entry and Basic Charts
Interpreting your results
The bar chart indicates that:
• Among the samples taken, production line C has the
highest overall number of surface defects for all speeds.
• For all lines, the High production speed produces the
largest number of surface defects.
• No systematic difference in the frequency of surface
defects appears between the Low and Medium production
speeds.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-11
Data Entry and Basic Charts
Updating a graph
After consulting with the technician, you learn that he Updating a graph
inadvertently recorded the number of defective parts, not
1 Correct the data in the worksheet. In the Defects column,
defects, for one of the line speed combinations. Production
row 11, change 11 to 21.
line D actually had 21 surface defects, not 11, at the Medium
production speed. 2 Right-click the graph and choose Update Graph Now.
MINITAB indicates on a graph when data in a worksheet have
changed. When you correct the data in the worksheet, the
symbol in the upper left corner of the graph changes from a
green plus sign to a yellow circle with a minus sign. After
you update the graph, the symbol returns to a green cross.
Note If you have turned on the Automatic Update feature on the graph,
MINITAB immediately updates the graph.

The updated graph is shown on the right.

What’s next
Change the colors of the bars on the graph to make it easier
to interpret.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-12
Data Entry and Basic Charts
Changing the colors of the bars
MINITAB makes it easy to change the characteristics of a Editing a graph
graph once you have created it. You can color code the bars
1 Click once in any bar on the graph. MINITAB highlights all the
by the variable speed to make the graph easier to read.
bars.
Selecting and editing graph elements 2 Once the bars are highlighted, double-click any bar.
You can access graph editing dialogs by double-clicking a 3 Choose the Groups tab. Complete the dialog box as shown
graph item or by selecting the item and choosing edit. below.
MINITAB provides several ways to select and edit options.
To select a graph item, do one of the following:
• Click the graph item you want to edit.
• Right-click the graph and choose Select Item. Choose the
item you want to edit.
• Choose Editor ➤ Select Item. Choose the item you want
to edit.
• On the Graph Editing Toolbar, select the item you want to
edit from the drop-down menu.
To edit a selected graph item, do one of the following:
• Right-click the highlighted item on the graph and choose
the edit menu option.
• Choose Editor ➤ Edit or press c+T. 4 Click OK.

• On the Graph Editing Toolbar, click the Edit icon .

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-13
Data Entry and Basic Charts
Interpreting your results
For bar charts, when you choose a variable for attribute
assignment, Minitab changes fill attributes assigned to each
category value of the variable. In this example, MINITAB
changed the colors of the bars, so each bar of the same speed
is the same color. For example, the bars corresponding to
high speed are all blue.
Note that Minitab displays a legend on the graph, so you can
easily determine the color that corresponds to each variable
level.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-14
Data Entry and Basic Charts
Final considerations

Summary and conclusions Additional considerations


• Production line C produces the most number of defects, • You can use more sophisticated statistical tools to analyze
while the other three lines are roughly equal. these data. For example, you can use an analysis of
• The analysis supports the engineer's hypothesis that the variance (ANOVA) procedure to determine whether the
production speed may affect the frequency of surface observed differences in the bar graph are statistically
defects. The bar chart indicates that more surface defects significant.
occurred at the high production speed. • This analysis assumes that the production speed is held
constant; for example, Low production speed corresponds
to an exact value and does not vary. If speed varies, record
the exact values instead of simply Low, Medium, and
High. You can then use other graphical procedures to
visualize the data more clearly.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-15
Data Entry and Basic Charts

Data Entry and Basic Charts


Example 3 Sales Revenue

Problem Data set


A sales manager wants to present a visual summary of the SALESREVENUE.MPJ
sales revenue for three territories for each quarter of the
previous year. The graphical display also needs to show
whether or not each territory met its goal. The territory goals Variable Description
are: Sales Sales revenue (in millions; US dollars)
• East: $5M
Territory Geographic sales territory
• Central: $4M
Quarter Fiscal quarter
• West: $6M

Data collection
The 3 territories record their revenues for each quarter.

Tools
• Autofill
• Bar Chart
• Updating a graph

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-16
Data Entry and Basic Charts
Entering data
Because the territory and quarter data are not currently stored Using Autofill
in a computer database or software application, you must
1 Choose Open ➤ Project and choose
enter the data directly into MINITAB. Note that the sales
SALESREVENUE.MPJ.
revenue, in millions, are already in the worksheet.
2 In Column C2, enter the column name Territory and in
Use Autofill to enter the data for Quarter. column C3, enter the column name Quarter.
Note With the Autofill function, by default, MINITAB continues the
sequence of values consecutively when possible. For example, 3 Enter the text values in the Territory column, as shown below.
if you enter A in row 1 and B in row 2 and use Autofill, MINITAB
enters C in the next row. If you want to repeat a sequence, hold
4 In the Quarter column, enter data in the first four cells, as
the c key down while dragging the Autofill handle. For the shown below in the worksheet on the left. Highlight the first
same example, MINITAB enters A in the next row instead of C. four cells. Place the cursor over the Autofill handle in the
bottom right corner of the fourth cell until the cross symbol (+)
appears.
What’s next
5 Press c and click and drag to row 12. MINITAB repeats the
Create a bar chart to examine the sales data. 1, 2, 3, 4 sequence, as shown in the worksheet on the right.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-17
Data Entry and Basic Charts
Clustering bar charts
Create a bar chart to display the quarterly sales revenue data. Bar Chart
In this example, the data are in summary form, so the bars 1 Choose Graph ➤ Bar Chart.
represent values from a table. Because the data include only 2 In Bars represent, choose Values from a table.
one response variable in a single column, choose One
column of values. To display bars side-by-side for each 3 Under One column of values, choose Cluster. Click OK.
combination of the factors for territories and for quarterly
4 Complete the dialog box as shown below.
sales, choose Cluster.

5 Click OK.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-18
Data Entry and Basic Charts
Interpreting your results
The bar chart indicates that the revenue is highest in the West
territory. Notice that the pattern of bars is different for each
territory.
From this bar chart, you cannot easily determine the total
sales revenue for each territory.

What’s next
Stack the bars for quarter within each territory.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-19
Data Entry and Basic Charts
Stacking bar charts
Another way to present the bar chart is to stack the bars using Edit Bar Chart Options
a grouping variable. For this example, stack the quarters to
1 Right-click the graph and choose Graph Options.
form one bar for each territory, instead of four. Then, the
graph will more clearly show the yearly sales revenue for the 2 Check Stack values of innermost categorical variable, as
territories. You can stack the bars by editing the current shown below.
graph.

3 Click OK.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-20
Data Entry and Basic Charts
Interpreting your results
Each territory’s bar is comprised of four color-coded
sections, each representing a quarter. The graph makes it
easier to determine the total sales for each territory. The West
has the highest revenue for the year, followed by the East and
the Central territory. MINITAB also adds a legend to the graph
so you can easily determine the color that corresponds to
each quarter.

What’s next
Add reference lines for each territory’s goal.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-21
Data Entry and Basic Charts
Adding reference lines
Recall that management wants to know whether each Reference Lines
territory met its sales goal: $5M in East, $4M in Central, and
1 Right-click the graph.
$6M in West.
2 Choose Add ➤ Reference Lines.
A convenient way to indicate these values on the graph is to
add reference lines. 3 Complete the dialog box as shown below.

4 Click OK.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-22
Data Entry and Basic Charts
Interpreting your results
The bar chart indicates that:
• The West territory met its goal.
• The Central and East territories did not meet their goals.
• The Central territory is closer to meeting its goal than the
East territory.
Be careful when interpreting whether a numeric goal is met
and by how much or little. When the goals are different, you
should consider what percentage of the goal is met versus the
actual dollar value. In this example, the territories had
different goals; the Central territory is 5.2% short of its goal,
while the East territory is 11.4% short of its goal. The West
territory is 12.5% above its goal.

What’s next
Change the text labels of the reference lines to make them
more useful.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-23
Data Entry and Basic Charts
Modifying reference labels
The bar chart does not clearly indicate which reference line Reference Labels
corresponds to each territory. To clarify the presentation,
1 Double-click the $4M reference line.
change the text label associated with each reference line.
2 Choose the Text tab.
3 Complete the dialog box as shown below.

4 Click OK.
5 Follow steps 1–4 for the East territory ($5M goal) and West
territory ($6M goal).

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-24
Data Entry and Basic Charts
Interpreting your results
The original interpretation remains the same. However, this
graph clearly indicates that West territory is the only territory
to meet its revenue goal.
Note When charting data from a table, the order of the quarters within
each bar and the order of the x-axis variable are determined by the
order of occurrence in the worksheet.
When charting counts of unique values or a function of a variable,
by default, the order is based on the natural order of the variable
(alphabetical or numerical order). You can change the order by
setting the value order for the column.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-25
Data Entry and Basic Charts
Updating a graph
Since the original data were recorded, a few sales that were Updating a Worksheet and Graph
pending in the East territory have come through. The new
1 Correct the data in the worksheet. In the Sales column, row
total for Q4 is now $1,276,482. Update both the worksheet
4, change 0.98190 to 1.27648. Press e.
and the bar graph.
2 Right-click the graph and choose Update Graph Now as
Note When you change the data in the worksheet, the symbol in the
upper left corner of the graph changes from a green plus to a yellow shown below.
circle with a minus. After you update the graph, the symbol returns
to a green plus.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-26
Data Entry and Basic Charts
Interpreting your results
The updated graph shows that sales for the East territory have
increased.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-27
Data Entry and Basic Charts
Final considerations

Summary and conclusions Additional considerations


• Only the West territory achieved its sales goal, even after For these data, you could have created a pie chart for each
updating the East territory’s fourth quarter revenue. territory and paneled them on one page. You could have also
• The Central and East territories are both about 5% short arranged the data in a two-way table, with the territories in
of their respective goals: 5.14% for Central and 5.49% for columns and each quarter in a row. The results remain the
East. same, although you would choose different graphing options.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-28
Data Entry and Basic Charts
Exercise A Solar Cell Manufacturer

Problem Data set


Production managers for a solar cell manufacturer are SOLAR.MPJ
examining the number of solar cells produced each month
over a 3-year period (2000–2002). By the year 2002,
managers had hoped to reach a production level of 23,000 Variable Description
units per month during the months of June, July, August, and Month Month of year
December, and 26,000 units per month during the remaining
months. 2000 Monthly solar cell production for 2000
2001 Monthly solar cell production for 2001
Data collection
2002 Monthly solar cell production for 2002
The manufacturer records the number of solar cells produced
each month during 2000, 2001, and 2002.

Instructions
1 Create a bar chart of the monthly production rates for
each of the three years.
2 Compare the results to the managers’ goals.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-29
Data Entry and Basic Charts
Exercise B Quality Assurance Testing

Problem Data set


A quality assurance (QA) department for a computer BUGS.MPJ
solutions development company wants to graphically
summarize the types of bugs they encounter during product
testing. Variable Description
Bug Type Type of software problem
Data collection
Frequency Frequency of each problem
During the past few years, the QA department has
maintained a database of the bug types and frequency. They
summarize these data in a table.

Instructions
1 Create a Pareto chart (of a defects table) to summarize the
bug types by their frequency of occurrence.
2 Determine which bug types account for more than 80% of
all bugs.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-30
Data Entry and Basic Charts
Exercise C Travel Expenses

Problem Data set


Management wants to examine a graphical summary of TRAVEL.MPJ
business travel expenses incurred by three sales teams during
the past year.
Variable Description
Data collection Month Month of year
A financial database provides monthly travel expenses for North Monthly travel expenses for the North sales team
three sales teams (North, South, and Central) during the past (US dollars)
year. South Monthly travel expenses for the South sales team
(US dollars)
Instructions Central Monthly travel expenses for the Central sales
Open the MINITAB worksheet TRAVEL.MPJ and create an team (US dollars)
appropriate graphical summary of the data.
1 Create a bar chart (values from a table) that summarizes
the monthly travel expenses for the three sales teams.
2 Describe any similarities or differences in travel expenses
among the sales teams.
3 Determine in which months the travel expenses are lowest
and highest.
4 Calculate the total yearly expenditures of each sales team
using Calc ➤ Column Statistics.

Data Entry and Basic Charts Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2-31
3
Patterned Data

Objectives
• Learn how to create data collection, and sampling plans.
• Generate patterned numeric, text, and date/time data.
• Generate random samples from a column of data.

Patterned Data Copyright Minitab Inc. 2002. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-1
Contents

Contents
Examples and Exercises Purpose Page
Example 1 Create a data collection plan for a Gage R&R 3-3
Measurement System Experiment (Gage R&R) experiment by generating patterned data, taking a
random sample, and sorting.
Example 2 Create a sampling plan for process control and 3-11
Process Sampling capability analysis by generating patterned data.
Example 3 Create a stratified random sample by splitting a 3-18
Employee Evaluation worksheet and generating a random sample.
Exercise D Create a sampling plan and analyze data using a 3-25
Investigating Variability in Surface Finish graphical summary, boxplots, dotplots, and time series
plots.
Exercise E Create a stratified random sample by splitting a 3-26
Customer Focus Group worksheet and generating a random sample.
Exercise F Create a random sample by generating patterned data, 3-27
Parts Sampling creating a random sample, and sorting to create a
worksheet for a capability study.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-2
Patterned Data

Patterned Data
Example 1 Measurement System Experiment (Gage R&R)

Problem Data set


Technicians working in a testing laboratory want to analyze None
the performance of a measurement system by conducting a
gage repeatability and reproducibility (Gage R&R)
experiment. Variable Description
Operator Operator identifier (1, 2, ..., 5)
Data collection
Part Part identifier (1, 2, ..., 12)
Before conducting the experiment, the technicians determine Order Random run order (1, ..., 240)
the data will be collected in the following manner: each of 5
operators takes 4 measurements on 12 different parts, for a
total of 240 measurements. To ensure that the results are
valid, the operators must take all 240 measurements in
random order.

Tools
• Make Patterned Data (Simple Set of Numbers)
• Random Data (Sample from Columns)
• Sort

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-3
Patterned Data
Making patterned data
Before the operators can measure the parts, you must create a Make Patterned Data
worksheet that details the data collection plan. You can use
1 Choose File ➤ New ➤ Minitab Project. Click OK.
Make Patterned Data to create columns of data with repeated
text, numbers, dates, or times. Specify the pattern in the 2 Choose Calc ➤ Make Patterned Data ➤ Simple Set of
dialog box and MINITAB fills in the column appropriately. Numbers.
First, create the Operator column. 3 Complete the dialog box as shown below.

Each of the 5 operators will measure 12 parts 4 times:


12 ∗ 4 = 48 measurements
The 5 operators will make a total of 240 measurements:
5 ∗ 48 = 240 measurements or worksheet rows

What's next
Create a column that identifies the 12 different parts.

4 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-4
Patterned Data
Making patterned data
Next, create the Part column. Each of the 5 operators will Make Patterned Data
measure the 12 parts 4 times each.
1 Choose Calc ➤ Make Patterned Data ➤ Simple Set of
Create a column that lists each part number 4 times, starting Numbers.
with 1 and ending with 12. Repeat the entire sequence 5 2 Complete the dialog box as shown below.
times, once for each operator.

What's next
Create a column to store the data collection order.

3 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-5
Patterned Data
Making patterned data
The 240 measurements in the experiment must be taken in a Make Patterned Data
completely random order. Create a column to identify this
1 Choose Calc ➤ Make Patterned Data ➤ Simple Set of
order:
Numbers.
1 Create a column that contains numbers 1 to 240 in
consecutive order. 2 Complete the dialog box as shown below.

2 Randomize the numbers in the column.

3 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-6
Patterned Data
Randomizing data
Now, randomize the Order column. Sample From Columns
Use MINITAB's random data function to randomize columns 1 Choose Calc ➤ Random Data ➤ Sample From Columns.
of data.You can store the randomized data in the same 2 Complete the dialog box as shown below.
column or a new column in the worksheet by specifying the
column in the dialog box.

What's next
Sort the data by the Order column to show the sequence in
which the operators should take the measurements.

3 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-7
Patterned Data
Sorting data
Sort the Operator, Part, and Order columns so the data are in Sort Data
the order in which the operators should collect it.
1 Choose Data ➤ Sort.
MINITAB can store the newly sorted data into a new MINITAB 2 Complete the dialog box as shown below.
worksheet, into the original columns of the same worksheet,
or into new columns within the same worksheet. You can
also sort data by more than one variable. For this example,
sort the data into the original columns by the Order column.

3 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-8
Patterned Data
Interpreting your results
You created a worksheet that shows the experimental design
for the Gage R&R study. Each row represents a unique
operator/part combination.
The rows are sorted so that the first row corresponds to the
first measurement the operators should take, the second row
corresponds to the second measurement, and so on. Based on
the worksheet on the right, the first measurement should be
taken by operator 5 on part 4.
Your worksheet does not match the worksheet on the right
because MINITAB selects the random sample based on a value
determined by the exact time you opened MINITAB. This
process results in a different random sample each time you
use the command.
Note If you need to obtain the same random sample multiple times, use
Calc ➤ Set Base and use the same base for the random generator.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-9
Patterned Data
Final considerations

Summary and conclusions Additional considerations


Using several MINITAB commands, you: If you run this type of experiment frequently, you can use a
• Created columns for Operator, Part, and Order macro to generate the appropriate columns for Operator, Part,
and Order. Macros can automate certain repetitive tasks in
• Generated a randomized run order for the Gage R&R MINITAB, making them quicker to perform.
experiment
There are many macros freely available to MINITAB users at
• Sorted all three columns by Order to show the order in
www.minitab.com/support/macros. Look under the Quality
which the data should be collected
Control and Designed Experiments category to find a macro
After the measurements are taken, use Gage R&R to analyze entitled MSA Setup, which you can use to generate the
the data. appropriate columns in random order for a Gage R&R study.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-10
Patterned Data

Patterned Data
Example 2 Process Sampling

Problem Data set


Quality control technicians plan to evaluate the statistical None
control and capability of a manufacturing process using
sampled parts.
Variable Description
Before they sample the parts, the technicians must construct
a MINITAB worksheet that details the sampling plan. Date Date parts are sampled (8/18/2003–8/22/2003)
Day Day of week on which parts are sampled (Mon–Fri)
Data collection Time Time of day at which parts are sampled (8:00 am–
The technicians design a plan in which they sample two parts 5:00 pm)
from each of four machines (A, B, C, and D) every hour from Machine Identifier for machine (A, B, C, D)
8:00 am to 5:00 pm for five days (Monday through Friday).

Tools
• Make Patterned Data (Simple Set of Date/Time Values)
• Extract from Date/Time (To Text)
• Make Patterned Data (Text Values)

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-11
Patterned Data
Making patterned data
Before the technicians can measure the parts, you need to Make Patterned Data
create a worksheet that details the part sampling plan. Use
1 Choose File ➤ New ➤ Minitab Project. Click OK.
Make Patterned Data to create columns of data with repeated
text, numbers, dates, and times. You specify the pattern in the 2 Choose Calc ➤ Make Patterned Data ➤ Simple Set of
dialog box and MINITAB fills in the column. Date/Time Values.
First, create a column for Date. Technicians will measure 3 Complete the dialog box as shown below.
parts over a 5-day period, from 8/18/2003–8/22/2003.
Two parts are sampled from 4 different machines 10 times
each day:
2 ∗ 4 ∗ 10 = 80 parts per day
Create a column that lists each date 80 times:
80 ∗ 5 = 400 rows

What's next
Create a column that contains the day of the week parts are
sampled.

4 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-12
Patterned Data
Extracting text data from dates
Use Extract from Date/Time to obtain information from date/ Extract from Date/Time to Text
time columns. You can extract numeric or text data
1 Choose Data ➤ Extract from Date/Time ➤ To Text.
depending on your application.
2 Complete the dialog box as shown below.
You can extract the following date/time components.
• Day of week or month
• Week, month, quarter, or year
• Hour, minute, second, or hundredth of a second
You want to determine which day of the week corresponds to
each date. MINITAB stores the new data in the next available
column in the worksheet and labels it Day.

What's next
Create a column specifying the hour at which samples are
taken.

3 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-13
Patterned Data
Making patterned data
Create a column for Time. Each hour, 2 parts are sampled Make Patterned Data
from 4 different machines:
1 Choose Calc ➤ Make Patterned Data ➤ Simple Set of
2 ∗ 4 = 8 parts per hour Date/Time Values.
Create a column that lists each hour (8:00 am to 5:00 pm) 8 2 Complete the dialog box as shown below.
times. Repeat the entire sequence 5 times for each of the 5
days, creating 400 rows.

What's next
Create a column that identifies the 4 different machines.

3 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-14
Patterned Data
Making patterned data
Create a column for Machine. Samples are taken 10 times Make Patterned Data (Text)
each day:
1 Choose Calc ➤ Make Patterned Data ➤ Text Values.
10 times ∗ 5 days = 50 sampling hours
2 Complete the dialog box as shown below.
Each hour, 2 parts are taken from each machine. Create a
column that lists each machine twice (A A B B C C D D, and
so on). Repeat the entire sequence 50 times for the 50
sampling hours in the study.

3 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-15
Patterned Data
Interpreting your results
Check the worksheet to ensure that the four columns—Date,
Day, Time, and Machine—match the sampling plan in the
problem statement. The first 17 rows of the plan are shown in
the worksheet on the right.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-16
Patterned Data
Final considerations

Summary and conclusions Additional considerations


Using Make Patterned Data, you: After collecting the responses using the sampling plan, you
• Created columns for Date, Day, Time, and Machine can use many different graphical and statistical tools in
MINITAB to analyze the data.
• Generated a sampling plan that can be used to collect data
for analysis
You did not sort these data because they will be collected in
time-order.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-17
Patterned Data

Patterned Data
Example 3 Employee Evaluation

Problem Data set


A human resources manager wants to conduct interviews EMPLOYEEEVAL.MPJ
with employees to better understand the impact that job
stress has on employee performance, expectations,
motivation, and overall job satisfaction. Variable Description

Due to time constraints, the manager plans to interview only First Name Employee’s first name
30 of the 586 employees in the company. To obtain a Last Name Employee’s last name
representative sample, the manager decides to randomly
select 5 employees from each of 6 job categories. Job Type Employee’s job category

Data collection
The 586 employees are classified into 6 job categories:
Administrative, Customer Support, Engineering,
Management, Production, and Sales. All 586 employees and
the corresponding job type are in a MINITAB project.

Tools
• Split Worksheet
• Random Sample (Sample from Columns)

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-18
Patterned Data
Opening a project
Open the MINITAB project, EMPLOYEEEVAL.MPJ. Open Project
Examine the worksheet. Each row contains the information 1 Choose File ➤ Open Project.
for a single employee, resulting in 586 rows. There are 3 2 Choose EMPLOYEEEVAL.MPJ.
columns: First Name, Last Name, and Job Type.
3 Click Open.
What's next
To select the employees to interview, split the worksheet by
job type.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-19
Patterned Data
Splitting worksheets
Because you are sampling five employees from each of the Split Worksheet
six job types, you should have a separate MINITAB worksheet
1 Choose Data ➤ Split Worksheet.
for each job type.
2 Complete the dialog box as shown below.
Splitting the worksheet is a simple way to place data in
separate worksheets according to different values of a
variable or a set of variables.

3 Click OK.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-20
Patterned Data
Interpreting your results
You created six new worksheets, one for each job type.
MINITAB labels the worksheets according to the different
levels in the By variable.

What's next
Randomly select five employees from each job type.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-21
Patterned Data
Selecting a random sample
For each job type, randomly select five employees. MINITAB Random Data
randomly selects five rows of data in the worksheet and
1 Make the Job Type = Administrative worksheet active.
stores those samples in the columns you specify.
2 Choose Calc ➤ Random Data ➤ Sample From Columns.
3 Complete the dialog box as shown below.

4 Click OK.
5 Repeat steps 1−4 for the remaining worksheets.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-22
Patterned Data
Interpreting your results
Note that MINITAB stored the 5 randomly selected names in
columns 5, 6, and 7, as shown in the worksheet on the right.
The randomly selected names in your worksheet do not
match the names in the sample worksheet on the right
because MINITAB selects the random sample based on a value
determined by the exact time you opened MINITAB. This
process results in a different random sample each time you
use the command.
Note If you need to obtain the same random sample multiple times, use
Calc ➤ Set Base and use the same base for the random generator.

To create the final worksheet with the 30 employees, you can


copy and paste the 5 employees from each worksheet into a
new worksheet.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-23
Patterned Data
Final considerations

Summary and conclusions Additional considerations


You created a random sample of 30 employees from 6 job When taking a stratified random sample from a population
types to take part in interviews by: with multiple groups of different size, consider relative group
• Splitting the worksheet into 6 worksheets, one for each size when determining the sampling plan. In this example,
job type the group sizes are:
• Generating a random sample of 5 employees from each Administrative 24
job type Customer Support 63
• Copying and pasting the 5 employees from each Engineering 111
worksheet into a new worksheet
Management 114
Production 218
Sales 56

Sampling 5 employees from each group represents about


20% of the administrative staff and only about 2% of the
production staff. This sampling approach could result in data
that does not fully represent the overall population.
In this case, sampling the same percentage of employees
from each group, instead of the same number of employees,
results in a better plan. For example, selecting 5% from each
group results in a sample of 2 employees from the
administrative group and 11 employees from the production
group.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-24
Patterned Data
Exercise D Investigating Variability in Surface Finish

Problem 3 Use graphical and statistical tools to determine whether


the data from this process have any unusual patterns or
A manufacturing facility operates a machining process that variations.
produces aluminum parts for an automotive application. The
parts have a milled, gasket sealing surface. The facility has 4 Investigate the causes of any unusual variations found.
been having problems with parts failing leak tests. A quality
technician has determined that the aluminum parts have a Data set
poor surface finish, which causes the leaks. The specification
for the surface finish (Roughness) is 64. The technician ROUGHNESS.MPJ
needs to create a sampling plan and then analyze the data.

Data collection Variable Description


Roughness Roughness measurement
The technician wants to conduct a study to examine 100
roughness samples of the surface finish, 50 samples from
each of two shifts.
There are 10 hours in each shift. Starting at 6 am, the
technician wants to collect one sample from each of five
fixtures every hour for 20 hours. The five samples represent a
composite sample in which each fixture is sampled each
hour.

Instructions
1 Open ROUGHNESS.MPJ. Note that the roughness
variable is already entered in the project.
2 Create input variable columns DateTime, Shift, and
Fixture.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-25
Patterned Data
Exercise E Customer Focus Group

Problem Data set


To gain customer information about an upcoming product CUSTFOCUS.MPJ
release, a marketing team decides to select 24 gold-status
customers to participate in a focus group. Variable Description
Customer ID Customer account number
Data collection Status Status level (Level 1, Silver, Gold)
A MINITAB project contains information, including customer Gender Gender of customer (Male, Female)
ID, status level, gender, and ownership level for
Ownership (6 months, 1 year)
approximately 25,000 customers. A customer’s status level is
determined by their yearly expenditures. Ownership level is
based on how long they’ve owned the company’s product.
From this list, the marketing team wants to take a stratified
random sample of 24 gold-status level customers, half male
and half female. For each set of males and females, 6 should
be in the 6-month ownership level and 6 should be in the 1-
year ownership level.

Instructions
Using appropriate data manipulation and sampling tools in
MINITAB, produce the desired sample of customers for the
focus group.
1 Split the worksheet by the appropriate variables.
2 In the appropriate worksheets, take a random sample from
columns.
3 Combine samples into one worksheet.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-26
Patterned Data
Exercise F Parts Sampling

Problem Data set


To conduct a process capability study, a quality engineer None
wants to sample 150 parts from a recent production run of
125,000 parts, which are currently stored in a warehouse. Variable Description
Pallet Pallet identifier (1, ...,1250)
Data collection Part Part identifier (1, ..., 100)
The warehouse currently has 1,250 pallets each with 100
parts. The engineer needs a random sample of 150 parts from
the 1,250 pallets.

Instructions
1 Using appropriate data generating and sampling tools in
MINITAB, produce a random sample of 150 parts from the
125,000 total parts, with the pallet and part numbers
identified.
2 Name the new columns Sample Pallet and Sample Part.
3 In addition, sort the final worksheet according to pallet
number.

Patterned Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3-27
4
Basic Data Analysis

Objectives
• Use histograms, dotplots, boxplots, and time series plots to analyze data.
• Calculate descriptive statistics to examine important features of data.

Basic Data Analysis Copyright Minitab Inc. 2002. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-1
Contents

Examples and Exercises Purpose Page


Example 1 Analyze the accuracy and precision of a measurement 4-3
Gage Accuracy and Precision system experiment using descriptive statistics and a
boxplot.
Example 2 Analyze two processes using descriptive statistics, 4-10
Overadjusting the Process dotplots, and time series plots.
Example 3 Analyze data using descriptive statistics and histograms. 4-21
Wait Time
Exercise G Compare defect rates using a bar chart, time series plot 4-34
Linen Process and scatterplot.
Exercise H Analyze call rates using descriptive statistics. 4-35
Call Center Data
Exercise I Analyze loan application data using descriptive statistics. 4-36
Online Application

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-2
Basic Data Analysis

Basic Data Analysis


Example 1 Gage Accuracy and Precision

Problem Data
Engineers design a measurement system to measure a part LENGTH.MPJ
dimension critical to an assembly process. To assess the
accuracy and precision of this system, the engineers plan an
experiment in which two operators make repeated Variable Description
measurements on the same part. Operator Operator identifier
Length Measurement of part taken by operator
Data collection
The engineers select a standard part with a known length of
25 mm for the study. Two operators each measure this
standard part 50 times.

Tools
• Calculator
• Display Descriptive Statistics
• Dotplot

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-3
Basic Data Analysis
Calculating values
Before analyzing the data, use MINITAB’s Calculator to Calculator
transform the data by subtracting 25 from the measurements,
1 Open LENGTH.MPJ.
which quantifies the difference between the measured length
and its true length. This difference is the amount of bias in 2 Choose Calc ➤ Calculator.
the measurements. MINITAB stores the transformed variable
in a column you specify. 3 Complete the dialog box as shown below.

Although not required for the analysis, doing this step can
make the data easier to interpret.

4 Click OK.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-4
Basic Data Analysis
Interpreting your results
Examine the worksheet. In the new column, labeled Bias,
note that:
• A value of 0 indicates a measured value equal to the part’s
true length.
• A positive number indicates a measured value greater
than the part’s true length.
• A negative number indicates a measured value less than
the part’s true length.

What’s next
Analyze the transformed data to assess the accuracy and
precision of measurements.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-5
Basic Data Analysis
Displaying summary statistics
Use descriptive statistics to determine the accuracy and Display Descriptive Statistics
precision of the operators’ measurements. Because the
1 Choose Stat ➤ Basic Statistics ➤ Display Descriptive
measurements are in one column, use the By variable,
Statistics.
Operator, so that MINITAB displays the statistics separately
for each operator. 2 In Variables, enter Bias. In By variables, enter Operator.
For a single measurement, accuracy refers to how close the 3 Click Statistics.
measured value is to the true value of the part. For multiple
4 Complete the dialog box as shown below.
measurements, accuracy refers to how close the average of
the measured values is to the part’s true value. In this case,
the part’s true value is 25.
Precision refers to the variability in the measurements. Use a
measure of variability, such as the standard deviation, to
assess precision.
Note that you can only assess the accuracy of a measurement
system if you know the true value of the measured part.
However, you can assess precision without knowing this
value.
Note You can also analyze the original measurement data to assess
accuracy and precision.
5 Click Graphs. Check Boxplot of data.
6 Click OK in each dialog box.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-6
Basic Data Analysis
Interpreting your results
Examine the sample means and sample standard deviations Descriptive Statistics: Bias
of the bias variable for each operator. The results indicate
Variable Operator Mean StDev
that: Bias 1 -0.000146 0.005499
2 0.007400 0.003221
• Operator 1’s measurements appear to be more accurate
because the average bias of the measurements is closer to
0 than Operator 2’s measurements.
• Operator 2 has a smaller standard deviation than Operator
1, indicating that Operator 2’s measurements are more
precise.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-7
Basic Data Analysis
Interpreting your results
Compare the boxplots of the two operators. The plot supports
the results of the descriptive statistics, although the graph
provides more detail because it displays the range of
measurements made by each Operator. For example, the plot
shows that:
• Operator 2 consistently obtains measurements greater
than the standard part length of 25 mm. You cannot
determine this fact from the mean or standard deviation.
• The measurements from Operator 1, which range from
approximately 0.01 mm below the true length to 0.01 mm
above the true length, are less consistent than those of
Operator 2.
You can also generate these results by computing additional
descriptive statistics, including the minimum, maximum, and
range.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-8
Basic Data Analysis
Final considerations

Summary and conclusions Additional considerations


Using descriptive statistics and a dotplot, you discovered This example demonstrated a simple gage study, which can
that: quantify biases in a measurement system, identify operator
• Operator 1 appears to take more accurate measurements. effects, and quantify repeatability error. This type of study is
most useful when the primary objective is to compare two
• Operator 2 appears to take more precise measurements. specific operators or instruments.
Also, operator 2 consistently obtains measurements
greater than the standard part length of 25.00 mm. MINITAB also offers a full gage R&R study (Stat ➤ Quality
Tools ➤ Gage Study). A gage R&R study is a designed
You should investigate the process used by the operators to experiment that quantifies sources of measurement system
learn why the measurements by Operator 1 are less precise variability associated with repeated measurements and
than those of Operator 2, and why the measurements by operator effects, relative to process variability. You can use a
Operator 2 are systematically inaccurate. full gage study to qualify a measurement system for use.
Learn more about the gage R&R study in the Statistical
Quality Analysis training course.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-9
Basic Data Analysis

Basic Data Analysis


Example 2 Overadjusting the Process

Problem Tools
A quality engineer is concerned about excessive variation in • Display Descriptive Statistics
the coating amount in a manufacturing process that applies a • Dotplot
polymer coating to a foil substrate. The engineer believes the • Time Series Plot
variation may be a result of operators making too many • Calculator
• Tally Individual Variables
adjustments to the machine. To test this hypothesis, the
engineer designs a simple experiment to compare the
performance of the process with and without operator Data set
adjustments.
ADJUST.MPJ

Data collection
Variable Description
For 3 hours, the process runs with the current operating
procedure; that is, the operators sample and inspect the Adjust Thickness of coating (mm) on substrate during the
process during production and make any needed adjustments process with operator adjustments
to keep the process on target (1.5 mm). Then, the process NoAdjust Thickness of coating (mm) on substrate during the
runs for another 3 hours without any adjustments to the process without operator adjustments
machine after the initial setup.
During each process run, the engineer samples 50 pieces of
substrate and records the coating thickness in millimeters.
The lower specification limit is 1.485 mm and the upper
specification limit is 1.515 mm.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-10
Basic Data Analysis
Displaying a dotplot
Create a dotplot to display a graphical summary of the data. Dotplot
A dotplot is similar to a histogram except that a dotplot
1 Open ADJUST.MPJ.
displays a single dot for each data value (or for a group of
similar data values when there are many observations). 2 Choose Graph ➤ Dotplot.
3 Under Multiple Y’s, choose Simple, then click OK.
4 Complete the dialog box as shown below.

5 Click OK.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-11
Basic Data Analysis
Interpreting your results
The dotplots indicate that:
• Both processes are centered near the target of 1.5 mm.
• The adjusted data are spread across a greater range of
values, which indicates that the adjusted process results in
greater variation than the nonadjusted process.
Note Double-click the x-axis to create tick marks that display the target of
1.5 mm.

What's next
Display descriptive statistics.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-12
Basic Data Analysis
Displaying descriptive statistics
The target coating thickness is 1.500 mm with a lower Display Descriptive Statistics
specification limit of 1.485 mm and an upper specification
1 Choose Stat ➤ Basic Statistics ➤ Display Descriptive
limit of 1.515 mm.
Statistics.
Calculate the sample mean of the measurements for each 2 In Variables, enter Adjust NoAdjust.
process to determine which process is closest to the target.
3 Click Statistics.
Calculate the sample standard deviation of the measurements
for each process to determine which process has greater 4 Complete the dialog box as shown below.
variation and how the processes compare to the range set by
the upper and lower specification limits (1.515 −1.485 =
0.030 mm).

5 Click OK in each dialog box.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-13
Basic Data Analysis
Interpreting your results
Examine the two sample means and two sample standard Descriptive Statistics: Adjust, NoAdjust
deviations.
Variable Mean StDev
Adjust 1.5000 0.0128
The mean measurement of the adjusted process is on target NoAdjust 1.4981 0.00701
(1.500 mm), while the mean for the nonadjusted process is
close to target. However, the standard deviation of the
adjusted process is greater than the standard deviation of the
nonadjusted process.

What's next
Display the data in time order using a time series plot.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-14
Basic Data Analysis
Displaying a time series plot
Because the data are from processes run over time, you Time Series Plot
should examine the data relative to time to reveal any time-
1 Choose Graph ➤ Time Series Plot.
order patterns in the data. These patterns are often not
apparent in other plots, such as histograms and dotplots. 2 Choose Simple, then click OK.
MINITAB's time series plot is a very useful tool for examining 3 In Series, enter Adjust NoAdjust.
data in time order.
4 Click Multiple Graphs. Complete the dialog box as shown
Time series plots display measurement data on the y-axis below.
versus a time index on the x-axis. To display the data
correctly, you must arrange them in time order in the
worksheet.

5 Click OK in each dialog box.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-15
Basic Data Analysis
Interpreting your results
The time series plot shows that making adjustments to the
process while it is running results in increased variability in
the coating thickness.
In the process without adjustments, a downward drift in
coating thickness begins around the 40th sample. Such a drift
can indicate that, even though operator adjustment increases
variation in the process, periodic inspection might be
necessary to ensure that the process is not drifting too far
from target. Statistical methods exist that allow you to create
a signal that indicates when a process drifts off target.
Note A time series plot graphs the data by an index (row number). You
can stamp the axis with date/time or other information, but note that
the spacing between x values is fixed and does not truly represent
time.

What's next
Determine the proportion of measurements outside of the
specification limits.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-16
Basic Data Analysis
Proportion outside specification
Another useful statistic is the proportion of measurements Calculator
outside the upper and lower specification limits. The lower
1 Choose Calc ➤ Calculator.
specification limit is 1.485 and the upper specification limit
is 1.515. 2 Complete the dialog box as shown below.
To calculate the proportion, first use MINITAB’s Calculator to
create two columns, one for each process, indicating which
parts are outside of the specification limits. MINITAB
identifies rows where the expression is satisfied with a one
and rows where the expression is not satisfied with a zero. In
this example, measurements outside the limits are identified
with a one and measurements inside the limits are identified
with a zero.
MINITAB stores the result in the next available column using
the name you specify.

What's next
Use Tally Individual Variables to calculate the frequency and
percentage of parts outside the specification limits.
3 Click OK.
4 Repeat steps 1−3 for 'NoAdjust', storing the result in
NoAdjustOutside.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-17
Basic Data Analysis
Proportion outside specification
Using the new columns you created with Calculator, you can Tally Individual Variables
now use Tally Individual Variables to calculate the number
1 Choose Stat ➤ Tables ➤ Tally Individual Variables.
and percentage of measurements that are inside and outside
the specification limits. MINITAB counts the number of 0's 2 Complete the dialog box as shown below.
and 1's in each of these columns, then calculates the
proportion of measurements inside and outside the limits.

3 Click OK.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-18
Basic Data Analysis
Interpreting your results
The results indicate that: Tally for Discrete Variables: AdjustOutside, NoAdjustOutside
• For the adjusted process, 12 measurements (24%) are AdjustOutside Count Percent NoAdjustOutside Count Percent
outside the specification limits. 0 38 76.00 0 47 94.00
1 12 24.00 1 3 6.00
• For the process that was not adjusted, only 3 N= 50 N= 50
measurements (6%) are outside the limits. Recall from the
time series plot that these measurements were taken at the
very end of the run, when the process began to drift off
target.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-19
Basic Data Analysis
Final considerations

Summary and conclusions Additional considerations


The results from the descriptive statistics and dotplot support All processes have some inherent variability. Usually,
the quality engineer’s hypothesis that making adjustments to constant adjustments to a process are not necessary. Control
the machine during the process increases the variation in the charts are a statistical tool commonly used to monitor
coating amount. The average measurement from the adjusted process variation. These charts are specialized time series
process is on target at 1.5 mm, which may be expected plots that can help determine the amount of variation that is
because the operators made the adjustments to remain on naturally inherent in the process versus the amount due to a
target. However, the frequent adjustments also increased the special cause. You can use control charts to determine when
variation. taking corrective action is necessary.
Increased variability can be an unintended consequence of
making frequent adjustments to a process. You can easily
misinterpret the natural variation in a process as a change in
performance, indicating adjustments are necessary, when in
actuality, they are not.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-20
Basic Data Analysis

Basic Data Analysis


Example 3 Wait Time

Problem Data
Administrators at a health clinic want to know how long WAITTIME.MPJ
patients wait to see a physician. They suspect there might be
a difference between wait times for appointments in the
morning versus the afternoon. Variable Description
Date Day of visit (3/5–5/10)
Data collection
Wait Time Time that a patient waits for doctor (rounded to
For approximately 2 months, administrators record the time nearest 5 minutes)
(rounded to the nearest 5 minutes) that patients spend waiting Time of Day Time of day of the appointment (Morning,
for a doctor and whether the appointment occurs in the Afternoon)
morning or afternoon.

Tools
• Display Descriptive Statistics
• Histogram

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-21
Basic Data Analysis
Displaying descriptive statistics
To summarize important features of the data, use Display Display Descriptive Statistics
Descriptive Statistics to calculate several basic statistics.
1 Open WAITTIME.MPJ.
One particularly useful statistic is the median, which is the 2 Choose Stat ➤ Basic Statistics ➤ Display Descriptive
middle value of a data set that is arranged in ascending or Statistics.
descending order. That is, half the values in the data set are
smaller than the median value and half the values are larger. 3 In Variables, enter 'Wait Time'.

The median can be a better measure of the center of a data set 4 Click Statistics.
than the mean, especially when the distribution of the values 5 Complete the dialog box as shown below.
is not symmetric. For highly skewed data, the mean can
provide an inaccurate representation of the center of the data.
Data such as “wait times” or “time until an event occurs” are
frequently skewed to the right.

6 Click OK in each dialog box.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-22
Basic Data Analysis
Interpreting your results
The descriptive statistics indicate that: Descriptive Statistics: Wait Time
• The average wait time is about 23.2 minutes. Variable Mean StDev Minimum Q1 Median Q3 Maximum IQR
Wait Time 23.201 10.272 5.000 15.000 20.000 30.000 60.000 15.000
• The shortest recorded wait time is 5 minutes and the
longest is 60 minutes.
• The median value is 20 minutes. This indicates that
approximately 50% of the patients waited 20 minutes or
less to see a doctor.
• Q1 (first quartile) indicates that approximately 25% of the
patients in the sample waited 15 minutes or less for the
doctor.
• Q3 (third quartile) indicates that approximately 75% of
the patients in the sample waited 30 minutes or less to see
the doctor.

What's next
Create a histogram to examine the shape and distribution of
the data.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-23
Basic Data Analysis
Displaying a histogram
Histograms divide sample values into many intervals called Histogram
bins. Bars represent the number of observations falling
1 Choose Graph ➤ Histogram.
within each bin (its frequency). A histogram is a good way to
evaluate the center, spread, and general shape of a 2 Choose Simple, then click OK.
distribution—as long as an appropriate number of
observations are available. If you have too few data points, 3 Complete the dialog box as shown below.
the shape of the histogram may not accurately represent the
distribution of the population.

4 Click OK.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-24
Basic Data Analysis
Interpreting your results
The histogram of the wait times shows that the data are
skewed right because the right tail of the data extends farther
than the left tail.
Notice the space between the bars in the histogram. The
intermediate bins do not contain any values because the wait
times were recorded in increments wider than the default bin
width.
Because of tick marks are not aligned with the bars, you
cannot determine which wait time values the bars represent.

What's next
Change the position of the tick marks on the x-axis.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-25
Basic Data Analysis
Editing scale tick marks
You want to change the position of the tick marks so you can Edit Scale
more easily interpret the data. Because the administrators
1 Double-click the x-scale.
collected the data in 5 minute increments, change the tick
marks on the graph so that they are also in 5 minute 2 Complete the dialog box as shown below.
increments, beginning at 5 minutes and ending at 60 minutes.

3 Click OK.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-26
Basic Data Analysis
Interpreting your results
Note that the shape of the data has not changed but you now
have a tick mark for each bar in the histogram. The largest
bar on the graph represents 20 minutes, indicating that the
most frequent wait time for patients is 20 minutes.

What's next
Compare wait times for morning and afternoon
appointments.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-27
Basic Data Analysis
Displaying descriptive statistics and histogram
Next, you want to compare the wait times for patients with Display Descriptive Statistics
morning appointments versus those with afternoon
1 Choose Stat ➤ Basic Statistics ➤ Display Descriptive
appointments. Use the By variable, Time of Day, so that
Statistics.
MINITAB displays the descriptive statistics separately for
morning and afternoon. Also, display a histogram to compare 2 Complete the dialog box as shown below.
the distribution of the data by time of day. Because you have
a By variable, MINITAB automatically panels the histograms.

3 Click Graphs. Check Histogram of data.


4 Click OK in each dialog box.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-28
Basic Data Analysis
Interpreting your results
The descriptive statistics indicate that wait times are Descriptive Statistics: Wait Time
generally longer in the afternoon than the morning. For
Variable Time of Day Mean StDev Minimum Q1 Median Q3
example, the median wait time in the morning is 20 minutes; Wait Time Afternoon 27.191 10.760 10.000 20.000 25.000 35.000
in the afternoon, 25 minutes. The same is true for the mean, Morning 19.231 8.006 5.000 15.000 20.000 25.000
minimum, maximum, 1st quartile (Q1), and 3rd quartile Variable Time of Day Maximum IQR
(Q3). Wait Time Afternoon 60.000 15.000
Morning 55.000 10.000

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-29
Basic Data Analysis
Interpreting your results
Although the distributions are somewhat similar, the majority
of the morning wait times are shorter than the afternoon wait
times. Only a few wait times in the morning are greater than
35 minutes. The bar on the far right side of the histogram for
morning appointments may indicate an outlier exists in the
data.
You can edit the scale to change the position of the tick
marks by double-clicking the x-axis.
Note The Afternoon histogram is on the left side of the plot because
MINITAB displays text variables in alphabetical order unless you
specify a value order in the column.

What’ next
Change the display of the histograms so they are stacked one
above the other, instead of side by side.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-30
Basic Data Analysis
Editing the histogram
Change the display of the histograms to place them in two Display Descriptive Statistics
rows and one column, instead of in two columns and one
1 Right-click the graph and choose Panel.
row. This makes it easier to compare the distributions of the
data. 2 Complete the dialog box as shown below.

3 Click OK.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-31
Basic Data Analysis
Interpreting your results
The histograms have not changed but it is easier to compare
the distributions.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-32
Basic Data Analysis
Final considerations

Summary and conclusions Additional considerations


The initial descriptive statistics indicate that the median wait In this example, the administrators rounded the wait time to
time is 20 minutes. The descriptive statistics and the the nearest 5-minute increment. As a result, some
histogram, however, show that some patients are waiting information about the distribution of wait times is lost.
more than 30 minutes—some as long as 60 minutes.
A better approach may be to record the actual wait time,
The descriptive statistics and the histograms displayed by without rounding it, to get a more accurate measurement.
time of day indicate that wait times tend to be longer in the Additionally, you may want to more thoroughly examine
afternoon than in the morning. wait time as it relates to the time of day. In this example, the
time of day was only classified into two categories, morning
and afternoon. You may want use the actual arrival time
instead of a categorized time of day variable.

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-33
Basic Data Analysis
Exercise G Linen Process

Problem Data set


A textile manufacturer is concerned about the number of LINENPROCESS.MPJ
defects in the fabric produced by one of the lines in the plant.

Data collection Variable Description


Date Month/Year (April 1998 to December 2000)
Each month for 33 months, a quality control technician
records the number and type of defects found in the fabric. Material Amount of material produced that month (meters)
Defects are classified as tears, pulls, skips, punctures, or Tears Number of tears
other.
Pulls Number of pulls
Instructions Skips Number of skips

1 Create a bar chart, using Function of a variable and Sum Punctures Number of punctures
options. Which defect is most prevalent? Other Number of defects that do not match one of the
2 Create a time series plot. Is there any structure or pattern above classifications
in the defect rate over time?
3 Create a scatterplot. Is the number of defects related to the
amount of material run in a month?

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-34
Basic Data Analysis
Exercise H Call Center Data

Problem 6 What percentage of customers are on hold for less than 5


minutes?
A manager of a customer call center wants to examine
several characteristics of the calls received. When customers 7 What percentage of the time are customers on hold for
call the center, they are prompted to enter their account between 5 and 10 minutes?
information, after which they are placed on hold until an 8 What percentage of the time are customers on hold for
operator picks up the call to answer their questions. more than 10 minutes?
9 What percentage of calls are resolved within 30 minutes,
Data collection including hold time?
During a period of 20 days, each call is recorded along with 10 What percentage of calls do customers hang up without
the number of minutes the customer spends on hold and the speaking to an operator?
number of minutes a live operator spends answering each
call. Data set

Instructions CALLCENTER.MPJ

Generate descriptive statistics to answer the following


questions: Variable Description
1 What is the average number of calls received per day? Date Date of call (4/1−4/20)
2 What is the average amount of time a customer spends on OnHold Amount of time customer spends on hold (minutes)
hold?
Talking Amount of time operator spends speaking with
3 What are the maximum and minimum hold times? customer (minutes). (An asterisk (∗) indicates the
customer hung up while on hold.)
4 What is the median on-hold time?
5 What is the average time an operator spends with a
customer?

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-35
Basic Data Analysis
Exercise I Online Application

Problem 6 What is the actual number of applications attempted each


day?
A consumer loan company recently added an online
application to their Web site. Management wants to examine 7 What is the average number of applications attempted
usability performance during the first few weeks using each day?
numerical and graphical analysis. 8 Create a bar chart to visualize the number of applications
attempted each day. Hint: In Bar Chart, choose Count of
Data collection unique values. Then, in Data View, uncheck Bars, and
check Symbols and Connect line.
For the first month (from October 5 to November 8), the
company’s Web team records the time required to complete
Data set
the online application.
ONLINEAPP.MPJ
Instructions
Generate descriptive statistics to answer the following Variable Description
questions:
Date Date customer attempts online application
1 What is the average amount of time a customer spends
completing the application? Time Time customer spends completing application (An
asterisk (∗) means customer quit prior to
2 What are the maximum and minimum times to complete completion.)
the application?
3 What is the median completion time?
4 What percentage of customers complete the application
process within 30 minutes?
5 What percentage of customers start the application
process but don't complete it?

Basic Data Analysis Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4-36
5
Importing and
Manipulating Data

Objectives
• Learn the principles of importing data from other software programs, including Microsoft Excel and Access.
• Learn how to create and run Exec files so you can recreate analysis steps.
• Create Pareto charts, bar charts, time series plots, and dotplots to display data.
• Use MINITAB tools to recode data and create new variables.
• Restructure data for analysis in MINITAB.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-1
Contents

Contents
Examples and Exercises Purpose Page
Example 1 Import data from Microsoft Excel, code new categories, 5-3
Bill Payment create a Pareto chart, and create a simple macro.
Example 2 Import data from Microsoft Excel, restructure worksheet, 5-20
Chemical Purity and perform analyses using Graphical Summary, Tally
Individual Variables, and time series plot.
Example 3 Import data from Microsoft Access, restructure 5-38
Tracking Downtime worksheet, and create Pareto charts, bar charts, and
dotplots.
Exercise J Import data from Microsoft Excel and analyze data using 5-51
Restaurant Complaints Tally Individual Variables and Pareto charts.
Exercise K Import data from Microsoft Excel, restructure worksheet, 5-52
Sulfur Dioxide Levels and analyze data using a scatterplot.
Exercise L Split the worksheet and analyze data using dotplots. 5-53
Investigating Downtime Distributions
Exercise M Import subsets of data using ODBC functions. 5-54
Production Downtime

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-2
Importing and Manipulating Data

Importing and Manipulating Data


Example 1 Bill Payment

Problem Data
Managers at a business-training center are interested in BILLPAYMENT.XLS
summarizing the time needed for their customers to submit
bill payments. In particular, they want to get an idea of what
percentage of customers pay within 30 days, between 31 and Variable Description
60 days, between 61 and 90 days, and in more than 90 days. Customer ID Identification number associated with a
They also want to automate the steps for importing and particular customer
analyzing the data.
Sent Date bill was sent
Data collection Received Date payment was received

The company has been operating for 4.5 years. Billing and
receipt time data for this period are stored in Microsoft
Excel.

Tools
• Importing data from Excel
• Calculator
• Display Descriptive Statistics
• Code Data (Numeric to Text)
• Pareto Chart
• Create and run an Exec

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-3
Importing and Manipulating Data
Importing data from Excel
Before you import the data into MINITAB, you must take into
consideration the structure of the Excel file:
• The variable names are in row 1.
• The data begin in row 2.
• The worksheet has no spaces or blank rows.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-4
Importing and Manipulating Data
Importing data from Excel
Because the data are well structured, you can open the Files of Type ➤ Excel
spreadsheet in MINITAB with no modifications. MINITAB
1 Choose File ➤ New and choose Minitab Project.
imports the data into a worksheet, putting the column
headings and data into the appropriate cells. 1 Choose File ➤ Open ➤ Minitab Worksheet.

File types 2 In Files of type, choose Excel (*.xls).


3 Choose BILLPAYMENT.XLS.
When you choose Open Worksheet, the default file type is a
MINITAB worksheet or project. If you choose to open a 4 Click Open.
project instead of a worksheet, Minitab allows you to open a
worksheet from that project. You can also open several other
file types directly in MINITAB.

Options and previews


When opening non-MINITAB files, the Options and Preview
dialog boxes are available. Use Options to specify column
names, column delimiters, where to begin reading data, and
how to convert missing value codes. Use Preview to see the
column names, data types, and rows before MINITAB opens
the file. You can also change column names and data types in
Preview.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-5
Importing and Manipulating Data
Interpreting your results
Examine the MINITAB worksheet to ensure that the data are
in the proper format.
Notice that the data appear in MINITAB exactly as they did in
Excel: that is, the variable names appear in cells above the
data. MINITAB also correctly formatted columns 2 and 3 as
dates, noted by the -D after the column number.

What's next
Calculate how long customers took to pay their bills.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-6
Importing and Manipulating Data
Making calculations
Before you can calculate the percentage of customers that Calculator
pay within a certain number of days, you first need to
1 Choose Calc ➤ Calculator.
calculate the number of days customers took to pay their
bills. 2 Complete the dialog box as shown below.
Use MINITAB’s Calculator to subtract the Sent column from
the Received column. This calculation results in a column
showing the number of days between these two dates.

3 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-7
Importing and Manipulating Data
Interpreting your results
The worksheet now contains a fourth column, Days, showing
the number of days from the date that the company sent the
bill to the date when they received the customer’s payment.

What's next
With this information, you can now perform a variety of
statistical and graphical analyses.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-8
Importing and Manipulating Data
Displaying descriptive statistics
Use descriptive statistics to summarize important features of Display Descriptive Statistics
the data. In particular, descriptive statistics provide useful
1 Choose Stat ➤ Basic Statistics ➤ Display Descriptive
information about the location and variability of the data
Statistics.
across customers. You can also display a histogram that will
provide useful information about the shape of the data. 2 In Variables, enter Days.
3 Click Statistics.
4 Complete the dialog box as shown below and click OK.

5 Click Graphs. Check Histogram of data.


6 Click OK in each dialog box.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-9
Importing and Manipulating Data
Interpreting your results
The statistics indicate that: Descriptive Statistics: Days
• Since the company’s inception, 2,745 bills were paid. Variable N N* Mean StDev Minimum Q1 Median Q3 Maximum
Days 2745 66 25.204 15.080 10.000 14.000 20.000 30.000 142.000
• The overall average payment time is about 25 days, while
the median is 20 days. These statistics indicate that the Variable Range IQR
Days 132.000 16.000
distribution of bill payment times is most likely skewed to
the right.
• The third quartile statistic (Q3) indicates that customers
paid approximately 75% percent of the bills within 30
days.
Note that 66 payments have not yet been received, as
indicated by the N* column. Minitab omits these values from
the calculations. Because this information is not accounted
for in the results, the interpretation of some of the statistics
may be biased.
Note One way to handle this situation is to replace the missing values (*)
with the number of days that have passed since the bill was sent,
then conduct an analysis treating these values as “censored” using
MINITAB's reliability analysis tools.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-10
Importing and Manipulating Data
Interpreting your results
The histogram indicates that the payment data are skewed to
the right.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-11
Importing and Manipulating Data
Coding data
Recall that managers want to determine the percentage of Code Numeric to Text
customers that paid their bills according to the following
1 Choose Data ➤ Code ➤ Numeric to Text.
categories:
• Less than 31 days 2 Complete the dialog box as shown below.

• Between 31 and 60 days


• Between 61 and 90 days
• Over 90 days
You can use MINITAB's data coding function to create a new
column of data that codes the original data into these
categories.
To code data values that are larger than 90 days into a new
value identified as Over 90, you must specify an upper limit.
Use a number outside the data range (such as 99,999) to
include all remaining values.
You also should code the unpaid bills currently denoted with
an asterisk.

3 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-12
Importing and Manipulating Data
Interpreting your results
Examine the new Categories column, which describes the
number of days customers took to pay their bills:
• Under 31 days
• 31 to 60 days
• 61 to 90 days
• Over 90 days
• Unpaid

What's next
Generate descriptive statistics and graphs using the
Categories variable.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-13
Importing and Manipulating Data
Displaying a Pareto chart
An appropriate way to summarize frequency data is with a Pareto Chart
Pareto chart. Pareto charts are a type of bar chart where the
1 Choose Stat ➤ Quality Tools ➤ Pareto Chart.
bars are arranged in order of descending frequency. A
cumulative percentage line helps you judge the added 2 Complete the dialog box as shown below. In Combine
contribution of each category. defects after the first, make sure to enter 99.9.
Note You can also use Graph ➤ Bar Chart to display frequency data.
However, in this case, a Pareto chart provides more information
than a bar chart.

Chart defects data in


Choose this option when the data are not summarized by
category and frequency. This option is appropriate for the
data in this example.

Chart defects table


Choose the chart defects table when the data are already
summarized by category and frequency; that is, if the data are
stored in MINITAB as shown below:
3 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-14
Importing and Manipulating Data
Interpreting your results
The Pareto chart shows the percentages and counts for the 5
categories you created in the categories column. The largest
category (73.8%) represents bills paid within 30 days. The
cumulative percent (Cum %) indicates that in 60 days or less,
93.7% of the bills are paid.
Note that 66 (2.3%) of the bills are unpaid. You should
consider why these bills are unpaid and how they may effect
the results. For example: Is a bill unpaid due to neglect from
a customer but it will be paid eventually? Is it unpaid because
the customer has not received it yet? And into which of the
four paid categories will these bills eventually fall?

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-15
Importing and Manipulating Data
Creating an Exec file
The analysis of the payment data included several steps: Save As
• Importing the data from Excel 1 Click to open the History folder.
• Calculating the number of days from when the company 2 Highlight all the session commands in the History window.
sent the bill to when it received payment (Click and drag a box around them; or click the first line, hold
• Calculating descriptive statistics down s, then click the last line.)

• Coding the days into categories


• Creating a Pareto chart
When you need to perform an analysis on a regular basis or
re-create a graph with unique features, you can automate the
task. In MINITAB, you can create a simple macro, called an
Exec, which enables you to easily reproduce an analysis or
graph.
While you are conducting analyses, MINITAB collects the
commands in the History folder. You can save these
commands as an Exec file. Once saved, you can run the Exec
to re-create the analysis.
For example, with the bill payment data, you can create an
Exec file with the commands used to perform the steps listed
above. Then, every month, as you obtain additional customer
payment information, you can run the Exec file to update the
analysis. The Exec performs all of the steps and reproduces 3 Right-click the selected text, then choose Save As.
the output and graph. 4 In File name, type PaymentMacro.
5 From Save as type, choose Exec files (*.MTB).
6 Click Save.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-16
Importing and Manipulating Data
Running an Exec file
Exec files are editable in most word processing or text editor Run an Exec
programs, such as Microsoft’s Notepad. For example, if you 1 Choose File ➤ New, then choose Minitab Project.
want to analyze another Excel file that has the same data
structure, but a different file name, you can open the Exec in 2 Choose File ➤ Other Files ➤ Run an Exec.
Notepad and change the file name.
3 Click Select File.
When you run an Exec, MINITAB automatically performs all 4 Double-click PAYMENTMACRO.MTB.
of the steps you previously performed manually.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-17
Importing and Manipulating Data
Interpreting your results
Because the data have not changed, the Pareto chart looks
like the one you created earlier.
Note To use an Exec file, column names must be consistent across your
worksheets, unless you use column numbers instead of names.
Also, your worksheet path and name must be the same.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-18
Importing and Manipulating Data
Final considerations

Summary and conclusions Additional considerations


You can use MINITAB to summarize data in many categories. During an analysis, you can examine basic numerical and
MINITAB's Code command converts raw data into separate graphical summaries. But you should also consider
categories, so a Pareto chart can easily summarize the results. additional variables that can influence the results. In this
example, information about the customer, particular bill, or
Based on the results, managers can decide whether the time of year might provide useful insight that can ultimately
payment rates are satisfactory or need improvement. accelerate customer payments.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-19
Importing and Manipulating Data

Importing and Manipulating Data


Example 2 Chemical Purity

Problem Data set


A semiconductor manufacturing facility uses a particular CHEMPURITY.XLS
chemical in one of its cleaning operations. The specifications
of the process dictate that the batches of chemical should
have a purity over 99.25%. The quality team discovers that Variable Description
some of the batches may not meet this purity level. The team Date Date that the material was received
investigates data from a sample of the batches to determine
whether or not this is true. Batch ID Batch identification number
Purity Purity level (%)
Data collection
Employees measured the purity levels for the last 104
batches received, then entered the information in a Microsoft
Excel spreadsheet.

Tools
• Importing data from Excel
• Editing worksheet
• Stack (Blocks of columns)
• Sort
• Graphical Summary
• Calculator
• Tally Individual Variables
• Time Series Plot

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-20
Importing and Manipulating Data
Importing data from Excel
Examine the structure of the data. Notice that the data are Data structure in the Excel file
contained in multiple tables. As a result:
1 Open Excel.
• The data for each variable are dispersed across multiple
columns. 2 Choose File ➤ Open.

• There are many empty rows. 3 Choose CHEMPURITY.XLS.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-21
Importing and Manipulating Data
Importing data from Excel
Because the data are in a series of tables, the worksheet Open the Excel file in MINITAB
requires some modifications after you import it into
1 In MINITAB, choose File ➤ New and choose Minitab Project.
MINITAB. The objective is to manipulate the data in MINITAB
so it looks like the following: 2 Choose File ➤ Open Worksheet.
3 In Files of type, choose Excel (*.xls).
4 Select CHEMPURITY.XLS.
5 Click Options.
6 Complete the dialog box as shown below.

Notice that each row represents a single batch of material.


Each variable is stored in a separate column: Date the
product was received (sorted chronologically), Batch ID
number, and Purity reading.
Note You can also modify the data prior to importing the worksheet.

7 Click OK.
8 Choose Preview.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-22
Importing and Manipulating Data
Interpreting your results
Examine the preview window to confirm that the data are
arranged in MINITAB as they should be. The variable names
are in the top row and the worksheet contains no empty rows.
Although the structure enables you to import the data, the
worksheet is not well-structured for analysis; the variables
Date, Batch ID, and Purity are spread over multiple columns.
You can restructure the worksheet after you import it into
MINITAB.

9 Click OK.
10 Click Open.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-23
Importing and Manipulating Data
Importing data from Excel
Examine the data in the MINITAB worksheet. Note that some File cleanup
modifications must be made to get the data into the desired
1 Highlight row 11 in the worksheet.
structure.
2 Press D.
First, delete the column headings from the original Excel
tables. Identify the rows with missing values and original 3 Repeat steps 1 and 2 for rows 21 and 31.
“Batch ID” headings, then delete them.

What's next
Combine data for the same variable into a single column.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-24
Importing and Manipulating Data
Stacking data
Currently, the data are contained in multiple sets of columns. Stack Blocks of Columns
Use MINITAB’s stacking function to move all the data into a
1 Choose Data ➤ Stack ➤ Blocks of Columns.
single set of three columns (Date, Batch ID, and Purity).
2 Complete the dialog box as shown below. Make sure to
uncheck Use variable names in subscript column.

3 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-25
Importing and Manipulating Data
Interpreting your results
The new MINITAB worksheet should now contain the three
variables, each in separate columns, although the columns
currently have no names. Each row corresponds to the date
received, batch ID, and purity level for a particular batch of
cleaning solution. The Subscripts column indicates from
which block of columns the data came.

What's next
Add column names and sort the data by the Date column.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-26
Importing and Manipulating Data
Sorting data
You can use MINITAB’s sort command to sort the data in Sort
ascending or descending order − numerically, alphabetically,
1 Enter column names in the new worksheet: in C2, enter Date;
or by date. In this example, sort the data by date, in ascending
in C3, enter Batch ID; and in C4, enter Purity.
order. Sorting the data by time makes it easy to plot over time
to look for patterns or trends. 2 Select the Subscripts column and press D.
Note Be sure to include all appropriate columns in the sorting step, to 3 Choose Data ➤ Sort.
preserve the connection between the different columns of data.
4 Complete the dialog box as shown below.
Before you sort the data, enter the column names in the
worksheet and delete the subscripts column, which is not
needed for this example.

5 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-27
Importing and Manipulating Data
Interpreting your results
Examine the data in the MINITAB worksheet to ensure that it
is structured correctly:
• The worksheet has three columns.
• One column contains the date values, sorted
chronologically.
• One column contains the batch ID number.
• One column contains the purity amount.
• Dates are sorted in ascending order.

What's next
Now that you have restructured the data, you can begin the
analysis.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-28
Importing and Manipulating Data
Creating a graphical summary
Recall that the quality team is interested in summarizing the Graphical Summary
purity ratings for the 104 batches. MINITAB’s graphical
1 Choose Stat ➤ Basic Statistics ➤ Graphical Summary.
summary is a convenient way to examine basic features of
the data. The graphical summary contains the following: 2 Complete the dialog box as shown below.
• Histogram
• Boxplot
• Confidence intervals for the mean, median, and standard
deviation
• Normality test
• Descriptive statistics
Note Most of the information in the graphical summary can also be
obtained using Stat ➤ Display Descriptive Statistics and Graph
➤ Histogram.

3 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-29
Importing and Manipulating Data
Interpreting your results
The histogram shows that the purity distribution is skewed
right, and indicates that the process is producing material
below the minimum desired purity level of 99.25%.
The median purity for these batches is 99.19%. Therefore, at
least half the batches have a purity rating below the
specification of 99.25%. This is strong evidence that the
supplier is not meeting the specification.

What's next
Calculate the proportion of batches that have a purity rating
below 99.25%.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-30
Importing and Manipulating Data
Calculating proportions
Another useful statistic is the proportion of the 104 batches Calculator
that have a purity level below 99.25%. First, use MINITAB’s
1 Choose Calc ➤ Calculator.
Calculator to create a new column indicating which batches
are below 99.25%. In the new column, MINITAB enters a 2 Complete the dialog box as shown below.
value of 1 when the expression is true and a 0 when it is false.
MINITAB stores the results in the next available column using
the name you specify.

3 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-31
Importing and Manipulating Data
Interpreting your results
In the new column, <99.25, a value of 1 indicates that the
purity level for the batch is below 99.25%; a value of 0
indicates the purity level for the batch is 99.25% or higher.

What's next
Use Tally Individual Variables to calculate the frequency and
percentage of batches below 99.25%.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-32
Importing and Manipulating Data
Tallying results
Use the new column you created with Calculator to Tally Individual Variables
determine the number and percentage of batches less than
1 Choose Stat ➤ Tables ➤ Tally Individual Variables.
than 99.25%. MINITAB counts the number of 0's and 1's in the
column, then calculates the proportion of batches less than 2 Complete the dialog box as shown below.
and greater than 99.25%.

3 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-33
Importing and Manipulating Data
Interpreting your results
The results indicate that: Tally for Discrete Variables: <99.25
• 67, or 64.42%, of the 104 batches have a purity level <99.25 Count Percent
0 37 35.58
below the specification of 99.25% 1 67 64.42
• 37, or 35.58%, of the 104 batches have a purity level N= 104

greater than or equal to the specification of 99.25%

What's next
Create a time series plot to view the data in time order.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-34
Importing and Manipulating Data
Creating a time series plot
Because the data in the worksheet are sorted by date, create a Time Series Plot
time series plot to provide insight into variation in the purity
1 Choose Graph ➤ Time Series Plot or Stat ➤ Time Series
levels over time.
➤ Time Series Plot.
MINITAB's Time Series Plot is a useful tool for examining 2 Choose Simple.
data in time order.
3 Click OK.
Time series plots display measurement data on the y-axis
versus a time index on the x-axis. To display the data 4 Complete the dialog box as shown below.
correctly, the data must be in time order in the worksheet.
Note The time series plot is most appropriate when data are collected
from fixed intervals from your process. In this application, note that
the x-axis is not a true time scale, but the sequence in which the
data were collected.

5 Click Time/Scale. Under Time Scale, choose Stamp.


6 In Stamp columns (1-3, innermost first), enter Date.
7 Click OK in each dialog box.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-35
Importing and Manipulating Data
Interpreting your results
The data appear to be random over time, with no obvious
trends or patterns. However, note that the purity levels of the
last 11 batches are relatively low, which may be an indication
of a trend forming. You need more information to verify a
trend.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-36
Importing and Manipulating Data
Final considerations

Summary and conclusions Additional considerations


The analysis reveals that over 64% of the batches in the If you store data in another software application and you plan
sample had measured purity levels below the specification of to analyze it later in MINITAB, consider ways to structure the
99.25%. The quality team needs to investigate possible data that make importing it into MINITAB easier. Anticipating
causes for the low purity levels with the supplier or consider the transfer to MINITAB can eliminate subsequent data
changing suppliers. manipulation steps.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-37
Importing and Manipulating Data

Importing and Manipulating Data


Example 3 Tracking Downtime

Problem Data set


A productivity improvement team wants to track and chart PRODUCTION.MDB
equipment downtime for two manufacturing lines.
Specifically, they want to create a summary chart of
downtime, by cause, for each line. After the downtime totals Variable Description
are charted, the quality team wants to analyze the distribution Date Date of downtime occurrence
of downtime for each reason code.
Downtime Amount of time equipment was down (minutes)
Data collection ID Database identification key

For each manufacturing line, daily production sheets track Line Manufacturing line (A, B)
downtime information. At the end of each day, the Reason Cause of downtime occurrence
manufacturing clerk enters information about each downtime
occurrence into a Microsoft Access database. Note that line Shift Shift in which downtime occurred
B does not operate on weekends.

Tools
• Query Database (ODBC)
• Pareto Chart
• Bar Chart
• Panel
• Dotplot

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-38
Importing and Manipulating Data
Importing data from databases
ODBC stands for “open database connectivity.” ODBC is a
protocol shared by many computer applications. To use
ODBC, you may need to install ODBC software on your
system. When ODBC is set up correctly, you can use the
MINITAB Query Database command to connect to a database
file and import the subset of data you want to analyze.
You can use the Query Database (ODBC) command to
directly access data in a database without the added step of
exporting the data from the database software package.
You can create data source names (DSNs) for commonly
accessed database files, which saves time because you do not
have to browse to find the file.
The Query Database (ODBC) function generates command
language that can be used in an Exec (a type of macro).

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-39
Importing and Manipulating Data
Importing data from databases
You can easily import data from a database into MINITAB. Query Database (ODBC)
When imported into a MINITAB worksheet, database files
1 Choose File ➤ New and choose Minitab Project.
often have the appropriate column structure that makes
analysis in MINITAB straightforward. 2 Choose File ➤ Query Database (ODBC).

File data source 3 Click the Machine Data Source tab.

Use the File Data Source tab to set up your own data source
names (DSNs), linking directly to database files you
commonly access.

Machine data source


The DSNs under this tab are determined by the software and
associated ODBC drivers installed on your computer.
Note You can use the Query Database command (ODBC) to open Excel
files by choosing Excel from the Machine Data Source tab. An
advantage to using ODBC is that you can choose the specific
worksheet from the Excel workbook you want to import.

4 Double-click MS Access Database.


5 Click OK in the Login dialog box.
6 Choose PRODUCTION.MDB.
7 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-40
Importing and Manipulating Data
Choosing data to import
From the Query Database (ODBC) dialog box, choose the Query Database (ODBC)
data fields to import into your MINITAB worksheet.
1 From Available tables, choose Downtime.
Available tables 2 Click to move all fields from Available fields to
Selected fields.
Use the Available tables drop-down menu to see a list of
tables and queries within your database file.
You can also use the Available tables drop-down menu to
import the results of a query already written in the database.

Selecting fields
Click the arrow buttons in the dialog box to choose the
database fields you wish to import. These fields become the
columns in your MINITAB worksheet.

Use rows
You can import specific data rows. For example, in this data
set you might be interested in a particular manufacturing line
or shift.

3 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-41
Importing and Manipulating Data
Interpreting your results
The fields you selected from the database are now the
columns in your MINITAB worksheet. The imported data are
ready for analysis.

What's next
Create a Pareto chart to summarize the downtime data.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-42
Importing and Manipulating Data
Creating a Pareto chart
An appropriate way to summarize and prioritize frequency Pareto Chart
data is with a Pareto chart.
1 Choose Stat ➤ Quality Tools ➤ Pareto Chart.
Chart defects data in 2 Complete the dialog box as shown below.
Choose this option when the data are not summarized by
category and frequency. In this example, you can chart the
defects in the Reason column, but the results would show
only the counts of each downtime reason, not the total
downtime minutes associated with each reason.

Chart defects table


When the data are summarized by category and frequency,
choose Chart defects table. In this example, the categories are
stored in the Reason column, and the frequencies are stored
in the Downtime column. To create the chart, MINITAB sums
the downtime minutes for each reason, then arranges the
reasons from greatest to fewest downtime minutes.
Note You can choose Graph ➤ Bar Chart to create Pareto charts by 3 Click Options.
using the sum function and ordering the bars by descending Y
values. 4 In Y axis label, enter Total Down Time (min).
5 Click OK in each dialog box.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-43
Importing and Manipulating Data
Interpreting your results
The bars on the chart indicate the total minutes of downtime
for each category (left y-axis scale) and percentages of total
downtime (right y-axis scale). The line above the bars
indicates cumulative percentages.
MINITAB also displays the following statistics for each defect
type:
• Count (in this example, total minutes)
• Percentage of total each category represents
• Cumulative percentage of total each category represents
The chart indicates that changeovers caused 9,885 minutes of
downtime, 44.5% of total downtime. Together, changeovers,
fixture alignment, and recalibrations account for 78% of all
downtime.
Note By default, MINITAB generates bars for each defect type until the
cumulative percent reaches 95, and then groups the remaining
categories into a bar named Others. You can change the cutoff
percent for the Others category.

What's next
Create a bar chart to examine differences between
manufacturing lines.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-44
Importing and Manipulating Data
Using a bar chart for Pareto analysis
To discover whether the two manufacturing lines experience Bar chart
the same downtime issues, use a bar chart to display the
1 Choose Graph ➤ Bar Chart.
downtime data by line. The bar chart provides a convenient
way of displaying frequencies for different groups. 2 Under Bars represent, choose A function of a variable.

3 Under One Y, choose Simple, then click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-45
Importing and Manipulating Data
Plotting a function of the Y variable
To set up the bar chart correctly, choose the function Sum so Bar Chart
that each bar represents the sum of the downtime minutes for
4 Complete the dialog box as shown below.
each reason. In Bar Chart Options, choose Decreasing Y to
display the bars in descending order of downtime minutes.
Use the By Variables tab in Multiple Graphs to create
individual bar charts for each level of another variable in
your data set. You can choose to display the charts on the
same page in separate panels or on different pages. In this
example, using Line as the By variable produces two bar
charts of downtime by reason, one for each manufacturing
line.

5 Click Bar Chart Options.


6 Under Order Main X Groups Based On, choose
Decreasing Y.
7 Click OK.
8 Click Multiple Graphs, then click the By Variables tab.
9 In By variables with groups in separate panels, enter Line.
10 Click OK in each dialog box.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-46
Importing and Manipulating Data
Interpreting your results
The left panel of the bar chart shows the total downtime for
each reason for line A; the right panel, for line B.

Setting priorities for downtime issues


MINITAB sets the order of the bars for both charts based on
downtime frequency for the two lines combined. In this
example, the largest two sources of manufacturing downtime
for both lines are changeovers and problems with fixture
alignment. Note the third largest source of downtime is
different for lines A and B: recalibration for line A, and the
laser marking system for line B.

Comparing line A and line B


Although the chart seems to indicates that line A has had
more downtime during this time period, remember that line B
does not operate on weekends.

What's next
Create a dotplot to examine the distribution of downtime for
each reason.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-47
Importing and Manipulating Data
Creating a comparative dotplot
Use a dotplot to look at the distribution of downtime minutes Dotplot
for each reason. A dotplot is often useful for comparing
1 Choose Graph ➤ Dotplot.
groups of data and assessing the shape of a distribution.
2 Under One Y, choose With Groups. Click OK.
A dotplot is similar to a histogram, but with two key
differences. 3 Complete the dialog box as shown below.
• A dotplot usually has finer resolution on the x-axis; in
fact, the x-axis often represents the actual data values.
• In a dotplot, each observation (or group of observations)
is represented by a dot, while in a histogram, many
observations are represented by a single bar.

4 Click OK.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-48
Importing and Manipulating Data
Interpreting your results
Notice the dotplot gives you information similar to a
histogram. Each downtime reason is in a separate panel,
allowing you to compare the distributions.

Frequency of occurrence
The stacked dots in this plot represent up to 3 observations
each. Categories with more dots occur more frequently.
For example, the plot indicates that changeovers and fixture
alignment problems are more frequent than other reasons.

Distribution of downtimes
Each downtime reason has a different distribution. Notice the
shape, spread, and central tendency of each distribution.
For example, notice that the downtime for recalibrations is
generally longer than the downtime for the laser marking
system. Also, the plot shows a tremendous amount of
variation in time to resolve a vacuum leak. This variation
may exist because the leak site is difficult to diagnose.

Explaining distributional shapes


You can possibly learn about data collection methods and
process behavior from these downtime distributions. For
example, more changeovers take exactly 25, 30 or 35
minutes. Perhaps operators are estimating the changeover
time to the nearest 5 minutes, instead of to the nearest
minute, resulting in inaccurate measurements.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-49
Importing and Manipulating Data
Final considerations

Summary and conclusions Additional considerations


The Pareto chart indicated that the top two downtime issues, The dotplots of downtime by reason give some insight into
changeovers and problems with fixture alignment, are the individual downtime occurrences. For example, total
largest sources of manufacturing downtime for both lines. downtime minutes could be a result of frequent problems that
The third largest source of downtime is different for each are fixed quickly or could result from fewer problems that
line: recalibration for line A and the laser marking system for take a long time to resolve. Patterns in the individual
line B. downtime distributions may give the improvement team a
better understanding of how and where downtime is
occurring, which should lead them to find appropriate
solutions.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-50
Importing and Manipulating Data
Exercise J Restaurant Complaints

Problem Data set


In an effort to improve customer satisfaction, managers of a RESTCOMPLAINT.XLS
hotel restaurant want to examine data associated with
customer complaints to identify potential causes.
Variable Description
Data collection Complaint Type of complaint (Food not cooked to order,
Poor food quality, Slow service, Too expensive,
During the period 11/27/2000–2/14/2001, management Wrong order)
recorded information about customer complaints including:
type of complaint, date of the complaint, shift during which Date Date of complaint
the complaint occurred, and cook staff and manager on duty Shift Shift when complaint occurred (Lunch, Dinner)
at the time of the complaint.
Cook Staff Cook staff on duty at time of complaint (A or B)

Instructions Manager Manager on duty at time of complaint (John,


Robert)
1 Import the data from Excel and examine the data.
2 Using Tally, identify the most frequently occurring
complaints.
3 Create a Pareto chart (chart defects table).
4 Do the type and frequency of the complaints differ for the
two shifts?
5 Do the complaints seem dependent on the cook staff and/
or the manager on duty?
6 Have specific combinations of Shift, Cook Staff, and
Manager led to frequent complaints?

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-51
Importing and Manipulating Data
Exercise K Sulfur Dioxide Levels

Problem Data set


Quality engineers at a beverage processing plant want to SULFURDIOXIDE.XLS
examine the variation in sulfur dioxide levels over different
time periods for different processing tanks and different
production runs. Variable Description
Date Date of production run
Data collection
Tank Processing tank (H1, H2, H4, H9, H10, H11)
During 9 different day-long production runs, operators used Time 1 SO2 level (in ppm) at time 1
6 different processing tanks to produce a beverage product.
Each tank was set with a sulfur dioxide level of 500 parts per Time 2 SO2 level (in ppm) at time 2
million (ppm). They measure sulfur dioxide levels at 4 fixed Time 3 SO2 level (in ppm) at time 3
times. Due to complications in the process, operators were
not always able to take all four measurements. Data are Time 4 SO2 level (in ppm) at time 4
stored in an Excel worksheet.

Instructions
1 Open the Excel worksheet SULFURDIOXIED.XLS in
MINITAB.
Note Blank rows in the Excel file separate the production runs.

2 How stable are the sulfur dioxide levels across the 4


measuring times?
3 How does the consistency compare among the 6 tanks
used for production?
4 Do the sulfur dioxide levels systematically change over
the 9 different production days?
Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-52
Importing and Manipulating Data
Exercise L Investigating Downtime Distributions

Problem Data set


A Pareto analysis in MINITAB determined that the two major DOWNTIME.MPJ
reasons for downtimes are changeovers and fixture
alignment. Investigate the individual downtime occurrences
for these two reasons. Variable Description
Date Date of downtime occurrence
Data collection
Downtime Amount of time equipment was down (minutes)
Daily downtime occurrences were recorded in a database and ID Database identification key
imported into a MINITAB worksheet.
Line Manufacturing line (A, B)
Instructions Reason Reason for downtime occurrence

1 Because you want to further investigate specific Shift Shift on which downtime occurred
downtime reason codes (changeovers and fixture
alignment), split the downtime worksheet by reason.
2 Are shifts recording changeover time the same way? Can
you explain the unusual spikes in the changeover time
data?
3 Can you explain the apparent bimodal (two-peaked)
distribution in the fixture alignment downtimes?

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-53
Importing and Manipulating Data
Exercise M Production Downtime

Problem Data set


You want to import downtime data from a production PRODUCTION.MDB
database. You are specifically interested in the downtime
associated with manufacturing line A.
Variable Description
Data collection Date Date of downtime occurrence
Downtime data are available in an ODBC database and you Downtime Amount of time equipment was down (minutes)
must import them into MINITAB. ID Database identification key
Line Manufacturing line (A, B)
Instructions
Reason Reason for downtime occurrence
1 Choose File ➤ New ➤ Minitab Worksheet.
Shift Shift on which downtime occurred
2 Choose File ➤ Query Database (ODBC), then choose
the PRODUCTION.MDB database.
3 Click Use rows, then import rows for which the Line
column indicates A.
Tip Use single quotes around the line indicator. The syntax used in this
dialog box is determined by the database being queried.

Importing and Manipulating Data Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5-54
6
Additional Examples and
Exercises

Objectives
• Create individual value plots, scatterplots, time series plots, boxplots, and matrix plots to display data.
• Use MINITAB tools to split worksheets, subset data, recode data, and create new variables.
• Calculate descriptive statistics to examine important features of data.
• Brush plots to identify outlying observations.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-1
Contents

Contents
Examples and Exercises Purpose Page
Example 1 Analyze call volume by subsetting a worksheet, time 6-4
Technical Support series plot, and descriptive statistics.
Example 2 Analyze project completion time using scatterplots. 6-26
Program Development Time
Example 3 Examine the relationship among variables and identify 6-32
Election unusual observations using a matrix plot and brushing.
Example 4 Analyze results from an experiment using the Calculator, 6-44
Contamination Level 3D scatterplot, and descriptive statistics.
Example 5 Analyze the relationship among variables by splitting a 6-52
Equipment Setup Time worksheet, creating graphs, and tallying variables.
Example 6 Analyze mutual fund performance by importing data from 6-74
Investment Performance Excel, recoding data, and creating plots.
Exercise N Analyze adhesive force using graphs and descriptive 6-87
Bond Strength statistics.
Exercise O Analyze the reliability of electronic components using 6-88
Electronic Component Reliability graphs and descriptive statistics.
Exercise P Analyze the accuracy of project development time using 6-89
Project Prediction graphs and descriptive statistics.
Exercise Q Analyze the performance of industrial belts using graphs. 6-90
Belt Wear

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-2
Contents

Examples and Exercises Purpose Page


Exercise R Examine the relationship between healthcare costs and 6-91
Healthcare Expenditures demographic variables using graphs.
Exercise S Examine call volume data using graphs. 6-92
Call Center Staffing

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-3
Additional Examples and Exercises

Additional Examples and Exercises


Example 1 Technical Support

Problem Tools
Managers are preparing a budget for the technical support • Subset Worksheet
department of a software firm. To help determine staffing • Extract from Date/Time
needs, they want to analyze typical tech support incidents • Time Series
(e.g., customer requests for help via phone and e-mail): • Brushing
• Value Order
• Does the number of tech support incidents vary by day of • Boxplot
the week? • Display Descriptive Statistics
• Do seasonal trends or patterns influence the number of
tech support incidents? Data set
TECHSUPPORT.MPJ
Data collection
A database that dates back to 1992 stores records of the
Variable Description
number of tech support incidents per day. The company
began offering technical support via the Internet in 2001 and Date Date of call (11/2/92–9/4/03)
as a result, the data prior to this time are obsolete. You need Incidents Number of tech support incidents
to analyze only the data collected since February 26, 2001.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-4
Additional Examples and Exercises
Subsetting the worksheet
To assess current personnel needs, you only need data on Subset Worksheet
tech support incidents occurring since February 26, 2001.
1 Choose File ➤ Open Project.
Using Subset Worksheet, create a separate worksheet that 2 Open TECHSUPPORT.MPJ.
contains only the recent data.
3 Choose Data ➤ Subset Worksheet.
What's next 4 In Name, enter Feb 26, 2001 onwards.
Create a column for the day of the week. 5 Under Include or Exclude, choose Specify which rows to
include.
6 Under Specify which rows to include, choose Rows that
match. Click Condition.
7 Complete the dialog box as shown below.

8 Click OK in each dialog box.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-5
Additional Examples and Exercises
Extracting from date/time data
Occasionally, the technical support staff handles customer Extract from Date/Time to Text
requests on weekends. Because management only needs to
1 Choose Data ➤ Extract from Date/Time ➤ To Text.
determine staffing for weekdays, delete the data for incidents
occurring on Saturday and Sunday. 2 Complete the dialog box as shown below.
Using Extract from Date/Time to Text, you can create a new
column that specifies the day of the week associated with
each calendar date.

3 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-6
Additional Examples and Exercises
Interpreting your results
The new Weekday column contains the day of the week
corresponding to each date value.

What's next
Create a new worksheet without the weekend calls.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-7
Additional Examples and Exercises
Subsetting the worksheet
Use the Weekday variable to create a worksheet that does not Subset Worksheet
include Saturday and Sunday calls.
1 Choose Data ➤ Subset Worksheet.
2 In Name, enter Feb 26, 2001 onwards - no weekends.
3 Under Include or Exclude, choose Specify which rows to
exclude.
4 Under Specify which rows to exclude, choose Rows that
match. Click Condition.
5 Complete the dialog box as shown below.

6 Click OK in each dialog box.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-8
Additional Examples and Exercises
Interpreting your results
The data now contain only the number of incidents that
occurred during weekdays from 2/26/2001 to 9/4/2003.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-9
Additional Examples and Exercises
Creating a time series plot
Because these data are in time order, a time series plot is a Time Series Plot
useful way to observe the data over time. Create a time series
1 Choose Graph ➤ Time Series Plot.
plot of the data to determine whether the number of tech
support incidents varies systematically during this period. 2 Choose Simple. Click OK.
Note The time series plot is most appropriate when data are collected 3 Complete the dialog box as shown below.
from fixed intervals from your process. In this application, note that
the x-axis is not a true time scale, but the sequence in which the
data were collected.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-10
Additional Examples and Exercises
Interpreting your results
The plot indicates that the number of incidents may change
cyclically during the time period. Several values are
unusually low relative to the majority of the data points but
you cannot tell what date these points correspond.

What’s next
Display the date on the x-axis to make it easier to determine
whether the cyclic pattern is related to seasonal effects and
on what days the outlying values occur.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-11
Additional Examples and Exercises
Using a date stamp
The default x-axis label is a simple count index that Edit Scale
corresponds to the worksheet row number. To aid in the
1 Double-click the x-axis on the time series plot.
interpretation of the results, you can display date values on
the x-axis using the Stamp function. 2 Click the Time tab.
3 Complete the dialog box as shown below.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-12
Additional Examples and Exercises
Interpreting your results
The date values now appear on the x-axis. Examining the
plot more closely shows that the center has fewer tech
support incidents at the end of the year.
The days with usually low numbers of incidents make it
difficult to interpret trends in the data. Removing those days
changes the y-axis scaling, making it easier to identify
trends.

What’s next
Identify the points that have unusually numbers of incidents.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-13
Additional Examples and Exercises
Brushing data points
If points correspond to a row in the worksheet, you can use Set ID Variables
the brushing feature. Brushing allows you to highlight points
1 Right-click anywhere in the time series plot.
on a graph to learn more about them.
2 Choose Brush.
Brushing mode turns your cursor into a pointing hand and
displays a brushing palette. The brushing palette, by default, 3 Right-click the plot again. Choose Set ID Variables.
contains the row numbers of the brushed points. The brushed
4 Complete the dialog box as shown below.
points are highlighted in light blue in the graph. Rows in the
worksheet that correspond to brushed points are marked with
a black dot.
Brushing can be especially useful for:
• Showing the characteristics of outliers
• Determining whether points that lie in a brushed region
share the same characteristics
• Subsetting data
Brush the outliers in the graph to determine the dates to
which they correspond.

5 Click OK.
6 Click and drag the cursor to form a rectangle around the
outlying points.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-14
Additional Examples and Exercises
Interpreting your results
The brushing palette appears on the graph, displaying the
row and date of each outlying observation. When you select
an outlying point in the brushing window, MINITAB
highlights that point on the graph.
The results indicate that the observations with unusually low
numbers of incidents occurred on or around major holidays.
To focus on scheduling for typical workdays, not holidays,
remove the outlying observations from the data set.

What’s next
Create a new worksheet without the outlying observations.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-15
Additional Examples and Exercises
Subsetting the worksheet
Use the brushed rows to create a new worksheet that does not Subset Worksheet
contain these observations.
1 Choose Data ➤ Subset Worksheet.

What's next 2 Complete the dialog box as shown below.

Create a time series plot using the new worksheet.

3 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-16
Additional Examples and Exercises
Creating a time series plot
Create a new time series plot without data from holidays, Time Series Plot
which enables you to see the seasonal pattern more clearly.
1 Choose Graph ➤ Time Series Plot.
2 Choose Simple. Click OK.
3 In Series, enter Incidents.
4 Click Time/Scale. Complete the dialog as shown below.

5 Click OK in each dialog box.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-17
Additional Examples and Exercises
Interpreting your results
The time series plot now contains data from 2/26/2001–9/4/
2003 excluding weekends and unusually low-volume days.
The data appear to have a seasonal trend, with a lower
number of incidents at the very end of the year. In general,
there is an overall increase in tech support incidents at the
start of 2002; the highest volume occurs in the late spring to
early summer of 2002.

What's next
Summarize the data by month to quantify the seasonal
effects.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-18
Additional Examples and Exercises
Extracting from date/time data
If you analyze the entire data set, instead of examining each Extract Date/Time
month or quarter, the seasonal effect may be hidden. By
1 Choose Data ➤ Extract From Date/Time ➤ To Text.
creating a month column, you can produce summary
statistics for each month that enable you to analyze the 2 Complete the dialog box as shown below.
seasonal effect.
Using Extract from Date/Time to Text, you can create a new
column that specifies the month associated with each
calendar date.

3 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-19
Additional Examples and Exercises
Ordering values
By default, MINITAB orders text values alphabetically in Value Order
graphs and Session window output. You can also order these
1 In the worksheet, select the Month column by clicking once
values differently.
in the column heading.
For example, for the Month variable, a more logical order is 2 Right-click and choose Column ➤ Value Order.
Jan-Dec; for the Weekday variable, a more logical order is
Sun–Sat. 3 Complete the dialog box as shown below.

4 Click OK.
5 Repeat steps 1–4 for the Weekday variable, using the order
Sun, Mon, Tue, Wed, Thu, Fri, Sat.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-20
Additional Examples and Exercises
Displaying a graphical summary
Create a boxplot of your data for each weekday and month to Boxplot
assess the mean, range, and shape of the data.
1 Choose Graph ➤ Boxplot.
2 Under One Y, choose With Groups. Click OK.
3 Complete the dialog box as shown below.

4 Click OK.
5 Repeat steps 1–4 for the Weekday variable.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-21
Additional Examples and Exercises
Interpreting your results
MINITAB creates each box and whisker (the lines extending
from the ends of the box) based on the spread of the data.
Assess the within-group variability by looking at the spread
in the interquartile range, whiskers, and outliers. Assess the
variability between months and between weekdays by
comparing the medians and the relative locations of the
interquartile ranges.
The boxplot for incidents by month shows that:
• The number of tech support incidents during January,
July, August, and December tend to be slightly lower than
the number of incidents in other months.
• February contains a point beyond the whisker, represented
by an asterisk. This point might be an unusual
observation, or outlier.
The boxplot for incidents by weekday shows that:
• The median number of requests for most days was about
135.
• Friday tends to have a slightly lower central tendency than
the other days.
Note You can also display statistics, such as the median and the first and
third quartiles by hovering the cursor over the boxes.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-22
Additional Examples and Exercises
Displaying descriptive statistics
Display a numerical summary of the data, including the first Display Descriptive Statistics
and third quartiles, the median, and the minimum and
1 Choose Stat ➤ Basic Statistics ➤ Display Descriptive
maximum values for calls by month and by weekday.
Statistics.
2 In Graph Variables, enter Incidents; in By Variables, enter
Month.
3 Click Statistics.
4 Complete the dialog box as shown below.

5 Click OK in each dialog box.


6 Repeat steps 1-5 for Weekday. In By Variables, enter
Weekday.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-23
Additional Examples and Exercises
Interpreting your results
The descriptive statistics provide an indication of the number Descriptive Statistics: Incidents
of incidents that can be expected for different months of the
Variable Month Minimum Q1 Median Q3 Maximum
year and days of the week. This information can help Incidents Jan 95.00 119.00 130.00 142.25 163.00
managers plan for staffing needs in the future. Feb 103.00 133.00 146.00 152.00 168.00
Mar 107.00 122.00 135.00 148.00 178.00
Apr 94.00 124.00 135.00 146.00 173.00
May 102.00 127.00 135.00 146.50 166.00
Jun 103.00 125.00 136.50 157.50 192.00
Jul 98.00 115.00 131.00 142.00 176.00
Aug 97.00 115.75 129.00 139.25 163.00
Sep 117.00 132.50 141.50 153.50 174.00
Oct 117.00 129.00 137.00 142.25 160.00
Nov 109.00 121.00 134.00 148.00 169.00
Dec 87.00 102.50 115.00 128.00 161.00

Descriptive Statistics: Incidents


Variable Weekday Minimum Q1 Median Q3 Maximum
Incidents Mon 87.00 122.75 132.00 148.00 178.00
Tue 97.00 123.75 135.50 148.00 186.00
Wed 96.00 124.00 135.00 148.00 181.00
Thu 94.00 126.00 135.00 147.50 192.00
Fri 95.00 116.25 128.00 138.75 173.00

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-24
Additional Examples and Exercises
Final considerations

Summary and practical conclusions Additional considerations


By creating new columns for day and month and using Trends or differences observed in sample statistics can be the
Subset Worksheet, you created a data set containing only the result of sample-to-sample variation or the result of a real
data needed for the analysis. Using time series plots and effect. You can use more advanced techniques, such as
descriptive statistics, you found that: ANOVA and time series analysis, to determine the presence
• January, July, August, and December tend to have fewer of significant effects. Subsequent training courses provide
tech support incidents than other months during the year, instruction for these advanced statistical tools.
indicating a potential seasonal trend in the data.
• Friday tends to have fewer incidents than the rest of the
weekdays.
• Overall, the number of tech support incidents is greater in
2002 and 2003 than in 2001.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-25
Additional Examples and Exercises

Additional Examples and Exercises


Example 2 Program Development Time

Problem Data
A development company is studying work performance on PROGRAMDEVELOP.MPJ
48 software projects. In particular, they want to know how
the size of the program is related to the development time.
Variable Description
In addition, managers have explored using one, two, or three
mid-development quality assurance checks on these projects. Size Total lines of code required
They want to examine how the number of quality checks Time Total number of programmer hours used for
influences total development time. development
QA Check Number of middevelopment QA checks (1-3)
Data collection
For the 48 projects, project managers record the size of the
program (measured by total number of lines of code),
development time (measured in total number of programmer
hours), and number of mid-development quality checks.

Tools
• Scatterplot

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-26
Additional Examples and Exercises
Examining the relationship between two quantitative variables
Use a scatterplot to examine the relationship between two Scatterplot
quantitative variables. Often, one of the variables is the
1 Open PROGRAMDEVELOP.MPJ.
predictor variable (X) and the other variable is the response
(Y). 2 Choose Graph ➤ ScatterPlot.
Because the company wants to determine whether the size of 3 Choose Simple. Click OK.
a software program influences the development time, in this
4 Complete the dialog box as shown below.
example, size is the explanatory variable and time is the
response.
Traditionally, the response variable is plotted on the y-axis
and the explanatory variable is plotted on the x-axis.

5 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-27
Additional Examples and Exercises
Interpreting your results
Examine the scatterplot to see whether a relationship
between the size of the program and the development time
exists.
• A positive relationship exists between these two variables.
As the size of the program increases, the development
time increases.
• You should investigate several outlying data values.

What's next
Categorize data values on the scatterplot by the QA check
variable.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-28
Additional Examples and Exercises
Examining the relationship between multiple variables
Adding a grouping variable to a scatterplot allows you to Editing symbols
examine the relationship between the two quantitative
1 Select and double-click all symbols on the graph.
variables for each level of the categorical variable and to
view the effect the categorical variable has on the response. 2 Click the Groups tab.
Create a scatterplot using QA check as a grouping variable to 3 Complete the dialog box as shown below.
examine the effect it has on development time.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-29
Additional Examples and Exercises
Interpreting your results
The plot indicates that projects of similar size with two or
three QA checks have lower development time than projects
with only one QA check.
Examine the outlying observations to determine if these
projects were unusual.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-30
Additional Examples and Exercises
Final considerations

Summary and practical conclusions Additional considerations


The scatterplots indicated that: To explore this relationship further, use regression analysis to
• Overall, a positive linear relationship exists between the create an equation that predicts the development time based
development time and program size. on the program size and the number of QA checks.
Regression, an analytical tool included in MINITAB, is
• Projects with two or three QA checks have lower covered in the Basic Statistics course.
development time.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-31
Additional Examples and Exercises

Additional Examples and Exercises


Example 3 Election

Problem Data collection


The presidential election of 2000 was one of the most An Excel worksheet contains the county and the number of
fascinating elections in United States history, not just votes for Gore, Bush, Nader, and Buchanan, before any
because of the narrow margin of votes separating George W. recounts.
Bush and Al Gore, but because of the controversial butterfly
ballot used in Palm Beach County, Florida. Apparently, a Tools
number of people complained that the format of the ballot
was confusing and that they inadvertently voted for Pat • Importing Excel file
Buchanan when they intended to vote for Al Gore. An • Matrix Plot
examination of the ballot (see following page) shows this • Brushing
complaint to be plausible.
Data set
Errors in voting can occur in various ways, such as
overvoting (punching more than one hole), undervoting ELECTION.XLS
(incompletely punching a single hole), and misvoting (voting
for a different candidate than intended). Through careful
examination, election monitors can obtain the number of Variable Description
voting errors of the first two types. However, they cannot County Name Florida county
know the number of misvoting errors or how many misvotes
should have been votes for Bush and how many for Gore. Gore Number of Gore votes

Use MINITAB’s graphical tools to determine whether the Bush Number of Bush votes
results in Palm Beach County were in fact unusual and Nader Number of Nader votes
possibly a result of the butterfly ballot.
Buchanan Number of Buchanan votes

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-32
Additional Examples and Exercises
Importing data from Excel
Open the Excel worksheet that contains the election data to Open Excel File
check the data structure. If the data are appropriately
1 Open Excel.
structured, the file can be open directly in MINITAB.
2 Choose File ➤ Open.
Note If the data are not well structured or you do not know what format
the data are in, MINITAB provides several options to convert the file
3 Choose ELECTION.XLS.
into a usable format.
4 Click Open.
The now-famous butterfly ballot

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-33
Additional Examples and Exercises
Interpreting your results
The structure of the election data is ideal for importing
directly into MINITAB.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-34
Additional Examples and Exercises
Importing data from Excel
Because the data are well structured, you can open the Open Excel File
spreadsheet in MINITAB with no modifications. MINITAB
1 In MINITAB, choose File ➤ Open Worksheet.
imports the data into a worksheet, putting the column
headings and data into the appropriate cells. 2 In Files of type, choose Excel (*.xls).
3 Choose ELECTION.XLS.
4 Click Open.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-35
Additional Examples and Exercises
Interpreting your results
Examine the MINITAB worksheet to ensure that the data are
in the proper format.
Notice that the data appear in MINITAB exactly as they did in
Excel; the variable names are in the appropriate cells above
the data. MINITAB also correctly formatted C1 as text data
and C2 through C5 as numeric.

What's next
Create a plot to examine the relationships among the number
of votes for each candidate.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-36
Additional Examples and Exercises
Examining relationships among multiple quantitative variables
Use matrix plots to examine the relationships among Matrix Plot
multiple quantitative variables. A matrix plot is a set of
1 Choose Graph ➤ Matrix Plot.
individual scatterplots displayed together. Such a display
provides an efficient way to see the meaningful relationships 2 Under Matrix of plots, choose Simple. Click OK.
among many pairs of variables.
3 Complete the dialog box as shown below.
Create a matrix plot to display scatterplots containing each
pair of candidates on one page to examine the relationships
of vote counts between candidates.

4 Click Matrix Options. Under Matrix Display, choose Lower


left.
5 Click OK in each dialog box.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-37
Additional Examples and Exercises
Interpreting your results
The plots show that a positive linear relationship exists
between candidate vote counts. This relationship is expected
because of the size of the counties: a county with more voters
tends to have more votes for each candidate than a county
with fewer voters.
Points that tend to fall away from the overall structure or
trend in the data may be outliers. Several points on the plot
seem to be outliers.

What's next
Examine unusual observations by brushing the graph.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-38
Additional Examples and Exercises
Brushing
Graphs allow you to investigate potential relationships Brush
between the points within a data set. However, after you
1 Right-click in the matrix plot. Choose Brush.
make a graph, you might need to learn more about a specific
point or group of points. 2 Click the outlier on the right side of the plot in the upper
corner at the intersection of the Buchanan column and the
Brushing allows you to highlight points on a graph to learn Gore row.
more about them. You can brush points that correspond to a
row in the worksheet.
Row numbers for each brushed point appear in the brushing
palette. A black dot indicates rows in the worksheet that
correspond to a brushed point.
Brushing mode turns your cursor into a pointing hand and
displays a brushing palette. The brushing palette, by default,
contains the row numbers of the brushed points. The brushed
points are highlighted in light blue in the graph.
Brushing can be especially useful for:
• Showing the characteristics of outliers
• Determining whether points that lie in a brushed region
share the same characteristics
• Subsetting data

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-39
Additional Examples and Exercises
Interpreting your results
The matrix plot now includes additional information from the
brushing feature. The outlier from Buchanan is highlighted
and corresponds to row 50. Notice that the point is
highlighted in every scatterplot in the matrix plot and is an
outlier in every plot.

What's next
Determine what county is associated with the brushed point.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-40
Additional Examples and Exercises
Identifying brushed points
Look at the county for each brushed point by choosing Set ID Set ID Variables
Variables.
1 In brushing mode, right-click the graph.
2 Choose Set ID Variables.
3 Complete the dialog box as shown below.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-41
Additional Examples and Exercises
Interpreting your results
The brushed point on the plot corresponds to Palm Beach
County. Of the 65 counties that had votes for Buchanan, 63
(97%) had 845 or fewer votes for him. Palm Beach County,
however, had 3407 votes for Buchanan, which seems
unusually high. On the ballot, Bush’s name appears right
above Buchanan’s name and Gore’s right below it. Many of
the Buchanan votes (maybe up to 2000 votes) possibly were
meant for either Bush or Gore. Of course, these votes became
the subject of great debate.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-42
Additional Examples and Exercises
Final considerations

Summary and practical conclusions Additional considerations


The matrix plot is a powerful tool that enables you to quickly If you create several graphs in separate windows, points that
examine associations among multiple variables in one graph. you brush in one graph are highlighted in the other graphs if
Using the plot, you identified one point on the plot that you have turned on the brushing feature for each graph. A
looked very different from the others. Brushing enabled you general principle in data analysis is to examine the overall
to identify that this point corresponded to Palm Beach structure in the data to discover any deviations from that
County. structure. At times, the unusual or unexpected results can
provide more information about the process under study than
Brushing is particularly interesting in a matrix plot, because the general features in the data.
you can see the data point brushed on one graph highlighted
in all of the others.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-43
Additional Examples and Exercises

Additional Examples and Exercises


Example 4 Contamination Level

Problem Tools
A chemical engineer believes that a second processing stage • Calculator
can reduce contaminants in the company’s final product by • 3D Scatterplot
over 25%—possibly as much as 40%. The second stage has • Descriptive Statistics (Tables)
two critical process variables (temperature and time) that the
engineer believes influence its efficacy. The engineer designs Data set
an experiment to discover whether the second processing
stage reduces the contamination level and to determine the DECONTAMINATE.MPJ
best settings for time and temperature.
Variable Description
Data collection
PPM Pre Contamination level (in ppm) before secondary
The engineer randomly selects several finished batches and processing
sends them through the secondary processing stage. To PPM Post Contamination level (in ppm) after secondary
determine the best process settings for time and temperature, processing
the engineer conducts the experiment at five different
combinations of temperature and time. To ensure the validity Temp Secondary processing temperature (Celsius)
of the results, the engineer repeats the experiment on three Time Secondary processing time (min)
different processors (A, B, and C).
Processor Processor used for experimental run (A, B, C)
Before and after secondary processing, the engineer
Order Experimental run order
measures the contamination level (in parts per million) of
each batch.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-44
Additional Examples and Exercises
Creating a new variable
Not all of the experiment batches have the same level of pre- Calculator
treatment contamination. Analyzing only the post-treatment
1 Choose File ➤ Open Project.
contamination levels provides misleading results. Instead,
analyze the difference in pre- and post-treatment levels. To 1 Open DECONTAMINATE.MPJ.
do this, create a new variable that represents the reduction in
contamination as a percent of the pre-treatment level. 2 Choose Calc ➤ Calculator.
3 Complete the dialog box as shown below.
Use MINITAB’s Calculator to create the variable Reduction %
using the following equation:
Reduction % = 100*(PPM Pre − PPM Post) / PPM Pre

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-45
Additional Examples and Exercises
Interpreting your results
The new variable displays the percent reduction in the
amount of contaminants in the chemical batches after the
second processing stage.

What’s next
Create a 3D scatterplot to graphically summarize the data.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-46
Additional Examples and Exercises
Creating a 3D scatterplot
To display a graphical representation of the chemical yield 3D Scatterplot
data, create a three-dimensional (3D) scatterplot.
1 Choose Graph ➤ 3D Scatterplot.
A 3D scatterplot is a useful way of summarizing data when 2 Choose With Groups. Click OK.
you want to examine how a response variable, such as
reduction in contamination level, is influenced by two 3 Complete the dialog box as shown below.
quantitative factors, such as temperature and time.
You can also include categorical variables in a 3D scatterplot.
MINITAB incorporates categorical variables by using different
symbols or colors for the different levels of the variable. In
this example, include the three processors (A,B, and C) on
the plot so you can compare the results across the different
processors.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-47
Additional Examples and Exercises
Interpreting your results
The 3D scatterplot indicates that:
• The second processing stage reduces the contaminant
levels by 24% to 30%, supporting the engineer’s
hypothesis.
• The highest reduction in contaminants occurs at a time of
25 minutes and a temperature of 74 degrees Celsius.
• The percent reduction values are relatively similar for the
other 4 process settings and across the 3 different
processors.
• A 40% reduction in contamination was not achieved in
this experiment.
Use MINITAB’s 3D Graph Tools toolbar to rotate the 3D
scatterplot so you can view the plot from different
perspectives, enhancing your ability to interpret it.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-48
Additional Examples and Exercises
Displaying descriptive statistics
Use descriptive statistics to summarize important features of Descriptive Statistics
the data. The sample mean provides the average of the
1 Choose Stat ➤ Tables ➤ Descriptive Statistics.
percent reduction in contamination for each combination of
the process variable settings. The standard deviation provides 2 In For Rows, enter Temp; in For Columns, enter Time.
a measure of the variability in percent reduction for each
combination of the process variable settings. 3 Click Associated Variables.
4 Complete the dialog box as shown below.

5 Click OK in each dialog box.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-49
Additional Examples and Exercises
Interpreting your results
MINITAB displays the mean and standard deviation for each Tabulated statistics: Temp, Time
combination of the process variable settings in a table. The Rows: Temp Columns: Time
results indicate that:
20 25 30 All
• For all factor settings, the second processing stage
reduced the contamination by 25% or more, on average, 72 26.10 * 25.25 25.67
1.122 * 0.801 1.030
supporting the engineer’s hypothesis. 6 0 6 12
• The highest average reduction (28.63%) occurs at a 74 * 28.63 * 28.63
temperature of 74 and a time of 25, indicating that these * 0.591 * 0.591
may be the best settings for temperature and time. These 0 12 0 12
settings also have the lowest standard deviations. 76 25.09 * 25.70 25.39
0.892 * 1.631 1.294
6 0 6 12

All 25.59 28.63 25.47 26.56


1.101 0.591 1.248 1.781
12 12 12 36

Cell Contents: Reduction % : Mean


Reduction % : Standard deviation
Count

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-50
Additional Examples and Exercises
Final considerations

Summary and practical conclusions Additional considerations


An experiment was designed to study the effect that a You can apply more advanced statistical procedures, such as
secondary processing stage has on reducing contaminants in regression and analysis of variance, to this problem. You can
a final product. The engineer speculated that a 25% to 40% use a residual plot, which is an important component of these
reduction in contaminants could be achieved. Using a 3- types of statistical procedures, to examine the experimental
dimensional graph and tabulated statistics, the evidence data to discover if any unknown factors influence the results.
indicates that a 25% reduction was achieved, but that the Subsequent training courses provide instruction for these
40% reduction requires further investigation and advanced statistical tools.
experimentation.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-51
Additional Examples and Exercises

Additional Examples and Exercises


Example 5 Equipment Setup Time

Problem Tools
In a short-run manufacturing environment, employees need • Importing Excel file
to customize production equipment for each specific job. • Scatterplot
Production managers conjecture that the required setup time • Paneling graphs
might be related to the production time of the job. They think • Individual value plot
• Brushing
that jobs with longer run times may also have longer setup
• Tally Individual Variables
times.
• Split Worksheet
Jobs can be classified as one of seven general types. The
managers want to examine both setup time and run time as Data set
they relate to job type.
SETUPTIME.XLS
Data collection
A production database exists in an Excel file that contains the Variable Description
setup time, run time, and job type for 197 jobs from Date Date job was run
2/2/2003 to 7/24/2003. The database also contains comments
Type Job type (A, B, C, D, E, F, G)
relating to the specific jobs.
Setup Time Setup time for job (rounded to the nearest 10
minutes)
Run Time Production time for job (in hours)
Notes Special comments related to either the set up
or running of job

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-52
Additional Examples and Exercises
Importing data from Excel
Open the Excel worksheet that contains the database to check Open Excel File
the data structure. If the data are appropriately structured, the
1 Open Excel.
file can be open directly in MINITAB.
2 Choose File ➤ Open.
Note If the data are not well structured, or you do not know what format
the data are in, MINITAB provides several options to convert the file
3 Choose SETUPTIME.XLS.
into a known format.
4 Click Open.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-53
Additional Examples and Exercises
Importing data from Excel
Note the structure of the data in Excel: Data Structure in an Excel File
• The variable names are in row 1.
• The data begin in row 2 and are contained in columns.
• The worksheet has no blank rows or columns.
This structure is ideal for importing directly into MINITAB.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-54
Additional Examples and Exercises
Importing data from Excel
Because the data are well structured, you can open the Open Excel File
spreadsheet in MINITAB with no modifications. MINITAB
1 In MINITAB, choose File ➤ Open Worksheet.
imports the data into a worksheet, putting the column
headings and data into the appropriate cells. 2 In Files of type, choose Excel (*.xls).
3 Choose SETUPTIME.XLS.

4 Click Open.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-55
Additional Examples and Exercises
Interpreting your results
Examine the MINITAB worksheet to ensure that the data are
in the proper format.
Notice that the data appear in MINITAB exactly as they did in
Excel; the variable names are in the appropriate cells above
the data. MINITAB also correctly formatted C1 as date/time
data, C2 and C5 as text, and C3 and C4 as numeric.

What's next
Create a scatterplot to examine the relationship between
setup time and run time.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-56
Additional Examples and Exercises
Examining the relationship between two quantitative variables
Use a scatterplot to examine the relationship between two Scatterplot
quantitative variables. Often, one of the variables is the
1 Choose Graph ➤ Scatterplot.
predictor variable (X) and the other variable is the response
(Y). 2 Choose Simple. Click OK.
Because management wants to determine whether the run 3 Complete the dialog box as shown below.
time influences the setup time, in this example, run time is
the explanatory variable and setup time is the response.
Traditionally, the response variable is plotted on the y-axis
and the explanatory variable on the x-axis.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-57
Additional Examples and Exercises
Interpreting your results
Examine the scatterplot to see whether a relationship exists
between the setup time and the run time required for the jobs.
• No apparent trend appears, indicating that there may not
be a relationship between these two variables.
• The data appear to be mostly contained within a central
cluster, with a handful of outlying data values.
Note A good approach when analyzing data is to look for general patterns
and structure in the data, then identify deviations from that structure.
Often the unusual observations in a data set reveal valuable
information about a process.

What's next
Categorize data values on the scatterplot by the seven job
types.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-58
Additional Examples and Exercises
Examining the relationship between multiple variables
Adding a grouping variable to a scatterplot allows you to Editing symbols
examine the relationship between the two quantitative
1 Select and double-click all symbols on the graph.
variables for each level of the categorical variable and to
view the effect the categorical variable has on the response. 2 Click the Groups tab.
Create a scatterplot using job type as a grouping variable to 3 Complete the dialog box as shown below.
examine its relationship with setup time and run time.
Note The grouping variable should be categorical in nature and should
not have too many levels, otherwise the plot may be cumbersome to
examine.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-59
Additional Examples and Exercises
Interpreting your results
The plot show that:
• No trend appears in the data.
• You cannot clearly determine whether setup times are
significantly different for the seven job types.
• Evidence suggests that run times are different for the
seven different job types, although this is difficult to
determine.
• The data values that stand out from the main cluster do
not appear to be from a specific job type.

What's next
Because the points overlap, examine these data separately for
each of the seven job types.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-60
Additional Examples and Exercises
Paneling a scatterplot by a grouping variable
Because the data points on the scatterplot overlap, you Paneling
cannot easily determine how job type relates to setup time
1 Right-click anywhere in the scatterplot.
and run time. Use MINITAB’s paneling feature to display a
separate scatterplot for each job type on one page. This 2 Choose Panel.
display makes it easy to compare the job types and their
relationships with setup time and run time. 3 In By variables with groups in separate panels, choose
Type. Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-61
Additional Examples and Exercises
Interpreting your results
Examine the setup and run times across the seven job types.
The plot indicates that:
• Jobs B and G appear to have the longest run times.
• Setup times across the job types do not appear to vary
significantly.
• No relationship between run time and setup time appears
to exist for any job type.

What’s next
Compare setup times for each job type.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-62
Additional Examples and Exercises
Examining the relationship between a quantitative and a categorical variable
An individual value plot is one way to examine the Individual Value Plot
relationship between a quantitative variable and a categorical
1 Choose Graph ➤ Individual Value Plot.
variable. Create an individual value plot to examine setup
times by job type. 2 Under One Y, choose With Groups. Click OK.
3 Complete the dialog box as shown below.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-63
Additional Examples and Exercises
Interpreting your results
Examine the individual value plot to compare the setup times
for the seven different job types.
Note Notice that MINITAB spreads (or jitters) the points along the x-axis.
Jittering prevents the points from overlapping. The direction and
amount of jitter is determined randomly each time the graph is
made. Because the x-variable is categorical, you can still determine
in what group each point belongs.

• Setup times across the job types do not appear to vary


significantly.
• Most of the values for job type G are lower than the other
job types, which may suggest this job type has a shorter
setup time. Notice, however, that job G has noticeably
less data than the other groups. You need more data to
confirm this conclusion.
• Many unusually large data values stand out and should be
investigated.
Note Although examining graphs is an important first step in any analysis,
more formal statistical approaches in MINITAB can lead to more
objective comparisons.

What’s next
Examine the unusual data values.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-64
Additional Examples and Exercises
Brushing data points
You can brush points on a graph to learn more about them. Set ID Variables
However, you can brush points only if they correspond to a
1 Right-click anywhere in the individual value plot.
single row in the worksheet.
2 Choose Brush.
The brushing palette identifies the row number of the
brushed point, in addition to the contents of any column you 3 Right-click the plot again. Choose Set ID Variables.
specify as an ID variable. Brush the outliers in the individual
4 Complete the dialog box as shown below.
values plot to look for notes about outlying jobs that may
explain the lengthy setup time.
Note Keeping notes is an important part of good data collection. Notes
often assist in interpreting results during the data analysis stage.

5 Click OK.
6 Click and drag the cursor to form a rectangle around the
outlying points.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-65
Additional Examples and Exercises
Examining unusual data values
The notes for all of the brushed points indicate a problem
during the job that delayed the setup, such as a power failure
or an incorrect job ticket. These notes provide important
information that help aid in the interpretation of the plot and
help the quality improvement effort.

What’s next
Create an individual value plot to examine runtime by job
type.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-66
Additional Examples and Exercises
Examining the relationship between a quantitative and a categorical variable
You can examine the relationship between a quantitative Individual Value Plot
variable and a categorical variable using an individual value
1 Choose Graph ➤ Individual Value Plot.
plot. Create an individual plot to examine run times by job
type. 2 Under One Y, choose With Groups. Click OK.
3 Complete the dialog box as shown below.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-67
Additional Examples and Exercises
Examining the relationship between a quantitative and a categorical variable
The plot indicates that:
• Although some similarities appear in run times, so do
some systematic differences. These indications might be
an inherent feature of the different job types. For example,
B and G type jobs might be more complicated and,
therefore, require longer run times. Or perhaps some job
types are for large-volume products that run longer to
meet customer demand.
• The few unusual values are worth investigating.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-68
Additional Examples and Exercises
Examining frequency of occurrences
By brushing the individual value plots, you learned that the Tally Individual Variables
outlying data points corresponded to jobs in which a problem
1 Choose Stat ➤ Table ➤ Tally Individual Variables.
or unusual situation occurred. Using Tally Individual
Variables, you can determine how many times each problem 2 Complete the dialog box as shown below.
in the Note column occurred.

3 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-69
Additional Examples and Exercises
Interpreting your results
Examine the table to see the frequency of each problem. Tally for Discrete Variables: Notes
Tip You can also display the data in this table with a Pareto chart.
Notes Count
Bearings needed cleaning 2
• Of the 197 jobs, 68 had a problem that may have affected C6 bracket broken during setup 4
the setup or run time of a job. These jobs are worth Control problems 9
investigating, especially if you want to reduce setup time Incorrect job ticket 11
Power failure on main station 5
or run time. Roller replacement required during setup 4
Shift change during setup 21
• A frequent problem is shift change during setup (21). This Team was 1 worker short 10
problem may be unavoidable, but you might want to Team was 2 workers short 2
investigate what effect shift change has on the setup and N= 68
*= 126
run times.
• The second most frequently occurring problem is
incorrect information on job ticket (11). If incorrect
information adversely affects the time required to either
setup or produce a job, you should try to discover what is
causing these mistakes.
Note In the Notes column, the total of the number of comments (68) plus
the number of missing values (126) is 194, not 197, the total number
of jobs. This is because the values in the last 3 rows of the Notes
column are missing. In text columns, MINITAB has no way of knowing
that these values are missing and not just empty cells. To resolve
this issue in any text column, type a blank space in the last cell of
the column.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-70
Additional Examples and Exercises
Examining subsets of data
Of the 197 jobs included in these data, 68 had problems, Split Worksheet
which are described in the Note column. You can summarize
1 In the last row of the Notes column, press the z once.
and examine the data separately for each problem; one way is
by splitting the data into separate worksheets, one for each 2 Choose Data ➤ Split Worksheet.
type of problem.
3 Complete the dialog box as shown below.

4 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-71
Additional Examples and Exercises
Interpreting your results
You now have a separate worksheet for each type of Original Worksheet containing
comment made in the Notes column, which allows you to worksheet jobs with no comments
explore each problem individually.
For example, you can analyze all jobs that did not have a
particular problem or all jobs that had a shift change during
setup.

Worksheet containing all jobs that


had a shift change during setup

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-72
Additional Examples and Exercises
Final considerations

Summary and practical conclusions Additional considerations


• Overall, no relationship appears between the setup time Because the jobs are in time order, you should investigate
and run time for these jobs. these data over time using MINITAB's time plot capabilities.
• The setup times appear to be reasonably similar for the Trends or other patterns can reveal other features in this
seven different jobs types. process not shown in the scatterplot or individual value plots.
• The run times for the seven job types have some
systematic differences.
• The presence of unusual events resulted in a significant
increase in the setup time or run time of several jobs.
• Shift change during setup and incorrect information
recorded on job tickets were the most frequently
occurring problems.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-73
Additional Examples and Exercises

Additional Examples and Exercises


Example 6 Investment Performance

Problem Data set


Managers at a financial institution want to summarize the INVESTMENT.XLS
performance of 30 different mutual funds in which they
currently invest. They want to compare the 1- and 5-year
returns of: Variable Description
• Different types of funds Name Name of the mutual fund
• Funds with different risk levels RiskLevel Level of risk for the fund (Low, Below Average,
• Small, medium, and large funds Average, Above Average, High)

• Growth versus value funds Type Fund category (LGrowth, LValue, MGrowth,
MValue, SGrowth, SValue)
The managers also want to compare the 1- and 5-year returns 1-yr 1-year percent return for fund
across all funds.
5-yr 5-year percent return for fund
Data collection
The managers gathered the 1-year and 5-year returns, risk
level, and fund category for each mutual fund. The data are in
a Microsoft Excel file.

Tools
• Importing from Excel
• Descriptive Statistics (Tables)
• Code Data
• Value Order
• Individual Value Plot

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-74
Additional Examples and Exercises
Importing data
Before you import the data into MINITAB, examine its current Importing Data from Excel
format in Excel.
1 Open Excel.
2 Open INVESTMENT.XLS.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-75
Additional Examples and Exercises
Importing data from Excel
Because the data are well structured, you can open the Open Excel File
worksheet in MINITAB with no modifications.
1 In MINITAB, choose File ➤ Open Worksheet.
• The variable names are in row 1.
2 In Files of type, choose Excel (*.xls).
• The data begin in row 2 and are contained in columns.
3 Choose INVESTMENT.XLS, then click Open.
• The worksheet has no blank rows or columns.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-76
Additional Examples and Exercises
Displaying descriptive statistics
Use descriptive statistics to summarize important features of Descriptive Statistics
the data. In particular, calculate the median, minimum, and
1 Choose Stat ➤ Tables ➤ Descriptive Statistics.
maximum of 1- and 5-year returns by fund type and by risk
level. 2 In For Rows, enter Type.
3 Click Associated Variables.
4 Complete the dialog box as shown below.

5 Click OK in each dialog box.


6 Repeat steps 1–5 for RiskLevel. In For Rows, enter
RiskLevel.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-77
Additional Examples and Exercises
Interpreting your results
The tables display the minimum, median, and maximum 1- Tabulated statistics: Type
and 5-year returns for the 6 fund categories and 5 risk-levels.
Rows: Type
The results indicate that:
• Overall, growth funds had high 1-year returns. The lowest 1-yr 5-yr
Median Median
1-yr
Minimum
5-yr
Minimum
1-yr
Maximum
5-yr
Maximum Count
performing growth fund is in the LGrowth family,
yielding 28.76%. The highest performing growth fund is LGrowth 50.98 39.37 28.76 33.65 103.0 42.11 5
LValue 18.59 28.22 17.96 27.47 28.0 38.99 5
in the SGrowth family, yielding 151.9%. In fact, this fund MGrowth 76.86 40.78 41.77 32.94 108.8 44.16 5
is the best overall performer for the past year. The median MValue 20.97 23.38 13.64 16.95 39.5 28.91 5
1-year returns for the small, medium, and large growth SGrowth 94.59 36.70 84.59 31.45 151.9 38.56 5
SValue 6.18 16.45 1.69 15.97 25.0 20.42 5
funds are all over 50%. All 36.43 32.31 1.69 15.97 151.9 44.16 30
• Value funds also performed well in this 1-year period,
Tabulated statistics: RiskLevel
though not as well as the growth funds. The lowest
performing value fund is in the SValue family, yielding Rows: RiskLevel
1.69%. The highest performing value fund is in the 1-yr 5-yr 1-yr 5-yr 1-yr 5-yr
MValue family, yielding 39.5%. The median 1-year Median Median Minimum Minimum Maximum Maximum Count
returns for the small, medium, and large value funds are
Above Average 84.59 38.56 25.010 16.45 151.85 44.16 11
considerably lower than growth funds. Average 20.70 29.23 5.080 16.23 50.98 39.12 6
• In general, the above-average and high-risk funds have Below Average
High
18.59
85.04
27.47
37.99
13.640
76.860
16.95
31.68
39.47
94.59
33.65
40.78
9
3
the highest returns. However, over the 5-year period, the Low 1.69 15.97 1.690 15.97 1.69 15.97 1
difference in the returns between the higher risk funds and All 36.43 32.31 1.690 15.97 151.85 44.16 30

the average and below-average risk funds is not as great.

What's next
Generate a plot showing each fund’s performance by size
(Small, Medium, Large) and Category (Growth, Value).

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-78
Additional Examples and Exercises
Examining individual values by type
Often, it is easier to examine patterns in data with a graph. Individual Value Plot
You can compare the 1- and 5-year returns on investment for
1 Choose Graph ➤ Individual Value Plot.
the mutual funds using an individual values plot, which
shows the actual performance of each fund in the portfolio. 2 Under Multiple Y's, check With Groups. Click OK.
Examine the 1- and 5-year returns by type of fund.
3 Complete the dialog box as shown below.

4 Click Data View.


5 Under Data Display, check Median Symbol.
6 Click OK in each dialog box.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-79
Additional Examples and Exercises
Interpreting your results
The individual value plot shows the individual data points
organized by type of fund. The blue symbol with an x
represents the median value for a group. The plot indicates
that:
• The 1-year returns for growth funds are much greater than
the 1-year returns for value funds, although the difference
is much smaller for 5-year returns.
• In general, more variation exists in 1-year returns than in
5-year returns.
Note Use MINITAB’s brushing command to identify individual funds on the
plot.

What’s next
Plot the return by fund size and risk level after you recode the
data in the risk and type columns.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-80
Additional Examples and Exercises
Coding data
To simplify the interpretation of the fund returns by risk Code Text to Text
level, recode the risk levels into fewer categories.
1 Choose Data ➤ Code ➤ Text to Text.
Reducing risk categories 2 Complete the dialog box as shown below.
The current risk level variable has five categories: Low,
Below Average, Average, Above Average, and High. Recode
them into three levels (Low, Average, High) by coding Below
Average as Low and Above Average as High. Do not change
the funds already categorized as Low and High.
Store the results in a new column called Risk.

3 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-81
Additional Examples and Exercises
Coding data
To simplify the interpretation of fund return by fund size, Code Text to Text
recode the fund categories into small, medium, and large.
1 Choose Data ➤ Code ➤ Text to Text.
Recoding the fund sizes 2 Complete the dialog box as shown below.
From the variable Type, create a new column that indicates
the fund size (Small, Medium, Large); that is, group SValue
and SGrowth into Small, MValue and MGrowth into
Medium, and LValue and LGrowth into Large. Store the new
sizes in a variable called Size.

3 Click OK.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-82
Additional Examples and Exercises
Ordering values
By default, MINITAB orders text values alphabetically in Value Order
graphs and Session window output. However, you can order
1 Choose the Risk column in the worksheet.
these values differently.
2 Right-click in the Risk column, then choose Column ➤
For example, a more logical order for the Risk variable is Value Order.
Low, Average, and High instead of Average, High, and Low.
For the Size variable, an order of Small, Medium, and Large 3 Complete the dialog box as shown below.
is more logical than Large, Medium, and Small.

What's next
Use the individual plot to investigate the new coded
variables.

4 Click OK.
5 Repeat steps 1–4 for Size, using Small, Medium, and Large
as the order.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-83
Additional Examples and Exercises
Examining individual values by size and risk
You can compare the 1- and 5-year returns on investment for Individual Value Plot
the mutual funds using an individual values plot, which
1 Choose Graph ➤ Individual Value Plot.
shows the actual performance of each fund in the portfolio.
Examine the 1- and 5-year returns by size and risk. 2 Under Multiple Y's, choose With Groups. Click OK.
3 Complete the dialog box as shown below.

4 Click Data View.


5 Under Data Display, check Median Symbol.
6 Click OK in each dialog box.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-84
Additional Examples and Exercises
Interpreting your results
The plot shows that:
• Within each fund size, the median return tends to increase
with risk level.
• The portfolio contains no medium size, average risk
funds.
• More variability exists in 1-year returns than in 5-year
returns.
Note Use MINITAB’s brushing command to identify individual funds on the
plot.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-85
Additional Examples and Exercises
Final considerations

Summary and practical conclusions Additional considerations


• The high-risk funds in this portfolio generally performed You can incorporate other variables into your analysis to
better than the low- and average-risk funds. provide additional insight into fund performance. For
• The growth funds in this portfolio generally performed example, you can analyze and compare fund returns relative
better the value funds during the 5-year period. to fund sector or fund expenses.
• The 5-year returns across funds are much more consistent As in all statistical analyses, exercise caution when using
than the 1-year returns. data from the past to predict future performance. Many
factors, which can be difficult to measure, influence results
and can change over time.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-86
Additional Examples and Exercises
Exercise N Bond Strength

Problem 4 Decide which formulations meet the minimum bond


strength requirement of 50 kg.
An engineer is planning an experiment to study the force
required to separate two components held together by an 5 Which formulations would you recommend based on
adhesive bonding agent. The objective of the experiment is to your analysis?
find the strongest adhesive formulation. The application
dictates a minimum strength of 50 kg. Data set
BONDSTRENGTH.MPJ
Data collection
The engineer is considering 6 different adhesive
formulations. An important characteristic of the adhesive is Variable Description
its ability to maintain strength over time. Using each Pull Force Force (kg) required to separate components
formulation, the engineer prepares 72 assemblies so that 24
samples of each formulation can be pull-tested every three Formulation Formulation used to bond the assembly
months to assess strength degradation. Month Age of the bond (months)

Instructions
Use graphs and descriptive statistics to answer the following
questions:
1 Compare the average bond strength between the six
formulations.
2 Do any of the formulations have significantly more
variability?
3 Determine if bond strength degrades over time for any of
the formulations.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-87
Additional Examples and Exercises
Exercise O Electronic Component Reliability

Problem Data set


An engineer needs to examine the reliability of three RELIABILITY.MPJ
different types of electronic components to find the one that
has the longest survival rate when subjected to various
conditions of an accelerated stress test. Variable Description
Temperature Temperature of testing chamber (°C)
Data collection
Humidity Relative humidity of testing chamber
The engineer designs an accelerated stress experiment that Type Type of electronic component tested (II-V9,
subjects sample parts to four levels of temperature (100, 120, II-V10, S9)
140, and 200 °C) and three levels of humidity (45, 55, and 65
%RH). The experiment requires that four of each component C4–C7 Number of cycles until failure for each of four
type (II-V9, II-V10, S9) be subjected to each stress components
combination. The engineer records the number of cycles until
the component fails.

Instructions
1 Analyze these data to determine what effect the
accelerating variables, temperature and humidity, have on
the reliability of the electronic components.
2 Determine which component type seems the most
reliable.
Notice that the data are unstacked.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-88
Additional Examples and Exercises
Exercise P Project Prediction

Problem Instructions
A software firm builds custom computer solutions. Before a 1 Examine the data to determine whether the amount or
new project begins, the firm forecasts the number of type of experience influence the forecast values that a
production days needed to complete the project, in order to programmer provides.
allocate the appropriate resources. 2 The project lasted 315 days. Which group’s estimate was
In the past, forecasts have tended to underestimate the actual closest to the actual days required?
development time for projects, which has caused scheduling
problems and a miscalculation of development costs. Data set
A project manager decides to see whether different FORECAST.MPJ
characteristics of the company’s programmers influence the
accuracy of forecasts.
Variable Description
Data collection Forecast Estimated number of days the project will take
The manager studies two characteristics of the programmers: Amount of Number of years experience
the years of programming experience (less than 3 years, 3–5 Experience
years, and more than 5 years) and the type of experience Type of Type of project experience
(small or large project). He asks three programmers from Experience
each of the six different combinations to estimate the number
of days a particular project will take.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-89
Additional Examples and Exercises
Exercise Q Belt Wear

Problem Data set


A manufacturer supplies belts for industrial motors. The BELTWEAR.MPJ
manufacturer carries 4 different lines of belts (TR-33A, TR-
33X, TY-91A, and TY-91X). The 91 series uses an additive
intended to impede wear, which makes them more expensive Variable Description
than the 33 series. The X belts are designed to perform better Belt Type Type of belt
at higher speeds than the A belts. The manufacturer wants to
compare the performance of each belt type. Hours Time in service until belt showed signs of wear
Speed Average operating speed of equipment (rpm)
Data collection
Working with their customers, the manufacturer conducts a
study of the belts. To measure performance, customers
recorded the number of hours at which belts showed a
predefined level of wear.
The operating speed at which the belts are used has a
significant influence on their lifetime. Customers recorded
the average speed (measured in rpm) at which the belts were
operated during the usage period.

Instructions
1 Examine the data generated in this study and compare the
performance of the four different types of belts.
2 Using a scatterplot, determine what effect speed has on
belt life.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-90
Additional Examples and Exercises
Exercise R Healthcare Expenditures

Problem Data set


A county health care system has the second highest costs HEALTHCARE.MTW
among 23 counties that have healthcare systems of similar
size. Administrators want to compare the health care costs, as
well as several other relevant characteristics, across the 23 Variable Description
counties to determine whether their high health costs have Hos Exp Yearly health care expenditures per capita for
any reasonable justification. county
Pop Dens Number of residents per square mile
Data collection
Birth Rate Birth rate per 1,000 population
Administrators gather the yearly health care expenditures for
the 23 different counties along with some other important Over65 % population over age 65
variables that often impact health costs: population density, % Poverty % population below poverty level
birth rate, percent of population over age 65, and percent of
population below the poverty level. County County (county of interest =1; other counties = 0)

Instructions
1 Generate graphs to compare the health care expenditures
for the 23 different counties to determine whether the
county in question is unusually large.
2 Determine whether the counties differ on the other
characteristics and whether they seem to relate to health
care costs.

Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-91
Additional Examples and Exercises
Exercise S Call Center Staffing

Problem Data set


The customer service manager at a call center wants to study STAFFING.MTW
the percentage of calls that are on hold for less than 30
seconds. A benchmark study indicates that the industry
generally strives to answer 75% of all calls within 30 seconds Variable Description
after being put on hold. Calls Number of incoming calls
30sec Number of calls in which customer was on hold
Data collection
for under 30 seconds
From 1/1/2000 to 3/31/2000, the call center records the total People Number of support specialists available during
number of calls answered during the AM and PM shifts, shift
along with the number of customer support specialists
available during each shift. In addition, they record the
number of calls in which the customer was on hold for less
than 30 seconds.

Instructions
1 Create two new variables, using Make Patterned Data, for
Date and Shift.
2 Create a new response column, the percentage of calls
answered in 30 seconds.
3 Create several graphs, such as a graphical summary,
boxplot, and time series plot, to examine the data. What
do the three months of data reveal about the process?
4 Estimate the number of people who should be on staff to
achieve the benchmark, given recently observed call
volumes.
Additional Examples and Exercises Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6-92
Exercise Solutions
Exercise Page
Data Entry and Basic Charts
Exercise A 3
Solar Cell Manufacturer (2-29)
Exercise B 5
Quality Assurance Testing (2-30)
Exercise C 6
Travel Expenses (2-31)
Patterned Data
Exercise D 8
Investigating Variability in Surface Finish (3-25)
Exercise E 13
Customer Focus Group (3-26)
Exercise F 15
Parts Sampling (3-27)
Basic Data Analysis
Exercise G 16
Linen Process (4-34)
Exercise H 19
Call Center Data (4-35)
Exercise I 21
Online Application (4-36)
Importing and Manipulating Data
Exercise J 24
Restaurant Complaints (5-51)
Exercise K 28
Sulfur Dioxide Levels (5-52)
Exercise L 31
Investigating Downtime Distributions (5-53)

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1
Exercise Page
Exercise M 33
Production Downtime (5-54)
Additional Examples and Exercises
Exercise N 34
Bond Strength (6-87)
Exercise O 35
Electronic Component Reliability (6-88)
Exercise P 36
Project Prediction (6-89)
Exercise Q 38
Belt Wear (6-90)
Exercise R 40
Healthcare Expenditures (6-91)
Exercise S 42
Call Center Staffing (6-92)

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2
Exercise A Solar Cell Manufacturer (2-29)
1 Open SOLAR.MPJ.
2 Choose Graph ➤ Bar Chart.
3 In Bars represent, choose Values from a table.
4 Under One column of values, choose Simple. Click OK.
5 In Graph variables, enter 2000, 2001, and 2002; in Categorical variable, enter Month.
6 Click OK.

The charts indicate that there is


little difference in the production
of solar cells from year to year.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3
To compare production levels with the managers’ goals, add reference lines to the 2002 graph.
1 Right-click the bar chart for 2002.
2 Choose Add ➤ Reference Lines. In Show reference lines of Y (value scale) positions, enter
23000 26000.
3 Click OK.

The graph indicates that the goals


set by management for 2002 are
not being met. The manufacturer
met the goal of 26,000 units in
only 3 months (September,
October, and November). The
production goal of 23,000 units
was achieved in 2 of the 4 months
(July and August).
Note the production goal of
26,000 units was met in 2000 and
2001 in the months of September,
October, and November.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4
Exercise B Quality Assurance Testing (2-30)
1 Open BUGS.MPJ.
2 Choose Stat ➤ Quality Tools ➤ Pareto Chart.
3 Choose Chart Defects Table.
4 In Labels in, enter Bug Type; in Frequencies in, enter Frequency.
5 Click OK.

During product testing, 3 main types


of bugs appear: Functionality,
Performance, and Navigation. When
combined, these bugs make up
65.3% of the total, with each type
contributing about the same
proportion (20%). Together with
Layout and Copy Specification,
these 5 bug types account for 80% of
all bug occurrences.

A Pareto chart based solely frequencies assumes that each type of item costs the same to fix in
terms of time and money. If they do not cost the same, a better approach would be to weigh the
items by their cost, then draw the Pareto chart using weighted values.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5
Exercise C Travel Expenses (2-31)
1 Open TRAVEL.MPJ.
2 Choose Graph ➤ Bar Chart.
3 In Bars represent, choose Values from a table.
4 Under One column of values, choose Simple. Click OK.
5 In Graph variables, enter North South Central; in Categorical variable, enter Month.
6 Click Multiple Graphs. Under Show Graph Variables, choose In separate panels of the same
graph.
7 Click OK in each dialog box.

Roughly the same pattern across


months appears for each sales team.
The travel expenses are higher in the
Spring and Fall and lower in the
Summer and in December. Looking
closely at the scales on the y-axis,
the Central sales team is spending
more on travel expenses.

Calculate the total yearly expenditures of each sales team.


1 Choose Calc ➤ Column Statistics.
2 In Input variable, enter North. Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 6
3 Repeat steps 1 and 2 for the South and Central teams.

The statistics reveal that the North Sum of North


team spent a total of $88,103; the Sum of North = 88103
South team, $89,203; and the
Central team, $125,656. Sum of South
Sum of South = 89203

Sum of Central
Sum of Central = 125656

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 7
Exercise D Investigating Variability in Surface Finish (3-25)
1 Open ROUGHNESS.MPJ.
2 Choose Calc ➤ Make Patterned Data ➤ Simple Set of Date/Time Values.
3 In Store patterned data in, enter DateTime.
4 In Start date, enter 9/15/2003 6:00am; in End date, enter 9/16/2003 1:00am.
5 In Increment, choose Hour. In List each value, enter 5. Click OK.
6 Choose Calc ➤ Make Patterned Data ➤ Simple Set of Numbers.
7 In Store patterned data in, enter Shift.
8 In From first value, enter 1; in To last value, enter 2.
9 In List each value, enter 50. Click OK.
10 Choose Calc ➤ Make Patterned Data ➤ Simple Set of Numbers.
11 In Store patterned data in, enter Fixture.
12 In From first value, enter 1; in To last value, enter 5.
13 In List each value, enter 1. In List the whole sequence, enter 20.
14 Click OK.

Your worksheet should look like the


one on the right.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 8
Create a graphical summary to examine the data.
1 Choose Stat ➤ Basic Statistics ➤ Graphical Summary.
2 In Variables, enter Roughness. Click OK.

As suggested by the leak test


failures, the process is producing
some parts with surface roughness
higher than the specification of 64.

Graph the data by shift and by fixture using boxplots.


1 Choose Graph ➤ Boxplot.
2 Under One Y, choose With Groups. Click OK.
3 In Graph variables, enter Roughness. In Categorical variables, enter Shift. Click OK
4 Press c+E.
5 In Categorical variables, enter Fixture. Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 9
The graph indicates that the most
extreme values were produced by
the second shift. Both shifts seem to
be producing parts with
approximately the same median or
central tendency.

The graph indicates that fixture 5 is


contributing most to the overall
variation in the roughness of parts.
The measurements from fixture 5 are
positively skewed. Further
investigation of fixture 5 may
provide more insight into the nature
of the variability observed.

Create a dotplot to examine shift and fixture on the same plot.


1 Choose Graph ➤ Dotplot.
2 Under One Y, choose Stack Groups. Click OK.
3 In Graph variables, enter Roughness. In Categorical variables, enter Fixture Shift.
4 Click Data View. In Categorical variables for attribute assignment, enter Shift.
5 Click OK in each dialog box.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 10
The dotplot does not provide
conclusive evidence of an interaction
between Shift and Fixture. However,
the most extreme values of surface
roughness were produced during
shift 2 on fixture 5.

Subset the worksheet by fixture 5 so you can evaluate the data for fixture 5 separately.
1 Choose Data ➤ Subset Worksheet.
2 In Name, enter Fixture 5.
3 Under Include or Exclude, choose Specify which rows to include.
4 Under Specify which rows to include, choose Rows that match. Click Condition.
5 In Condition, enter 'Fixture' = 5.
6 Click OK in each dialog box.

Create a time series plot to examine the data for fixture 5 over time.
1 Choose Graph ➤ Time Series.
2 Choose Simple, then click OK.
3 In Series, enter Roughness. Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 11
The plot shows a trend of increasing
variability over time.

After a full analysis of the data, the engineers and operators determined that fixture 5 was not
consistently clamping the parts. Depending on the size of the part, the clamping force was
sometimes firm (large parts) and sometime loose (smaller parts). The parts that were loosely
clamped vibrated in the fixture, causing poor surface finish.
The operator on the second shift was trained how to adjust the fixture clamping force to ensure
consistent surface finish.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 12
Exercise E Customer Focus Group (3-26)
1 Open CUSTFOCUS.MPJ.
2 Choose Data ➤ Split Worksheet.
3 In By variables, enter Status, Gender, and Ownership.
4 In the worksheet named (Status=Gold, Gender=Male, Ownership=6-month), choose Calc ➤
Random Data ➤ Sample from Columns.
5 In Sample, enter 6. In rows from column(s), enter Customer ID Status Gender Ownership.
6 In Store samples in, enter c6 c7 c8 c9.
7 Click OK.
8 Repeat steps 4−7 for the other three worksheets that include customers with gold status.

The above worksheet is one possible sample from customers who are gold status and male with
six-month ownership. Your sample will not match the sample shown above.
Next, copy and paste the sampled customer data into a new MINITAB worksheet.
1 Choose File ➤ New, then choose Minitab Worksheet. Label the first four columns Customer ID,
Status, Gender, and Ownership, respectively.
2 In the worksheet named (Status=Gold, Gender=Male, Ownership=6-month), select sampled
customers in the worksheet, then choose Edit ➤ Copy Cells. Make sure to select all four columns
of data.
3 Go to the new worksheet. Place your cursor in the first cell in the column C1, then choose Edit ➤
Paste Cells.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 13
4 Repeat steps 2 and 3 for the remaining worksheets.

An example of the new MINITAB


worksheet with the sampled
customers is shown on the right.
Your worksheet will not match
this worksheet.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 14
Exercise F Parts Sampling (3-27)
1 Choose Calc ➤ Make Patterned Data ➤ Simple Set of Numbers.
2 In Store patterned data in, enter Pallet.
3 In From first value, enter 1; in To last value, enter 1250; in In steps of, enter 1.
4 In List each value, enter 100; in List the whole sequence, enter1. Click OK.
5 Choose Calc ➤ Make Patterned Data ➤ Simple Set of Numbers.
6 In Store patterned data in, enter Part.
7 In From first value, enter 1; in To last value, enter 100; in In steps of, enter 1.
8 In List each value, enter 1; in List the whole sequence, enter 1250. Click OK.
9 Choose Calc ➤ Random Data ➤ Sample From Columns.
10 In Sample, enter 150; in rows from columns, enter Pallet Part.
11 In Store samples in, type 'Sample Pallet' 'Sample Part'. Click OK.
12 Choose Data ➤ Sort.
13 In Sort columns, enter 'Sample Pallet' 'Sample Part'.
14 In By column, enter 'Sample Pallet'.
15 Choose Original columns, then click OK.

The sampled parts are shown in the


worksheet on the right. Note that the first
two columns of your worksheet should
match the worksheet on the right, while the
Sample Pallet and Sample Part columns
will not.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 15
Exercise G Linen Process (4-34)
Because the amount of material produced each month varies, you should look at a defect rate.
Use the MINITAB calculator to figure the number of defects for each type per meter of material.
1 Open LINENPROCESS.MPJ.
2 Choose Calc ➤ Calculator.
3 In Store result in variable, enter TearsMat.
4 In Expression, enter 'Tears' / 'Material'. Click OK.
5 Press c+E and repeat the calculations for each defect type.

Create a bar chart of the data to compare the defect rates.


1 Choose Graph ➤ Bar Chart.
2 Under Bars represent, choose A function of a variable.
3 Under Multiple Y’s, choose Simple. Click OK.
4 In Function, choose Sum.
5 In Graph variables, enter TearsMat PullsMat SkipsMat PuncturesMat OtherMat.
6 Click OK.

The graph indicates that pulls and


punctures have the greatest defect
rates.

Create a time series plot to examine the defect rates over time.
1 Choose Graph ➤ Time Series Plot.
2 Choose Simple. Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 16
3 In Variables, enter TearsMat PullsMat SkipsMat PuncturesMat OtherMat.
4 Click Multiple Graphs, choose In separate panels of the same page. Click OK.
5 Click Time/Scale, then choose Stamp.
6 In Stamp columns, enter Date.
7 Click OK in each dialog box.

Tears show a possible cyclic


effect. The number of pulls has
increased over the last year. In
early 1999, the number of skips
decreased. Punctures and other
defects appear to be random.

Create a scatterplot to see whether the defect count and the amount of material run are related.
1 Choose Graph ➤ Scatterplot.
2 Choose Groups, then click OK.
3 Under Y variables, enter Tears, Pulls, Skips, Punctures, and Other. Under X variables, enter
Material in all five rows.
4 Check X-Y pairs form groups.
5 Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 17
The graph indicates that little or
no relationship exists between
the number of meters of material
and the frequency of defects.
This is unusual, because the area
of opportunity for defects is
larger when more material is run.
A possible explanation is that the
amount of variation in the defect
counts is so large that it obscures
this relationship. For example,
the number of pulls varies
greatly, which the previous time
series plots show is mostly time-
related. This time effect is likely
larger than the variation induced
by running different quantities of
material.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 18
Exercise H Call Center Data (4-35)
1 Open CALLCENTER.MPJ.
2 Choose Stat ➤ Basic Statistics ➤ Display Descriptive Statistics.
3 In Variables, enter OnHold Talking.
4 Click Statistics. Check Mean, First Quartile, Median, Third Quartile, Minimum, Maximum,
N nonmissing, and N missing.
5 Click OK in each dialog box.

Descriptive Statistics: OnHold, Talking


Variable N N* Mean Minimum Q1 Median Q3 Maximum
OnHold 466 0 6.736 1.000 4.000 6.000 9.000 25.000
Talking 451 15 28.324 1.000 14.000 24.000 38.000 121.000

The results indicate that:


• The average number of calls received per day is 466/20 = 23.3.
• The average amount of time a customer spends holding is 6.7 minutes.
• The minimum hold time is 1 minute and the maximum hold time is 25 minutes.
• Fifty percent of the calls are on hold for 6 minutes or less.
• The average time spent talking with an operator is 28.3 minutes.
Create new variables based on hold time.
1 Name columns C4, C5, and C6:
• Label C4 'Less than 5'
• Label C5 'Between 5 and 10'
• Label C6 'More than 10'
2 Choose Calc ➤ Calculator.
3 In Store result in variable, enter C4.
4 In Expression, enter 'OnHold' < 5. Click OK.
5 Repeat steps 2–4 for C5, using 'OnHold' >= 5 and 'OnHold' <=10.
6 Repeat steps 2–4 for C6, using 'OnHold' >10.

Determine the number and percentage of calls that fall into each hold time category.
1 Choose Stat ➤ Tables ➤ Tally Individual Variables.
2 In Variables, enter C4, C5, and C6.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 19
3 Under Display, check Counts and Percents. Click OK.

Tally for Discrete Variables: Less than 5, Between 5 and 10, More
than 10
Less Between More
than 5 and than
5 Count Percent 10 Count Percent 10 Count Percent
0 325 69.74 0 209 44.85 0 398 85.41
1 141 30.26 1 257 55.15 1 68 14.59
N= 466 N= 466 N= 466

The results indicate that the percentage of customers who hold for less than 5 minutes is 30.26%.
The percentage of customers who hold for 5 to 10 minutes is 55.15%. The percentage of
customers who hold for more than 10 minutes is 14.59%
Determine the percentage of calls resolved within 30 minutes.
1 Choose Calc ➤ Calculator.
2 In Store result in variable, enter 'Total below 30'.
3 In Expression, enter ('OnHold' + 'Talking') < 30. Click OK.
4 Choose Stat ➤ Tables ➤ Tally Individual Variables.
5 In Variables, enter 'Total below 30'. Ensure that Counts and Percents are checked.
6 Click OK.

The percentage of calls resolved within 30 Tally for Discrete Variables: Total below 30
minutes is 45.68%. Total
below
The percentage of the calls where 30 Count Percent
customers hang up is 15/466 = 3%. 0 245 54.32
1 206 45.68
N= 451
*= 15

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 20
Exercise I Online Application (4-36)
1 Open ONLINEAPP.MPJ.
2 Choose Stat ➤ Basic Statistics ➤ Display Descriptive Statistics.
3 In Variables, enter Time.
4 Click Statistics. Check Mean, First Quartile, Median, Third Quartile, Minimum, Maximum, N
nonmissing, and N missing.
5 Click OK in each dialog box.

Descriptive Statistics: Time


Variable N N* Mean Minimum Q1 Median Q3 Maximum
Time 375 92 36.091 20.000 30.000 35.000 40.000 95.000

The results indicate that:


• The average amount of time to complete the application is 36.091 minutes.
• The maximum time to complete the application is 95 minutes; the minimum, 20 minutes.
• The median time to complete the application is 35 minutes.
• One quarter (Q1 or first quartile) of customers complete the application within 30 minutes.
• The percentage of customers who start the application process, but do not complete it, is N*/
N + N*= 92/467 = 19.7%.
Calculate the actual number of applications attempted each day and the average number
completed each day.
1 Choose Stat ➤ Tables ➤ Tally Individual Variables.
2 In Variable, enter Date. Under Display, check Count.
3 Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 21
The results indicate the number of Tally for Discrete Variables: Date
applications attempted each day. Date Count
10/5/2001 15
The study was carried out over 35 days 10/6/2001 20
during which customers attempted 467 10/7/2001 17
10/8/2001 10
applications. The average applications per 10/9/2001 19
day is 467/35 =13.34. 10/10/2001 6
10/11/2001 16
10/12/2001 11
10/13/2001 18
10/14/2001 9
10/15/2001 12
10/16/2001 13
10/17/2001 11
10/18/2001 7
10/19/2001 14
10/20/2001 19
10/21/2001 20
10/22/2001 9
10/23/2001 15
10/24/2001 20
10/25/2001 18
10/26/2001 7
10/27/2001 15
10/28/2001 13
10/29/2001 9
10/30/2001 13
10/31/2001 11
11/1/2001 14
11/2/2001 10
11/3/2001 8
11/4/2001 12
11/5/2001 14
11/6/2001 11
11/7/2001 16
11/8/2001 15
N= 467

Create a bar chart to display the number of applications attempted each day.
1 Choose Graph ➤ Bar Chart. In Bar represent, choose Counts of unique values. Choose
Simple and click OK.
2 In Categorical variables, enter Date.
3 Click Data View. Under Data Display, uncheck Bars and check Symbols and Connect line.
4 Click OK in each dialog box.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 22
The chart shows the number of
applications customers attempted
each day. The greatest number of
applications were attempted on
October 5, 21 and 24. The fewest
number of applications were
attempted on October 10.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 23
Exercise J Restaurant Complaints (5-51)
1 Open RESTCOMPLAINT.XLS.
2 Choose Stat ➤ Tables ➤ Tally Individual Variables.
3 In Variables, enter Complaint Shift 'Cook Staff' Manager. Click OK.

Tally for Discrete Variables: Complaint, Shift, Cook Staff, Manager


Cook
Complaint Count Shift Count Staff Count
Food not cooked to order 9 Dinner 43 A 22
Poor food quality 6 Lunch 11 B 32
Slow service 23 N= 54 N= 54
Too expensive 2
Wrong order 12
Wrong Order 2
N= 54

Manager Count
John 25
Robert 29
N= 54

Be careful when summarizing data by one variable at a time with many factors; you can miss an
important interaction. For example, perhaps when cooking staff B works the morning shift, they
make mistakes; but, when working the afternoon shift, they do not make mistakes.
Notice that the complaint code for incorrect orders appears in two forms: "Wrong order" versus
"Wrong Order." Merge them using the Code function in MINITAB:
1 Choose Data ➤ Code ➤ Text to Text.
2 In Code data from columns, enter Complaint.
3 In Into columns, enter Complaint.
4 Under Original values, enter 'Wrong Order'; under New, enter 'Wrong order' (including the
quotation marks).
5 Click OK.
Note You could also use MINITAB’s Find and Replace feature in Editor > Replace to change the text values.

Create a Pareto charts of the complaints.


1 Choose Stat ➤ Quality Tools ➤ Pareto Chart.
2 In Chart defects data in, enter Complaint.
3 In Combine defects after the first, enter 99.9, then click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 24
The three biggest complaints are
slow service, wrong order, and
food not cooked to order. Slow
service and wrong order are
typically attributed to the wait
staff, while food not cooked to
order is usually a kitchen
problem. However, all three
complaints might be kitchen
problems.

Create a Pareto chart for each shift.


1 Choose Stat ➤ Quality Tools ➤ Pareto Chart.
2 In Chart defects data in, enter Complaints.
3 In By variable in, enter Shift.
4 Click OK.

The graph indicates that most of


the complaints come at dinner,
but no complaints about cost
come at lunch.

Create a Pareto chart for each cook staff.


1 Choose Stat ➤ Quality Tools ➤ Pareto Chart.
2 In Chart defects data in, enter Complaints.
3 In By variable in, enter 'Cook Staff'.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 25
4 Click OK.

More complaints are made about


cooking staff B. All the complaints
about poor food quality and almost
all those about food not cooked to
order are about staff B.
Do not look at the raw counts and
decide that the difference in the
number of complaints is due entirely
to the cook staff. One cooking staff
may work more dinners or longer
shifts than the other staff.

Create a Pareto chart for each manager.


1 Choose Stat ➤ Quality Tools ➤ Pareto Chart.
2 In Chart defects data in, enter Complaints.
3 In By variable in, enter Manager.
4 Click OK.

Most of the slow-service complaints


occurred under Robert’s supervision.
Also, most of the wrong orders
occurred under John’s supervision.

To look at the total number of complaints versus all the variables (Shift, Cook Staff, and
Manager), use the cross-tabulation function. Shown below is one of many ways to display this
count data in a table:

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 26
1 Choose Stat ➤ Tables ➤ Cross Tabulation and Chi-Square.
2 In For rows, enter Manager 'Cook Staff'.
3 In For columns, enter Shift.
4 Click OK.

This table shows the number of complaints Tabulated statistics: Manager, Cook Staff, Shift
for each combination of the variables shift, Rows: Manager / Cook Staff Columns: Shift
manager, and cook staff.
Dinner Lunch All
The combination of variables that produce John
A 5 4 9
the most complaints are: B 12 4 16
• Dinner with Robert as manager Robert
A 12 1 13
(complaints do not vary by Cook Staff) B 14 2 16
All
• Dinner with John as manager and Cook All 43 11 54
Staff B
Cell Contents: Count

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 27
Exercise K Sulfur Dioxide Levels (5-52)
Often, you can overlook important relationships or make improper generalizations when
response data is assessed one by-variable at a time. When possible, you should account for all the
input variables simultaneously when plotting the response data. However, you cannot always do
this or you may find it difficult to do, depending on the number of variables and the quantity of
data.
In this exercise, you can create a single plot that succinctly summarizes the important features in
the response data.

Import the Data


1 Choose File ➤ Open Worksheet.
2 In Files of type, choose Excel(*.xls).
3 Choose SULFURDIOXIDE.XLS.
4 Click Options, then check Ignore blank data rows. Click OK.
5 Click Preview.

Scroll over to view cell C7, the Time 4 column. Though the column is mostly numeric data, the
first observation is text. The experimenters typed "missed" when they did not obtain a scheduled
reading. You can change the way MINITAB reads this column by selecting the proper data type
(Numeric) from the pull-down menu.
6 Under C7, in Text, choose Numeric.
7 Click OK.
8 Click Open.
Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 28
To generate a graph that shows the within-batch time trend, between-run variation, and tank-to-
tank variation, you must stack the data. Stacking keeps the within-tank data together for each
production run (date).
1 Choose Data ➤ Stack ➤ Rows.
2 In Rows to be stacked are in the following columns, enter 'Time 1' 'Time 2' 'Time 3' 'Time 4'
(including the quotation marks).
3 In Store stacked data in, type SO2.
4 Check Store column subscripts in, then type Time.
5 Check Expand the following columns while stacking rows, then enter Date Tank.
6 In Store the expanded columns in, enter Day Tnk. Click OK.

The data are structured for analysis now, but you need to code the time data before you plot
them, because text variables cannot be used on the x-axis of a scatterplot.
1 Choose Data ➤ Code ➤ Text to Numeric.
2 In Code data from columns, enter Time.
3 In Into columns, enter Time.
4 Under Original values, row 1, enter 'Time 1'. Under New, enter 1.
5 Repeat for 'Time 2', 'Time 3', and 'Time 4', naming them 2, 3, and 4.
6 Click OK.

Create a scatterplot to examine the relationship among SO2 levels and time for each tank.
1 Choose Graph ➤ Scatterplot.
2 Choose With Connect and Groups. Click OK.
3 Under Y variables, enter SO2; under X variables, enter Time.
4 Under Categorical variables for grouping, enter Day.
Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 29
5 Click Multiple Graphs, then click the By Variables tab.
6 Under By variables with groups in separate panels, enter Tnk.
7 Click OK in each dialog box.

For each tank, the SO2 levels


decrease over time within a
production run (date).
For tanks H10, H11, and H9, the
decline in SO2 levels over the
production time is much more
dramatic than for the other tanks.
The production run-to-production
run (date-to-date) variation seems to
have a much greater impact on H10,
H11, and H9 than on the other tanks.

You should further investigate the tank-to-tank performance variations to better understand the
process.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 30
Exercise L Investigating Downtime Distributions (5-53)
1 Open DOWNTIME.MPJ.
2 Choose Data ➤ Split Worksheet.
3 In By variables, enter Reason. Click OK.

Analyze the data for changeovers:


1 Make the Change Over worksheet active.
2 Choose Graph ➤ Dotplot.
3 Under One Y, choose With Groups. Click OK.
4 In Graph variables, enter Downtime; in Categorical variables for grouping, enter Shift.
5 Click OK.

The graph shows that downtime


spikes at 5-minute intervals: 25, 30,
and 35 minutes. These spikes might
mean that the person recording and
entering the data rounded to the
nearest 5 minutes.

Analyze the data for fixture alignment:


1 Make the Fixture Alignment worksheet active.
2 Choose Graph ➤ Dotplot.
3 Under One Y, choose Simple. Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 31
4 In Graph Variables, enter Downtime, then click OK.

Two different distributions appear


for downtime due to fixture
alignments. One is centered around
20 minutes and one around 30.
Recall from the example that line A
had more overall downtime from
fixture alignments than line B.
Therefore, the two distributions
might be due to the lines.

Create a dotplot by line for the fixture alignment data:


1 Choose Editor ➤ Panel.
2 In By variables with groups in separate panels, enter Line.
3 Click OK.

The graph shows that line A has


more overall downtime occurrences
but each occurrence lasts for less
time compared to line B. The
workers on line A may be resolving
fixture alignment problems faster or
they may be just rushing because
they have more downtime
occurrences to resolve.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 32
Exercise M Production Downtime (5-54)
1 Choose File ➤ New ➤ Minitab Project.
2 Choose File ➤ Query Database (ODBC).
3 Click the Machine Data Source tab, then choose MS Access. Click OK.
4 If the Login appears, click OK.
5 Choose PRODUCTION.MDB, then click OK.
6 Move all variables from Available fields to Selected fields.
7 Click Use rows.
8 In Use rows, enter where 'Line' = 'A'.
Tip The syntax used in this dialog box is determined by the database being queried. Be sure to use single quotes
around the line indicators.

9 Click OK in each dialog box.

Your worksheet should look like the one above.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 33
Exercise N Bond Strength (6-87)
1 Open BONDSTRENGTH.MPJ.
2 Select the Month column. Right-click and choose Column ➤ Value Order.
3 Choose User-specified order. Ensure that the order Three, Six, Nine appears in Define an order.
4 Click OK.

Create an individual value plot to examine the data:


1 Choose Graph ➤ Individual Value Plot.
2 Under One Y, choose With Groups.
3 In Graph variables, enter 'Pull Force'. In Categorical variables for grouping, enter Formulation
Month.
4 Click OK.

Notice that the month is the


innermost category on the x-axis,
making it easy to identify the month
effect within each formulation.
Only 1+1B and 2+2B consistently
exceed the strength requirement. No
differences in strength appear across
the nine months for either
formulation.
The strength of the SS and XF
formulations decline in each three-
month period. Apparently, the SS
and XF also have more variability in
strength than the other formulations.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 34
Exercise O Electronic Component Reliability (6-88)
1 Open RELIABILITY.MPJ.
2 Choose Data ➤ Stack ➤ Rows.
3 In Rows to be stacked are in the following columns, enter C4 C5 C6 C7.
4 In Store stacked data in, type Cycles.
5 Check Expand the following columns while stacking rows, then enter Temperature Humidity
Type.
6 In Store the expanded columns in, type Temp Humid Component. Click OK.

Create a three-dimensional scatterplot to examine the data:


7 Choose Graph ➤ 3D Scatterplot.
8 Choose With Groups, then click OK.
9 In Z variable, enter Cycles; in Y variable, enter Temp; in X variable, enter Humid.
10 In Categorical variables for grouping, enter Component. Click OK.

S9 appears to be the worst


component type because its failures
occur with fewer cycles. The
components II-V9 and II-V10 seem
to be about equal.
Use the 3D Graph Tools toolbar to
rotate the graph. By rotating the
graph, the increased temperature
seem to cause more failures; but,
changing humidity does not appear
to have an effect.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 35
Exercise P Project Prediction (6-89)
1 Open FORECAST.MPJ.
2 Select the Amount of Experience column. Right-click and choose Column ➤ Value Order.
3 Choose User-specified order. Ensure that the order Less than 3 years, 3–5 years, More than 5
years appears in Define an order.
4 Click OK.
5 Repeat steps 2–4 for the Type of Experience column. Ensure that the order is Small Project,
Large Project.

Generate descriptive statistics to examine the data:


6 Choose Stat ➤ Tables ➤ Descriptive Statistics.
7 In For rows, enter 'Amount of Experience'. In For columns, enter 'Type of Experience'.
8 Click Associated Variables. In Associated variables, enter Forecast. Under Display, check
Means.
9 Click OK in each dialog box.

Tabulated statistics: Amount of Experience, Type of Experience


Rows: Amount of Experience Columns: Type of Experience

Small Large
Project Project All

Less than 3 Years 220.0 246.7 233.3


3 3 6

3-5 Years 208.3 250.0 229.2


3 3 6

More than 5 Years 251.7 296.7 274.2


3 3 6

All 226.7 264.4 245.6


9 9 18

Cell Contents: Forecast : Mean


Count

Programmers with more than five years’ experience on large projects came closest to estimating
the correct number of days for the project. In general, programmers with experience on larger
projects had closer average forecasts.
1 Choose Graph ➤ Individual Value Plot.
2 Under One Y, choose With Groups. Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 36
3 In Graph variables, enter Forecast; in Categorical variables for grouping, enter 'Amount of
Experience' and 'Type of Experience'.
4 Click Data View, then choose Mean symbol. Click OK.
5 Click Scale, then click the Reference Lines tab.
6 In Show reference lines at Y, enter 315.
7 Click OK in each dialog box.

The best forecasts—including one at


exactly 315—occur for
programmers with more than five
years experience on large projects.
Those with fewer than five years on
large projects and those with more
than five years on small projects,
performed about the same.
Programmers with less than 5 years
of small project experience provided
the worst forecasts.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 37
Exercise Q Belt Wear (6-90)
1 Open BELTWEAR.MPJ.
2 Choose Graph ➤ Individual Value Plot.
3 Under One Y, choose With Groups. Click OK.
4 In Graph variables, enter Hours; in Categorical variables for grouping, enter 'Belt Type'.
5 Click OK.

No major differences appear among


the four belt types, other than a few
low points for TR-33A and a few
high points for TY-91X. These two
belt types have greater variability
than the other two. However, this
plot does not account for the speed,
so it could be misleading.

Create a scatterplot to examine the relationship between speed and belt life, for each belt type.
1 Choose Graph ➤ Scatterplot.
2 Choose With Groups, then click OK.
3 In Y variables, enter Hours; in X variables, enter Speed, in Categorical variables for grouping,
enter 'Belt Type'.
4 Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 38
For all belt types, the linear
relationship between hours and
speed is negative. The individual
value plot indicated that the four belt
types have about the same average
life span, not considering speed.
However, the scatterplot shows that
both the TR-33A and TY-91A were
used at lower average speeds in the
field than the other two belt types.
The A belts saw application speeds
in the 2200 to 3000 range, while the
X belts saw speeds from about 2700
to over 3600.
You can compare all 4 belt types
where the average speed is the same
for all. For example, in the 2800 to
3000 range, the X belts appear to last
longer than the A belts.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 39
Exercise R Healthcare Expenditures (6-91)
1 Open HEALTHCARE.MPJ.
2 Choose Graph ➤ Scatterplot.
3 Choose With Groups, then click OK.
4 Under Y variables, enter 'Hos Exp' in the first four rows.
5 Under X variables, enter 'Pop Dens' in row 1, 'Birth Rate' in 2, 'Over65' in 3, and '% Poverty' in 4.
6 In Categorical variables for grouping, enter County.
7 Click Multiple Graphs. Under Show Pairs of Graph Variables, choose In separate panels of the
same graph.
8 Click OK in each dialog box.

The county of interest, indicated by


the red point, does not have a
particularly high population density,
but it does have a relatively high
birth rate.
The county has the second highest
percentage of people over 65 and the
highest percentage of poverty, which
could provide some justification for
why the county has the second
highest healthcare costs among the
23 counties.

You can also use a matrix plot for this analysis. In addition to the plots shown above, a matrix
plot may show whether any of the predictor variables are related to one another.
1 Choose Graph ➤ Matrix Plot.
2 Under Matrix of plots, choose With Groups. Click OK.
3 In Graph variables, enter 'Hos Exp' 'Pop Dens' 'Birth Rate' 'Over65' '% Poverty'.
4 In Categorical variables for grouping, enter County.
5 Click Matrix Options. Under Matrix Display, choose Lower left.
6 Click OK in each dialog box.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 40
Note that the first column of plots is
identical to the scatterplots you
created on the previous page.
The matrix plot indicates no strong
relationships between the predictors.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 41
Exercise S Call Center Staffing (6-92)
1 Open STAFFING.MPJ.
2 Choose Calc ➤ Make Patterned Data ➤ Simple Set of Date/Time Values.
3 In Store patterned data in, enter Date.
4 In Start Date, enter 1/1/2000; in End Date, enter 3/31/2000.
5 In Increment, enter Day; in by, enter 1.
6 In List each value, enter 2; in List the whole sequence, enter 1. Click OK.
7 Choose Calc ➤ Make Patterned Data ➤ Text Values.
8 In Store patterned data in, enter Shift.
9 In Text values, enter AM PM.
10 In List each value, enter 1; in List the whole sequence, enter 91. Click OK.

Use the calculator to create the response column of interest:


1 Choose Calc ➤ Calculator.
2 In Store result in variable, enter '30Sec%'.
3 Under Expression, enter '30Sec' / 'Calls' ∗100. Click OK.

Generate a graphical summary of this new response column:


1 Choose Stat ➤ Basic Stats ➤ Graphical Summary.
2 In Variables, enter '30Sec%'. Click OK.

The histogram indicates a bimodal


pattern in the data. The two peaks
could be related to the two shifts.
Perhaps one shift is understaffed, or
is not as efficient as the other shift.
Also, note the natural boundary in
the data at 100%.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 42
Generate a boxplot to see whether the shifts perform differently:
1 Choose Graph ➤ Boxplot.
2 Under One Y, choose With Groups. Click OK.
3 In Graph variables, enter '30Sec%'. In Categorical variables for grouping, enter Shift.
4 Click OK.

Apparently, the two shifts do not


differ enough to have caused the
bimodal pattern in the histogram.

Generate time series plots of the variables 30Sec%, Calls, and People.
1 Choose Graph ➤ Time Series Plot.
2 Under One Y, choose Simple. Click OK.
3 In Series, enter '30Sec%' Calls People.
4 Click Time/Scale.
5 Under Time Scale, choose Stamp. In Stamp columns, enter Date. Click OK.
6 Click Multiple Graphs. Under Show Graph Variables, choose In separate panels of the same
graph.
7 Click OK in each dialog box.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 43
Notice that the time series plot of
30Sec% shows an increasing trend,
then levels off. The bimodal pattern
seems to be associated with time.
Call volumes decreased over time,
while staffing levels increased about
halfway the three-month period. As
a result, in January and part of
February, the center ran at high call
volumes with low staffing levels. In
late February and through March,
the center ran at lower call volumes
with higher staffing levels.

One way to determine what staffing levels are needed to answer 75% of calls within 30 seconds
is to normalize the call volume and staffing, because they vary so much throughout the study. In
other words, determine how the number of calls per person (workload) affects the percent of
calls answered in 30 seconds. Then, you can determine the correct workload for each person to
ensure that 75% of calls are answered within 30 seconds.
Create a column named Calls/Person using the calculator:
1 Choose Calc ➤ Calculator.
2 In Store result in variable, enter 'Calls/Person'.
3 In Expression, enter 'Calls' / 'People'. Click OK.

Create a scatterplot that shows the relationship between 30Sec% and Calls/Person.
1 Choose Graph ➤ Scatterplot.
2 Choose Simple, then click OK.
3 In Y variables, enter '30Sec%'. In X variables, enter 'Calls/Person'. Click OK.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 44
From this scatterplot, you can see
that, as the number of calls per
person goes up, the percent of calls
answered in 30 seconds decreases.
To ensure that the call center
operates above the 75% goal,
enough people should be working so
that, on any given day, the workload
does not exceed around 60 calls per
person.

Exercise Solutions Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 45
Suggested Reading

Suggested Reading
A. Agresti (1996). An Introduction to Categorical Data J. Devore (1999). Probability & Statistics for Engineering
Analysis. John Wiley & Sons, Inc. and the Sciences, 5th edition. Brooks/Cole Publishing Co.
AIAG Measurement System Analysis (MSA) Reference N. Draper and H. Smith (1998). Applied Regression Analysis,
Manual, 3rd edition (2002). Automotive Industry Action 3rd edition. John Wiley & Sons, Inc.
Group.
D. Gorman and K. Bower (2002). “Measurement System
AIAG Statistical Process Control (SPC) Reference Manual Analysis and Destructive Tests,” Six Sigma Forum, 1-4,
(1995). Automotive Industry Action Group. August. The American Society for Quality (ASQ). Also
reproduced on the Minitab Web site.
G.E.P. Box and N. Draper (1987). Empirical Model-Building
and Response Surfaces. John Wiley & Sons, Inc. G. Hahn and W. Meeker (1991). Statistical Intervals: A
Guide for Practitioners. John Wiley & Sons, Inc.
G.E.P. Box, W.G. Hunter, and J.S. Hunter (1978). Statistics
for Experimenters. John Wiley & Sons, Inc. A. Khuri and J. Cornell (1996). Response Surfaces: Designs
& Analyses, 2nd edition. Marcel Dekker Inc.
F.W. Breyfogle III (1999). Implementing Six Sigma: Smarter
Solutions Using Statistical Methods. John Wiley & Sons, Inc. J. Lawless (1982). Statistical Models and Methods for
Lifetime Data. John Wiley & Sons, Inc.
S. Chatterjee, A. Hardi, and B. Price (1999). Regression
Analysis by Example, 3rd edition. John Wiley & Sons, Inc. T. Lorenzen and V. Anderson (1993). Design of Experiments:
A No-Name Approach. Marcel Dekker Inc.
J. Cornell (1990). Experiments with Mixtures: Designs,
Models, and the Analysis of Mixture Data, 2nd edition. John W. Meeker and L. Escobar (1998). Statistical Methods for
Wiley & Sons, Inc. Reliability Data. John Wiley & Sons, Inc.
A. Dean and D. Voss (1999). Design and Analysis of D.C. Montgomery (2000). Introduction to Statistical Quality
Experiments. John Wiley & Sons, Inc. Control, 4th edition. John Wiley & Sons, Inc.
R. Devor, T. Chang, and J. Sutherland (1992). Statistical D.C. Montgomery (2001). Design and Analysis of
Quality Design and Control: Contemporary Concepts and Experiments, 5th edition. John Wiley & Sons, Inc.
Methods. Prentice Hall.

Introduction to MINITAB Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 1
Suggested Reading
D.C. Montgomery, A. Peck, and G. Vinning (2001). R. O’Connor (1996). Practical Reliability Engineering,
Introduction to Linear Regression Analysis, 3rd edition. John 3rd edition. John Wiley & Sons, Inc.
Wiley & Sons, Inc.
J. Petruccelli, B. Nandram, and M. Chen (1999). Applied
R.H. Myers (2000). Classical and Modern Regression with Statistics for Engineers & Scientists. Prentice Hall.
Application, 2nd edition. PWS-KENT Publishing Co.
R. Ryan (2000). Statistical Methods for Quality
R.H. Myers and D.C. Montgomery (1995). Response Surface Improvement, 2nd edition. John Wiley & Sons, Inc.
Methodology: Process and Product Optimization Using
Designed Experiments. John Wiley & Sons, Inc. W.A. Shewhart (1990). Statistical Method from the Viewpoint
of Quality Control. Dover Publications, Inc.
I. Miller and M. Miller (1994). Statistical Methods for
Quality with Applications to Engineering & Management. P. Tobias and D. Trindade (1995). Applied Reliability,
Prentice Hall. 2nd edition. Van Nostrand Reinhold.

W. Nelson (1982). Applied Life Data Analysis. John Wiley & S. Vardeman (1993). Statistics for Engineering Problem
Sons, Inc. Solving. PWS Publishing Co.

J. Neter, M. Kutner, C. Nachtsheim, and W. Wasserman For a list of books and periodicals that feature Minitab, visit:
(1996). Applied Linear Statistical Models, 4th edition. Irwin. http://www.minitab.com/resources/ctl/index.asp

Introduction to MINITAB Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 2
Index

Index

Numerics column descriptions 1-13 grouping variable 5-48


columns 1-12 interpreting 5-49
3D Graph Tools 6-48 confidence intervals 1-28
3D scatterplot 6-47 creating Exec files 5-16
creating reports with
E
ReportPad 1-35 editing a graph 2-13
A word processor 1-37
Excel files 5-4, 5-21, 6-34, 6-54, 6-75
Access files 5-40 Exec files
active worksheet 1-11 D creating 5-16
running 5-17
Autofill 2-9, 2-17
data entry Extract from Date/Time to Text 3-13, 6-6, 6-19
arrow 1-12
B Autofill 2-9
Make Patterned Data 3-4, 3-12 F
Backspace key 1-24 data types 1-12
file types 5-5
bar chart Data window 1-4, 1-5
creating 2-10, 2-18, 5-45 data/time data 1-12
panel 5-46
stacking 2-20
Delete key 1-24 G
descriptions
boxplot 4-8, 6-21 column 1-13 Gage R&R 3-3
brushing data 6-14, 6-39, 6-65 project 1-13 Graph folder 1-7
worksheet 1-13 Graph window 1-5
descriptive statistics 1-30 Graphical Summary 1-27, 5-29
C Tables 6-49, 6-77
graphs
Display Descriptive Statistics 1-29, 4-6, 4-13, 3D Graph Tools 6-48
Calculator 4-4, 4-17, 5-7, 5-31, 6-45
4-22, 5-9, 6-23 brushing 6-14, 6-39, 6-65
clearing cells 1-24 by variable 1-29, 4-28 displaying 1-16
coding data 5-12, 6-81 dotplot 1-22, 4-11, 5-48 edit scale 4-26, 6-12

Introduction to MINITAB Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 3
Index
editing a selected item 2-13 M toolbar 1-7
editing tools 1-40 projects 1-5
folder 1-16 macros descriptions 1-13
grouping variables 5-48, 6-29, 6-59, 6-79 creating Exec files 5-16 environment 1-4
layout 1-33 running an Exec file 5-17 open 1-9, 1-21
panel 4-16, 4-28, 5-46, 6-61 windows 1-5
Make Patterned Data 3-4, 3-12
pasting 1-37
reference lines 2-22 matrix plot 6-37
Minitab Inc. 1-3
selecting a graph item 2-13
history 1-3
R
tile 1-10, 1-16
time stamp 6-12 products 1-3 random sample 3-22
updating 1-25 Minitab Quality Companion 1-3 randomizing data 3-7
missing value symbol 1-24 recoding data 5-12, 6-81
Related Documents folder 1-7, 1-18
H
N ReportPad 1-7, 1-17, 1-35
histogram 1-28, 4-24, 5-11 editing 1-36
panel 4-28 navigating Minitab 1-6 exporting 1-37
History folder 1-7 numeric data 1-12 reports 1-37
word processor 1-37
running an Exec file 5-17
I O
importing data from Access 5-39 ODBC 5-39 S
importing data from Excel 5-4, 5-21, 6-33, 6-54, open project 1-9, 1-21
6-75 scatterplot 6-27, 6-57
opening an Access file 5-40
individual value plot 6-63, 6-67, 6-79, 6-84 3D 6-47
opening an Excel file 5-5, 5-22, 6-33, 6-55, 6-76 grouping variable 6-29, 6-59
grouping variable 6-79 ordering data 6-20, 6-83 panel 6-61
Session folder 1-7, 1-15
L P Session window 1-4, 1-5
shortcut keys 1-39
layout tool 1-33 Pareto chart 2-5, 5-14, 5-43 Show Info folder 1-7, 1-14
links pasting graphs 1-37 sorting data 3-8, 5-27
to documents 1-18 Split Worksheet 3-20, 6-71
Project Manager
to web pages 1-18
overview 1-7 stacking data 5-25

Introduction to MINITAB Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 4
Index
StatGuide 1-31
stratified random sample 3-24
subsetting a worksheet 6-5

T
Tally Individual Variables 4-18, 5-33, 6-69
text data 1-12
time series plot 4-15, 5-35, 6-10
date stamp 6-12
panel 4-16
tool tip 1-11

U
updating a graph 1-25, 1-26

V
Value Order 6-20, 6-83

W
worksheet
active 1-11
descriptions 1-13
folder 1-7, 1-11
splitting 6-71
subsetting 6-5

Introduction to MINITAB Copyright Minitab Inc. 2003. All rights reserved. Rel14 Ver 1.0 TRMEM14W.INTRO 5

Вам также может понравиться