Академический Документы
Профессиональный Документы
Культура Документы
TOR-2014-02163
May 8, 2014
Roland Duphily
Acquisition Risk and Reliability Engineering Department
Systems Engineering Division
Prepared for:
National Reconnaissance Office
14675 Lee Road
Chantilly, VA 20151-1715
Developed in conjunction with Government and Industry contributions as part of the U.S. Space Program Mission Assurance
Improvement Workshop.
Public reporting burden for the collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering and
maintaining the data needed, and completing and reviewing the collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information,
including suggestions for reducing this burden, to Washington Headquarters Services, Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, Arlington
VA 22202-4302. Respondents should be aware that notwithstanding any other provision of law, no person shall be subject to a penalty for failing to comply with a collection of information if it
does not display a currently valid OMB control number.
16. SECURITY CLASSIFICATION OF: 17. LIMITATION 18. NUMBER 19a. NAME OF
OF ABSTRACT OF PAGES RESPONSIBLE PERSON
a. REPORT b. ABSTRACT c. THIS PAGE
UU 42
unclassified unclassified unclassified
This presentation provides a summary overview of TOR-2014-02202, “Root Cause Investigation Best Practices
Guide”, which was produced as part of the 2014 Mission Assurance Improvement Workshop.
This document was created by multiple authors throughout the government and the aerospace industry. For their
content contributions, we thank the following contributing authors for making this collaborative effort possible:
A special thank you to Harold Harder, The Boeing Company, for co-leading this team, and to Andrew King, The
Boeing Company, for sponsoring this team; your efforts to ensure the completeness and quality of this document are
appreciated.
i
Acknowledgments (Cont)
The Topic Team would like to acknowledge the contributions and feedback from the following subject matter experts
(SMEs) who reviewed the document:
ii
Root Cause Investigation (RCI)
Best Practices Guide
Overview
Harold Harder, The Boeing Company
Roland Duphily, The Aerospace Corporation
May 21, 2014
This presentation provides a summary overview of TOR-2014-02202, “Root Cause Investigation Best
Practices Guide”, which was produced as part of the 2014 Mission Assurance Improvement Workshop.
1
RCI Overview Agenda
• Motivation for RCI Guide
• Example
• Product Overview
• Topic Details
• Product Implementation Recommendations
• Topic Follow-on Recommendations
• Team Membership and Recognition
• Some root cause failure investigations failed to identify true root cause
– Unfortunately, failure recurred after corrective actions were implemented for
what was believed to be the root cause
• Projects and teams may lack leadership and guidance documents on
performance of root cause analysis (RCA)
• Variability in RCA techniques used can result in ineffective or inefficient
root cause investigations
• For the National Security Space community, no recognized RCI best
practice exists
Overview of basis for RCAs, definitions and Sec 2, Sec 3, Sec 8, Sec 5.3
terminology, commonly used techniques, needed
skills/experience
Key early actions to take following anomaly/failure Sec 5.0
Data/information collection approaches Sec 6.0
Structured RCA approaches – pros/cons Sec 8, Table 11
Survey/review of available RCA tools Sec 9.0
Guidance on criteria for determining when a RCA is Sec 10.0
sufficient (when do you stop)
RCA of on orbit vs. on ground anomalies Sec 11.0
Handling of root cause unknown and unverified Sec 12.0
failures
6. Collect 9. Return to
7. 8. Root 10. 11. Validate
& Corrective Step #6 if
Problem Cause Implement Actions
Classify Action Problem
Definition Analysis & Verify Effective
Data Plan Recurs
• Focus is on Root Cause Analysis Methods and Tools utilized by the RCI
Space Systems Team and companies
– No magic methods or tools; identified common issues and facilitation techniques
– Many ground failure RCI’s validate true root cause quickly (hardware available)
– A few complex anomalies benefit from a combination of items in the RCI guide
8.4 .4 Fault Tree Analysis (FTA) ......................... ........... ............ ......................... ..35
8.5 Process Flow Style ..... ................................. ......................................................38
9. Root Cause Analysis Tools (Software Package Survey) ........ ... ......... ... ............ ......... ..40
, U.S. Space Program
l' Mission Assuranc~ U.S. SPACE PROGRAM MAIW | DULLES, VA | MAY 7 - 8, 2014
~
. ..:1' Improvement
15
.., -~• Workshop
RCI Outline – 3 of 3
9.1 Surveyed Candidates .. _... . _.. .._._ .... _.... _._.... _.... _.. .. _._.... _.... _._... . _.. .. _._.. _._.... _...._._....41
9.2 Rea lity Charting (Apollo ) ..... ..... .. .......... ............ ... ........ ........ .......... .. ........ .. ... ......41
9.3 TapRooT ..... ...... ......... ..... .... ..... .......... ........ ...... ... ............ ..... ... .... ..... ....... ........ ...43
9.4 GoldFire ..................... ...... ...... ..... ...... ..... .... ... ..... ............. ..... .. .......... ...... ..... .... ... 46
9.5 RCAT (NASA Tool) ......... .... ..... ... ....... .......... .... ........ ............ .... ... ............ .......... .48
9.6 Think Reliability ....... ... ............. .... ........ ....... .... .... ... ....... .... ....... ..... .... ........ ...... .... 50
10. When is RCA Depth Sufficient ... ....... .. ............... ...... ... ......... ...... ..... ...................... ....... .52
10. 1 Prioritization Techn iques ..... ..... ... ....... ..... ............ ............... ...... .... ..... ............ ......54
10. 1.1 Risk Cube (Harder-need Clarifi cation) .. .... ........ ............ .... ....... ............. ......54
10.1.2 Solution Evaluati on Tem plate (Harder-need Clarification ) .. ..... .. ...... ...........55
11. RCA On-O rbit versus On-Grou nd ... ...... ...... ............. .. .... ...... .... ...... ..... ....... ....... ............ 55
12. RCA Unverified and Unknown Cause Fa ilures .. ...... ... ........... .... ..... ......... ................. ... .58
12. 1 Unverified Fa ilure (UVF) ... .. ....... ......................... ... ....... ...... ............ ......... ..........58
12.2 Unknown Direct/Root Cause Fa ilu re ..... .... .... ...... .... ...... ..... .............. ... ....... .... .... 61
13. RCA Pitfalls ... .. ....... ......... ...... ........ .... ..... ....... ....... ...... .... ............ ..... ... .... ..... ....... ....... .... 62
A .1 Type B Reaction Wheel Root Cause Case Study ........ ........ ......... ............... ......63
, U.S. Space Program
l' Mission Assuranc~ U.S. SPACE PROGRAM MAIW | DULLES, VA | MAY 7 - 8, 2014
~
. ..:1' Improvement
16
.., -~• Workshop
Reasons For Missing True Root Cause
• Rush to judgment
• Approach the problem from both right brain creative and left brain
logical perspectives
~ Low
v Lowest Risk
Items Level 2 RCA Level3 RCA
Level 1 RCA
Recurrence
Raytheon Space and Tom Reinsel Ball Aerospace & Mary D’Odine
Airborne Systems Technologies Corp
SSL Jim Loman
Eric Lau
Jacqueline M. Wyrwitzke, PRINC Russell E. Averill, GENERAL MANAGER Manuel De Ponte, SR VP NATL SYS
DIRECTOR SPACE BASED SURVEILLANCE NATIONAL SYSTEMS GROUP
MISSION ASSURANCE SUBDIVISION DIVISION
SYSTEMS ENGINEERING DIVISION SPACE PROGRAM OPERATIONS
ENGINEERING & TECHNOLOGY
GROUP
REPORT TITLE