FMEA
FMEA
FMEA
Introduction
Product development and operations managers can run a failure modes and effects analysis (FMEA) to analyze potential failure risks
within systems, classifying them according to severity and likelihood, based on past experience with similar products or processes. The object of
FMEA is to help design identified failures out of the system with the least cost in terms of time and money. FMEA defines the term “failure
mode” to identify defects or errors, potential or actual, in a product design or process, with emphasis on those affecting the customer or end
user. A “failure effect” is the result of a failure mode on the product or system function as perceived by the user. Failure effects can be described
in terms of what the end user may see or experience. The study of consequences of identified failures is called effects analysis. FMEA prioritizes
failures according to severity, frequency and detectability. Severity describes the seriousness of failure consequences. Frequency describes how
often failures can occur. Detectability refers to degree of difficulty in detecting failures. FMEA also involves documenting current knowledge
about failure risks. FMEA seeks to mitigate risk at all levels with resulting prioritized actions that prevent failures or at least reduce their severity
and/or probability of occurrence. It also defines and aids in selecting remedial activities that mitigate the impact and consequences of failures.
FMEA can be employed from the earliest design and conceptual stages onward through development and testing processes, into process control
during ongoing operations throughout the life of the product or system
The first FMEA step is to analyze functional requirements and their effects to identify all failure modes. Examples: warping, electrical short
circuit, oxidation, fracture. Failure modes in one component can induce them in others. List all failure modes per function in technical terms,
considering the ultimate effect(s) of each failure mode and noting the failure effect(s). Examples of failure effects include: overheating, noise,
abnormal shutdown, user injury.
Severity
Severity is the seriousness of failure consequences of failure effects. Usual practice rates failure effect severity (S) on a scale of one to 10 where
one is lowest severity and 10 is highest. The following table shows typical FMEA severity ratings and their meanings:
Rating Meaning
1 No effect, no danger
2 Very minor – usually noticed only by discriminating or very observant users
3 Minor – only minor part of the system affected; noticed by average users
4-6 Moderate – most users are inconvenienced and/or annoyed
7-8 High – loss of primary function; users are dissatisfied
9-10 Very high – hazardous. Product becomes inoperative, customers angered.
Failure constitutes a safety hazard and can cause injury or death.
Examine cause(s) of each failure mode and how often failure occurs. Look at similar processes or products and their documented failure modes.
All potential failure causes should be identified and documented in technical terms. Failure causes are often indicative of weaknesses in the
design.
Examples of causes include: incorrect algorithm, insufficient or excess voltage, operating environment too hot, cold, humid, etc. Failure modes
are assigned an occurrence ranking (O), again from one to 10, as shown in the following table.
Rating Meaning
1 No documented failures on similar products/processes
2-3 Low – relatively few failures
4-6 Moderate – some occasional failures
7-8 High – repeated failures
9-10 Very high – failure is almost certain
After remedial actions are determined, they should be tested for efficacy and efficiency. Also, the design should be verified and inspections
procedures specified.
1. Engineers inspect current system controls that prevent failure mode occurrence, or detect failures before they impact the user/customer.
These steps enable engineers to determine the likelihood of identifying or detecting failures. Then, each combination from steps one and two is
assigned a detection value (D), which indicates how likely it is that failures will be detected, and ranks the ability of identified actions to remedy
or remove defects or detect failures. The higher the value of D, the more likely the failure will not be detected.
Rating Meaning
1 Fault is certain to be caught by testing
2 Fault almost certain to be caught by testing
3 High probability that tests will catch fault
4-6 Moderate probability that tests will catch fault
7-8 Low probability that tests will catch fault
9-10 Fault will be passed undetected to user/customer
RPN should be calculated for the entire design and/or process and documented in the FMEA. Results should reveal the most problematic areas,
and the highest RPNs should get highest priority for corrective measures. These measures can include a variety of actions: new inspections, tests
or procedures, design changes, different components, added redundancy, modified limits, etc. Goals of corrective measures include, in order of
desirability:
When corrective measures are implemented, RPN is calculated again and the results documented in the FMEA.
10 Steps to do a Design FMES
STEP 1: Review the design
Use a blueprint or schematic of the design/product to identify each component and interface.
List each component in the FMEA table.
If it feels like the scope is too large, it probably is. This is a good time to break the Design Failure Mode and Effects Analysis into
more manageable chunks.
How potential
Identifying and assessing potential risks with process
failures of intended Identifying and assessing potential risks
operating parameters and meeting product
functions are of the design requirements.
specifications.
evaluated
Criteria DFMEAs PFMEAs
The evaluation An evaluation of the ability of design An evaluation of the ability of process controls
criteria for controls (related to the product or (mistake-proofing, fail-safes, gages) to prevent a
Detection ratings process) to prevent or detect failure mode (or cause) from occurring or detect the
usually focuses on: mechanisms of failure. effect of a failure if a failure has occurred.
Developed in the 1950s, FMEA was one of the earliest structured reliability improvement methods. Today it is still a highly
effective method of lowering the possibility of failure.
FMEA is not a substitute for good engineering. Rather, it enhances good engineering by applying the knowledge and
experience of a Cross Functional Team (CFT) to review the design progress of a product or process by assessing its risk of failure.
Design FMEA
Design FMEA (DFMEA) explores the possibility of product malfunctions, reduced product life, and safety and regulatory concerns
derived from:
Material Properties
Geometry
Tolerances
Process FMEA
Process FMEA (PFMEA) discovers failure that impacts product quality, reduced reliability of the process, customer dissatisfaction,
and safety or environmental hazards derived from:
Human Factors
Methods followed while processing
Materials used
Machines utilized
Measurement systems impact on acceptance
Environment Factors on process performance
Design FMEA Worksheet
Process FMEA Worksheet
Why Perform Failure Mode and Effects Analysis (FMEA)
Historically, the sooner a failure is discovered, the less it will cost. If a failure is discovered late in product development or launch, the
impact is exponentially more devastating.
FMEA is one of many tools used to discover failure at its earliest possible point in product or process design. Discovering a failure
early in Product Development (PD) using FMEA provides the benefits of:
Ultimately, this methodology is effective at identifying and correcting process failures early on so that you can avoid the nasty
consequences of poor performance.
Late Failure Mode Discovery
FMEA is performed in seven steps, with key activities at each step. The steps are separated to assure that only the appropriate
team members for each step are required to be present. The FMEA approach used by Quality-One has been developed to avoid
typical pitfalls which make the analysis slow and ineffective. The Quality-One Three Path Model allows for prioritization of activity
and efficient use of team time.
3. Path 2 Development (Potential Causes and Prevention Controls through Occurrence Ranking)
Pre-work involves the collection and creation of key documents. FMEA works smoothly through the development phases when an
investigation of past failures and preparatory documents is performed from its onset. Preparatory documents may include:
A pre-work Checklist is recommended for an efficient FMEA event. Checklist items may include:
Requirements to be included
Design and / or Process Assumptions
Preliminary Bill of Material / Components
Known causes from surrogate products
Potential causes from interfaces
Potential causes from design choices
Potential causes from noises and environments
Family or Baseline FMEA (Historical FMEA)
Past Test and Control Methods used on similar products
2. Path 1 Development- (Requirements through Severity Ranking)
Path 1 consists of inserting the functions, failure modes, effects of failure and Severity rankings. The pre-work documents assist in
this task by taking information previously captured to populate the first few columns (depending on the worksheet selected) of the
FMEA.
Functions should be written in verb-noun context. Each function must have an associated measurable. Functions may include:
o Wants, needs and desires translated
o Specifications of a design
o Government regulations
o Program-specific requirements
o Characteristics of product to be analyzed
o Desired process outputs
Failure Modes are written as anti-functions or anti-requirements in five potential ways:
o Full function failure
o Partial / degraded function failure
o Intermittent function failure
o Over function failure
o Unintended function failure
Effects are the results of failure, where each individual effect is given a Severity ranking. Actions are considered at this stage if the
Severity is 9 or 10
o Recommended Actions may be considered that impact the product or process design addressing Failure Modes on High Severity
Rankings (Safety and Regulatory)
3. Path 2 Development – (Potential Causes and Prevention Controls through Occurrence Ranking)
Causes are selected from the design / process inputs or past failures and placed in the Cause column when applicable to a specific
failure mode. The columns completed in Path 2 are:
Potential Causes / Mechanisms of Failure
Current Prevention Controls (i.e. standard work, previously successful designs, etc.)
Occurrence Rankings for each cause
Classification of Special Characteristics, if indicated
Actions are developed to address high risk Severity and Occurrence combinations, defined in the Quality-One Criticality Matrix
4. Path 3 Development- (Testing and Detection Controls through Detection Ranking)
Path 3 Development involves the addition of Detection Controls that verify that the design meets requirements (for Design FMEA) or
cause and/or failure mode, if undetected, may reach a customer (for Process FMEA).
The columns completed in Path 3 are:
o Detection Controls
o Detection Ranking
Actions are determined to improve the controls if they are insufficient to the Risks determined in Paths 1 and 2. Recommended Actions
should address weakness in the testing and/or control strategy.
Review and updates of the Design Verification Plan and Report (DVP&R) or Control Plans are also possible outcomes of Path 3.
5. Action Priority & Assignment
The Actions that were previously determined in Paths 1, 2 or 3 are assigned a Risk Priority Number (RPN) for action follow-up.
RPN is calculated by multiplying the Severity, Occurrence and Detection Rankings for each potential failure / effect, cause and
control combination. Actions should not be determined based on an RPN threshold value. This is done commonly and is a practice
that leads to poor team behavior. The columns completed are:
Review Recommended Actions and assign RPN for additional follow-up
Assign Actions to appropriate personnel
Assign action due dates
6. Actions Taken / Design Review
FMEA Actions are closed when counter measures have been taken and are successful at reducing risk. The purpose of an FMEA is to
discover and mitigate risk. FMEAs which do not find risk are considered to be weak and non-value added. Effort of the team did not
produce improvement and therefore time was wasted in the analysis.
7. Re-Ranking RPN and Closure
After successful confirmation of Risk Mitigation Actions, the Core Team or Team Leader will re-rank the appropriate ranking value
(Severity, Occurrence or Detection). The new rankings will be multiplied to attain the new RPN. The original RPN is compared to the
revised RPN and the relative improvement to the design or process has been confirmed. Columns completed in Step 7:
Re-ranked Severity
Re-ranked Occurrence
Re-ranked Detection
Re-ranked RPN
Generate new Actions, repeating Step 5, until risk has been mitigated
Comparison of initial RPN and revised RPN
FMEA Document Analysis
Deciding when to take an action on the FMEA has historically been determined by RPN thresholds. Quality-One does not
recommend the use of RPN thresholds for setting action targets. Such targets are believed to negatively change team behavior
because teams select the lowest numbers to get below the threshold and not actual risk, requiring mitigation.
The analysis of an FMEA should include multiple level considerations, including:
Severity of 9 / 10 or Safety and Regulatory alone (Failure Mode Actions)
Criticality combinations for Severity and Occurrence (Cause Actions)
Detection Controls (Test and Control Plan Actions)
RPN Pareto
When completed, Actions move the risk from its current position in the Quality-One FMEA Criticality Matrix to a lower risk
position.
RPN Action Priority
When risk is determined to be unacceptable, Quality-One recommends a priority of action to be applied as follows:
1. Error Proofing (Eliminate Failure Mode or Address Cause)
o Failure Mode (Only Severity of 9 or 10)
o Causes with High Occurrence
2. Improve Potential Process Capability
o Increase Tolerance (Tolerance Design)
o Reduce Variation of the Process (Statistical Process Control and Process Capability)
3. Improve Controls
o Mistake Proofing of the tooling or process
o Improve the inspection / evaluation techniques