Fault Tree Analysis (FTA) is a graphical technique for Reliability and Safety Analysis of Systems. It is used:
to investigate potential faults
its mode and causes and
quantify their contribution to system unreliability in the course of product design.
The basic constructs in a fault tree diagram are gates (conditions) and events (causes leading to failure).
Fault tree diagrams are logic block diagrams that display the state of a system (top event) in terms of the states of its components (basic events).
An FTD is built top-down in term of events. It begins with the foreseeable, undesirable loss event (or a fault condition). Subsequently, it attempts to determine the specific causes (events) by constructing a logic diagram using a graphic model of the pathways within a system that can lead to the failure. Each cause is further broken down till a basic fault: human, hardware or software is reached. The pathways connect contributory events and conditions, using standard logic symbols (AND, OR, etc.).
Example of an FTD – The Root Causes of Hazard to Patients during surgery [1]
The two most commonly used gates in a fault tree are the AND and OR gates.
OR Gate represent Logical Addition.
Even if one of the Inputs to an OR gate is “1” or “TRUE”, then the Output is “1” or “TRUE”.
If all the inputs are “0” or “FALSE”, then the Output is “0” or “FALSE”
AND Gate represents Logical Multiplication.
Even if one of the Inputs to an AND gate is “0” or “FALSE”, then the Output is “0” or “FALSE”.
If all the inputs are “1” or “TRUE”, then the Output is “1” or “TRUE”
The main purpose of the fault tree analysis is to help identify potential causes of system failures before the failures actually occur.
It can also be used to evaluate the probability of the top event using analytical or statistical methods. These calculations involve system quantitative reliability and maintainability information, such as failure probability, failure rate and repair rate.
After completing an FTA, efforts can be focused on improving system safety and reliability.
Situations where FTD is most effective:
It works well to identify possible causal relationships in cases where Output has a Boolean (True/False) relation with inputs, especially in small and medium sized systems where all causes /events can be conceived.
It can be used in situations where specific data regarding known failure rates of components is known.
It is used to supplement Root Cause analysis on engineered systems, by reviewing assumptions and design decisions made during initial system design.
Situations where FTD is least useful:
It is not effective in large complex systems as it is difficult to conceive all possible scenarios leading to the top event. The construction of fault trees can become very tedious and are prone to have errors.
It does not function well as a Root Cause Analysis tool because FTD does not work well when some of the causes could be Human actions. This is because wide variance of possible human failure rates prevents FTD from providing accurate results.
FTD is not very effective when there is event dependency or load sharing i.e. the occurrence of each event (cause) affects the probability of occurrence of the other events.
[1] http://asq.org/quality-progress/2002/03/problem-solving/what-is-a-fault-tree-analysis.html