FM – a pragmatic tool to model, analyse and predict complex behaviour of industrial systems

The Authors

Rajiv Kumar Sharma, Department of Mechanical Engineering, National Institute of Technology, Hamirpur, India

Dinesh Kumar, Department of Mechanical and Industrial Engineering, Indian Institute of Technology, Roorkee, India

Pradeep Kumar, Department of Mechanical and Industrial Engineering, Indian Institute of Technology, Roorkee, India

Abstract

Purpose – This paper aims to permit the system reliability analysts/managers/practitioners/engineers to analyze the system failure behavior using fuzzy methodology (FM)

Design/methodology/approach – In order to deal with both qualitative and quantitative information related to system performance the authors have adopted failure mode effect analysis (FMEA) and Petrinets (PNs), the well-known tools for reliability analysis, to build an integrated framework aimed at helping the reliability and maintenance managers in decision-making.

Findings – Using the proposed framework an industrial case from the paper mill is examined. From the results it is observed that the limitations associated with the traditional procedure of risk ranking in FMEA are efficiently modeled using fuzzy decision-making system (FDMS) based on FM. Also, the fuzzy synthesis of system failure and repair data helps to quantify the system behavior in a more realistic manner.

Originality/value – The simultaneous adoption of the proposed techniques to model, analyze and predict the uncertain behavior of an industrial system will not only help the reliability engineers/managers/practitioners to understand the behavioral dynamics of system but also to plan/adapt suitable maintenance practices to improve system reliability, availability and maintainability (RAM) aspects.

Article Type:

Research paper

Keyword(s):

System monitoring; Reliability management; Maintenance; Failure (mechanical); Paper industry.

Journal:

Engineering Computations: International Journal for Computer-Aided Engineering and Software

Volume:

24

Number:

4

Year:

2007

pp:

319-346

Copyright ©

Emerald Group Publishing Limited

ISSN:

0264-4401

1 Introduction

If everything went well and meets desired requirements, then there would be perhaps no failures but unfortunately failure is nearly an unavoidable phenomenon in mechanical systems/components. One can observe various kinds of failures in past under various circumstances such as nuclear explosions (Chernobyl nuclear disaster, 1986), Industrial plant leakages (Union carbide plant, Bhopal, 1984, Oil pipeline at Jesse Nigeria, 1998), aero plane crashes, and electrical network shutdowns, etc. which may be due to human error, poor maintenance, inadequate testing/inspection. With advances in technology and growing complexity of technological systems the job of reliability/system analyst has become more challenging. As they have to study, characterize measure and analyze the behavior of system using various qualitative and quantitative techniques (Cai, 1996; Modarres and Kaminsky, 1999; O'Connor, 2000; Vaurio, 2005; Aksu et al., 2006).

In reliability and maintainability studies a small number of researchers have seriously addressed the issue of handling uncertainties especially related with failure data (Fonseca and Knapp, 2001). The traditional analytical techniques (mathematical and statistical models) needs large amount of data, which is difficult to obtain because of numerous constraints such as rare events of component failure, human errors and economic considerations. Even if data are available, it is often inaccurate and thus, subjected to uncertainty, i.e. historical records can only represent the past behavior but may be unable to predict the future behavior of the equipment. Further age, adverse operating conditions and the vagaries of manufacturing processes affects each part/unit of system differently (Cizelj et al., 2001; Sergaki and Kalaitzakis, 2002). Therefore, it may be difficult or even impossible to establish rational database to accommodate all operating and environmental conditions. Though, virtually all the commercially available Computerized Maintenance Management System (CMMS) software packages offer data collection facilities but lack any decision analysis support for management. Hence, as shown in Table I (P i , i=1 to 7 CMMS packages), a black hole exists in the column titled decision analysis (Shorrocks and Labib, 2000; Exton and Labib, 2002; Labib, 2003) because virtually no CMMS package offers decision support. Figure 1 shows how CMMS lacks in predictive maintenance data analysis and equipment failure analysis? (Swanson, 1997). According to Boznos (1998) most of companies are either dissatisfied or are neutral with the performance of CMMS. To counter the problem of uncertainty in decision analysis with respect to reliability and maintainability aspects both probabilistic and non-probabilistic methods are available in the literature. Based on the mature scientific theory, the probabilistic methods deals with uncertainty which is essentially random in nature but of an ordered kind. For instance, Bayesian methodology appeared in late 1970s is widely used in probabilistic risk assessment, an exercise aimed at estimating the probability and consequences of accidents for the facility/process under study (Siu and Kelly, 1998; Aven and Kvaløy, 2002). The non-probabilistic/inexact reasoning methods on the other hand study problems which are not probabilistic but cause uncertainty due to imprecision associated with the complexity of the systems as well as vagueness of human judgment. These methods are still developing and often use fuzzy sets, possibility theory and belief functions. For instance, in their work Sergaki and Kalaitzakis (2002) developed a fuzzy relational database model for manipulating the data required for criticality ranking of components in thermal powers plants. Liu et al. (2005), in their work proposed a framework for modeling, analyzing and synthesizing system safety of engineering systems on the basis of rule-based inference methodology using evidential reasoning. The framework has been applied to model system safety of an offshore and marine engineering system. From the literature studies it is observed that the field of inexact reasoning provides necessary help in coping with the imprecise and uncertain information pertaining to the system component/parts. In case of non-existence of precise data, approximate estimates of probabilities can be worked out.

Liao (2005) surveyed the development of various expert system methodologies over the last four decades. Among them fuzzy methodology (FM) has been widely applied in variety of areas such as fault diagnosis (Hauptmanns, 2004); structural reliability (Savoia, 2002); software reliability (Popstojanova and Trivedi, 2001; Wang et al., 2006); human reliability (Konstandinidou et al., 2006); safety and risk engineering (Sii et al., 2001; Guimarães and Lapa, 2007) and quality (Yang et al., 2003; Sharma et al., 2007a). In the words of Cai (1996) “Undoubtedly fuzzy methodology in system failure engineering is noticeable and growing area and is still lying in speculative research period and is premature”. In a hierarchical structure the reliability of a system is determined by the constituent sub-systems and reliability of each subsystem is, in turn, determined by the associated components and their possible failure modes. Therefore, to model, analyze and predict the system failure behavior a structured framework which makes use of information produced at lower level is required. Owing to its sound logic, effectiveness in quantifying the vagueness and imprecision in human judgment, the FM has been used as an effective tool in the study to synthesize the system information (obtained from operators experience or manufacturer's specification and expert opinions) with the help of fuzzy set principles. In the paper the author's presents an integrated framework to analyze the complex behavior of a system. The proposed framework makes use of both qualitative and qualitative techniques. In the qualitative framework first the root cause analysis (RCA) of the system has been done to list out potential failure causes using failure mode and effect analysis (FMEA). Using the selected experts, the values of failure of occurrence (O f), likelihood of non-detection of failure (O d), and severity (S) of failure of various components are ascertained and resulting risk priority number (RPN) is computed. Then, the problems associated with the traditional FMEA are handled by developing a decision support system based on FM. Further, to analyze the system behavior in quantitative terms, the failure rate and repair times associated with the system components are modeled using fuzzy synthesis of information. Various system parameters such as availability, mean time between failures (MTBF), reliability, expected number of failures (ENOF), etc. are quantified in terms of fuzzy, crisp and defuzzified values.

2 Failure analysis techniques

According to International Electro technical Vocabulary (IEV), failure is defined as “the termination of the ability of an item to perform a required function” (IEC 50 (191), 1990). Among various failure methods, which include reliability block diagrams (RBDs), Monte Carlo simulation (MCS), Markov Modeling (MM), FMEA, fault tree analysis (FTA) and Petri nets (PNs) (Ebeling, 2001; Adamyan and David, 2002, 2004). The section presents brief account of only those which are used in the study for analyzing the system behavior both in qualitative and quantitative terms.

A fault tree is used to analyze the probabilities associated with the various failure causes and their effects on system performance. FTA starts by identifying a problem (an accident or an undesirable event) and all possible ways that the problem (failure) occurs. Since, 1960 the tool has been widely used for obtaining reliability information about the complex systems. The system failure analysis using fault tree methodology makes use of either qualitative or quantitative techniques. In quantitative techniques Monte-Carlo simulation and analytical solution approach is used to determine system reliability parameters where as in qualitative technique minimal cut set and path sets are used to determine system reliability parameters. Contrary to fault trees, Petri nets can more efficiently derive the minimal cut and path sets (Liu and Chiou, 1997; Adamyan and David, 2002). Also, the absorption property of PNs helps to simplify the Petri net model and determine minimal cut set and path sets by reorganizing the transitions which is possible as long as the firing time is not taken into consideration, i.e. transfer of tokens does not take place (static condition). Similar to fault tree, PNs makes use of digraph to describe cause and effect relationship between conditions and events. Petri nets have two types of nodes named place “P” and transition “T”. These nodes are connected by arcs “A”, i.e. arcs connect transitions to places or places to transitions. The basic symbols used in Petri net model (Figure 2) are defined as under:

○: Place, drawn as a circle.

–: Transition, drawn as a bar.

↑: Arc, drawn as an arrow, between places and transitions.

•: Token, drawn as a dot, contained in places.

Petri net, a directed bipartite graph is defined by a six-tuple (Peterson, 2000): Equation 1 where: T={t 1,t 2, … ,t n }: a set of transitions, each transition representing an event or an action; P={p 1,p 2, … ,p l }: a set of places, where a place is used to represent either the condition for the event or the consequences of the event; A⊆{T×P}∪{P×T}: a set of directed arcs that connect transitions to places and places to transitions; M 0: the initial marking of the system that represents initial state of the system; I(t)={p|(p, t)∈A}: a set of input places of a transition t; and O(t)={p| (t,p)∈A}: a set of output places of a transition t.

It has two parts, i.e. static and dynamic. The static part consists of places (p), transitions (t) and arrows (A). While the dynamic part is related with marking of graph by tokens which are present, not present or evolves dynamically on firing of valid transitions. As shown in Figure 2(a), the static part, and Figure 2(b) the dynamic parts, i.e. before firing there is one token in each of input places P 1 and P 2 but no token in output place P 3. Accordingly, the Petri net marking is M=(1, 1, and 0) and after firing of transition (based on enabling rules) the token moves from each of P 1 and P 2 to the output place P 3.

From the literature studies it is observed that both Petri nets and fault tree methods are used for software reliability analysis (Kumar and Aggarwal, 1993), analysis of coherent fault trees (Hauptmanns, 2004) and fault diagnosis (Mustapha et al., 2004). Exclusively in the field of reliability engineering the application of Petri nets has been presented for reliability evaluation (Adamyan and David, 2002, 2004), Markov analysis (German, 2000; Aneziris and Papazoglou, 2004; Schoenig et al., 2006) and stochastic modeling (Ciardo et al., 1994; Sahner and Trivedi, 1996). In the paper, the authors has used the static part of PNs to model the quantitative behavior of system.

To diagnose the unreliable aspects of the system, RCA provides comprehensive classification of causes related to the various units comprising a system. The information obtained from RCA helps to establish a knowledge base for conducting FMEA Banking upon this information, FMEA is carried out by listing all the possible failure causes related to the system its components/parts. The tool has been used by reliability engineers to identify critical components/parts/functions whose failure will lead to undesirable outcomes such as production loss, injury or even an accident. It was developed at Grumman Aircraft Corporation in the 1950 and 1960s. Since, then, it has been extensively used as a powerful technique for system safety and reliability analysis of products and processes in wide range of industries – particularly aerospace, nuclear, automotive and medical (O'Connor, 2000; Ebeling, 2001; Xu et al., 2002; Bowles, 2003; Sharma et al., 2005b; Tay and Lim, 2006). The main objective of FMEA is to discover and prioritize the potential failure modes (by computing respective RPN), which pose a detrimental effect on the system and its performance. The results of the analysis help managers and engineers to identify the failure modes, their causes and correct them during the stages of design and production. The general procedure for carrying out FMEA process is shown by a flow chart (Figure 3).

The main disadvantage of RPN approach is that various sets of input terms, i.e. O f, S and O d, may produce an identical value, however, the risk implication may be totally different which may result in high-risk events may go unnoticed. For instance, consider two different events having values of O f=3, S=4, O d=5 and O f=1, S=10, O d=6, respectively. Both these events will have a total RPN value of 60 however; the risk implications of these two events may not necessarily be the same which may result in high-risk events may go unnoticed. The other disadvantage of the RPN ranking is that it neglects the relative importance among O f, S and O d. The three factors are assumed to have the same importance but in real practical applications the relative importance among the factors exists. For instance, a failure mode with a very high severity, low rate of occurrence, and moderate detectability (say 9, 3, and 5, respectively) may have a lower RPN (135) than one with all parameters moderate (say 5, 6, and 6 yielding an RPN of 180) even though it should have a higher priority for corrective action. To address these disadvantages a computer-based intelligent fuzzy decision support system is designed to prioritize the failure causes.

3 Basic notions on fuzzy approach

The section presents only those basic concepts related to fuzzy set theory, which are helpful for analyzing system behavior and developing a decision support system for FMEA (Zadeh, 1996; Zimmermann, 1996; Kokso, 1999; Ross, 2000; Tanaka, 2001).

3.1 Crisp versus fuzzy sets

Crisp (classical) sets contain objects that satisfy precise properties of membership functions (MF). Only two possibilities whether an element belongs to, or not belongs to a set exist. A crisp set “A” can be represented by a characteristic function M A/ u={0, 1}: Equation 2 where: U: universe of discourse; X: element of U, A: crisp set and M: characteristic function.

On the other hand fuzzy sets contain objects that satisfy imprecise properties of MFs, i.e. membership of an object in a fuzzy set can be partial. Contrary to classical sets, fuzzy sets accommodate various degree of membership on continuous interval [0, 1] where “0” conforms to no membership and “1” conforms to full membership. Mathematically defined by equation (2): Equation 3 where: μ (x): degree of membership of element x in fuzzy set .

Larger the μ (x), stronger is the degree of belongingness for x in . If μ (x)=1, the element x (with 100 percent certainty) completely belongs to fuzzy set A. otherwise if μ (x)=0, the element x does not belongs to A. Values between “0” and “1” mean intermediate values of certainty.

3.2 Membership functions, α cuts and linguistic variables

A MF is a curve that defines how each point in the input space is mapped to a membership value (partial truth) between 0 and 1. Various types of MFs such as triangular, trapezoidal, γ and rectangular can be used for reliability analysis. However, triangular membership functions (TMFs) are widely used for calculating and interpreting reliability data because of their simplicity and understandability (Yadav et al., 2003; Bai and Asgarpoor, 2004). For instance, imprecise or incomplete information such as low/high failure rate, i.e. about 5 or between 6 and 8 is well represented by TMF. In the study, TMF is used as it reflects the dispersion of the data adequately.

To resolve fuzzy sets in terms of constituent crisp sets, the concept of α cuts is used. They are indispensable in performing arithmetic operations with fuzzy sets. The α cut of a fuzzy set denoted by α, is the crisp set comprised of all elements x of universe of discourse for which membership of greater than or equal to α that is: Equation 4 where α is a parameter in the range 0≤x≤1 with introduction of α cuts, is defined as: Equation 5 The confidence interval defined by a cut is written as (equation (3)) (Figure 4). The confidence interval defined by a cut is written as (equation (3)): Equation 6 Moreover, when an event is imprecisely or vaguely defined, the experts would simply say that the possibility of occurrence of a given event is “low” “high” and “fairly high”. To estimate such subjective events linguistic expressions are used. The analyst can use linguistic variables to assess and compute the events using well-defined fuzzy MFs. A linguistic variable is characterized by (X, T, U, M) where:

X: the linguistic variable, for example, X is the FL of an item.

T: the set of linguistic values that X can take, for example, T={Very low, Low, Average, Frequent, Highly frequent}.

U: the actual physical domain in which the linguistic variable X takes its quantitative (crisp) values, for example, U=[F Very  low,F Highly  frequent].

M: semantic rule that relates each linguistic value in T with a fuzzy set in U, for example, M relates “very low” “low” “highly frequent” with the specific MF.

In the study, the linguistic terms such as remote, low, moderate, high and very high are used to represent the probability of failure of occurrence, non-detectability and severity related with the failure causes in FMEA.

3.3 Fuzzy rule base and inference system

The rule base describes the criticality level or riskiness of the system for each combination of input variables. Rules are formulated in linguistic terms using two approaches:

  1. expert knowledge and expertise; and
  2. fuzzy model of the process.

Oftenly expressed in “If-Then” form, where, If: an antecedent which is compared to the inputs and Then: a consequent, which is the result/output.

For instance, the format of rules is defined as: Equation 7 In practical applications the fuzziness of the antecedents eliminates the need for precise match with the inputs. All the rules that have any truth in their antecedent will fire and contributes towards the fuzzy conclusion set. Each rule is fired to a degree that is function of the degree to which its antecedent matches the input. This imprecise matching provides a basis for interpolation between possible input states and serves to minimize the number of rules needed to describe the input-output relation (Bowles and Pelaez, 1995).

By using the inference mechanism an output fuzzy set is obtained from the rules and the input variables. Figure 5, shows the schematic representation of the fuzzy reasoning mechanism with two rules. First, the numerical input variables (occurrence, severity) are fuzzified using appropriate MFs. Then, the min operator is used for the conjunction and for the implication operations. The outputs (individual fuzzy sets) are aggregated by using the max operator and finally, the aggregated output is defuzzified to obtain a crisp value.

The two most common types of inference systems frequently used are:

  1. the max-min inference; and
  2. the max-prod inference method.

Examples of t-norms are the minimum, oftenly called “Mamdani implication” and the product, called the Larsen implication. In the study Mamdani's max-min inference method is used. The min operator is used for the conjunction of the rule and for the implication function and the max operator is used for the aggregation of the fuzzy sets. The compositional rule of inference proposed by Zadeh (1975) results in equation (5): Equation 8 where, β k =minα i,k [α i,k =sup min (μ A (x i ),μ Ai,k (x)) ]

3.4 Defuzzification

In order to obtain a crisp result from fuzzy output, deffuzification is carried out. In the literature various techniques for defuzzification such as centroid, bisector, middle of the max, weighted average exists. The criterion's for their selection are:

In the study, the centroid method is used for defuzzification in both qualitative and quantitative analysis as it gives mean value of the parameters. Mathematically, represented as (equation (6)): Equation 9 where, B′ is the output fuzzy set, and μ B i is the MF.

4 An example

As an example a case from process industry (paper mill) situated in northern part of India (producing 180 tons of paper per day) is taken to discuss the failure behavior of system using the failure analysis techniques discussed in Section 2. There are many functional units in a paper mill such as feeding, pulp preparation, pulp washing, screening, bleaching and preparation of paper. The present analysis is based on the study of washing system. The schematic diagram of the system is shown in Figure 6. The system consists of four main subsystems defined as:

  1. Filter [SS1]: filter is employed to drain black liquor from the cooked pulp.
  2. Cleaners [SS2]: cleaners has three units in parallel. Here, water is mixed with pulp to cleanse by centrifugal action. Failure of any one will reduce the efficiency of the system, which reduces the quality of paper.
  3. Screen [SS3]: screener has two units in series. These are used to remove oversized, uncooked and odd shaped fibers from pulp through straining action. Failure of any one unit will cause system to fail.
  4. Decker [SS4]: decker has two units in parallel. Complete failure of decker occurs when both fail.

4.1 Qualitative framework

The unreliable aspects of the washing system (Figure 6) are determined by comprehensive classification of failure causes pertaining to the system components (deckers, filters, screeners and cleaners) using RCA as shown in Figure 7. Then, FMEA of the system is carried out by listing all the possible failure causes, their function and effect on system. In brief the methodology used to compute the scores related to failure of occurrence (O f), likelihood of non-detection of failure (O d), and severity (S) of failure of various components in FMEA is discussed as under (Sharma et al., 2005b).

4.1.1 Probability of occurrence of failure (O f)

Probability of occurrence of failure is evaluated as a function of MTBF. The data related to MTBF of components is obtained from previous historical records, maintenance log-books and is then integrated with the experience of maintenance personnel. For instance, if MTBF of component is between two and four months then probability of occurrence of failure is high (occurrence rate 0.5-1 percent) with the score ranging between seven and eight. Table II presents the linguistic assessment of probability of failure occurrence with corresponding MTBF and scores assigned.

4.1.2 Probability of non-detection of failures (O d)

The chance of detecting a failure cause or mechanism depends on various factors such as ability of operator or maintenance personnel to detect failure through naked eye or by periodical inspection or with the help of machine diagnostic aids such as automatic controls, alarms and sensors. For instance, probability of non-detection of failure of a component through naked eye is say, 0-5 percent is ranked 1 with non-detectability remote.

4.1.3 Severity of failure(S)

Severity of failure is assessed by the possible outcome of failure effect on the system performance. If the effect is not noticed and has no effect on performance than the severity is termed as remote. The slight deterioration in system performance, regards severity as moderate and significant deterioration regards severity as high. The effect is said to be very high when non-conforming products are produced or there is appreciable production loss. For instance, if MTTR of facility/component in washing unit is less, say lies between 1/4 and 1/5 h, than effect may be regarded as remote. If external intervention is required for repairs, or MTTR exceeds 1/2 days and there is appreciable deterioration in the quality of the pulp than effect may be regarded as high and if system degrades resulting in line shut down/production stoppage than the severity may be regarded as very high. Apart from the parameters listed in Table II the economics (associated with maintenance, spares and manpower) and safety aspects can also be used for determining severity effect.

Table III presents the detailed FMEA analysis for the system. The numerical values of FMEA parameters, i.e. O f, S and O d are obtained by using the discussed methodology. Then RPN number for each failure cause is evaluated by multiplying the factor scores, obtained from the selected experts.

From Table III it is observed that for deckers, WC48 and WC49 represented by different sets of linguistic terms produce an identical RPN, i.e. 336, however, the risk implication for both the causes may be totally different. For pumps, the causes WC54 and WC57 represented by same linguistic terms produce different RPN and are ranked 2nd and 1st, respectively, which could be misleading and so on. The above listed limitations of traditional FMEA are addressed by using FM. A decision support system based on fuzzy set theory is developed using MATLAB Toolbox 6.3. The basic system architecture of the proposed system as shown in Figure 8 consists of three main modules, i.e. knowledge base module and user input/output interface module. The input parameters, i.e. O f, S and O d, used in FMEA, are fuzzified using appropriate MFs to determine degree of membership in each input class. For the output variable, riskiness/priority level (Figure 9 (a) and (b)) both triangular and trapezoidal MFs are used. Multiple experts with different degree of competencies “C” are used to construct the MF. The resulting fuzzy inputs are evaluated in fuzzy inference engine, which makes use of well-defined rule base. In the study, based on the MFs of three input variables O f, S, O d with, five fuzzy sets in each, a total of 125 rules can be generated. Table IV shows the nature of rules. However, these rules are combined (wherever possible). For example, the three rules as shown in Table IV can be combined and read as: If O f is High and S is Moderate and O d is High Then Risk Priority is Moderate-High or any other combination of the three linguistic terms assigned to O f, S and O d.

Finally, to express the riskiness/criticality level of the failures so that corrective or remedial actions can be prioritized accordingly, defuzzification is done using centroid method to obtain crisp ranking from the fuzzy conclusion set. Figure 10 shows the fuzzy inference outputs for two failure causes WC33 and WC45.The comparison of the results obtained through traditional and fuzzy approach is presented in Table V, respectively.

4.1.4 Discussion

From the comparative results presented in Table V, for cleaners, it is evident that in traditional FMEA, events with same linguistic terms produce different RPN but identical ranking is produced with help of fuzzy decision-making system (FDMS). For instance, in case of cleaners causes WC22 and WC24, where O f, S and O d are described by same linguistic terms, i.e. moderate, moderate, high, respectively, the defuzzified output is 0.617. This entails that these two causes should be given the same priority for attention. The RPN method, however, produces an output of 288 and 240 for these causes and ranks them at third and fourth place, respectively. This means that WC22 has the highest priority than WC24, which could be misleading.

Similarly, in case of deckers, the causes WC41 and WC42 described with same linguistic terms, i.e moderate, moderate, moderate produce different RPN and are assigned different priorities but fuzzy method produces same output and same priorities for them. Also, causes WC48 and WC49, where O f, S and O d are described by different linguistic terms, i.e. high, moderate, high and moderate, high, high, respectively, the FMEA output/rank is same, i.e. 336/2nd but FDMS differentiates them and ranks WC48 at 3rd place and WC49 at 2nd place. For pumps, the causes WC54 and WC57 represented by same linguistic terms produce different RPN using traditional FMEA and are ranked 3rd and 1st, respectively, which could be misleading. On the other hand fuzzy approach produce same output/rank.

4.2 Quantitative analysis

In order to measure and analyze the behavior of system quantification of various system parameters (such as repair time (τ), failure rate (λ), MTBF, Av. and ENOF) is essential for managerial decision-making with respect to maintenance and manpower planning. In this framework, the Petrinet model of the system is obtained first from its equivalent fault tree model (Figure 11(a) and (b)) and then with the help of fuzzy synthesis of system information the fuzzy, crisp and defuzzified results are obtained (based on the steps shown in Figure 12) to analyze the system behavior.

In brief the steps are discussed as:

4.3 Behavior analysis

To analyze and predict the behavior of system in quantitative terms, the crisp and defuzzified values of system parameters are calculated at ±15,±25, and ±60 percent spread as the managers are always confronted with the maintenance decisions with respect to the systems and their components/parts. From the Table VIII, it is observed that defuzzified value changes with change in percentage-spread. For instance, repair time first increases by 5.28 percent when spread changes from ±15 to ±25 percent and further by 14.20 percent when spread changes from ±25 to ±60 percent. Similarly, for failure rate and ENOF, with increase in spread, increase in defuzzified values, is observed. On the other hand, at the same time for MTBF a decrease of 0.34 percent when spread changes from ±15 to ±25 percent and further by 2.836 percent when spread changes from ±25 to ±60 percent is observed. Similarly, for availability and reliability decrease in defuzzified values with increase in spread is observed. The computation of various system parameters (failure rate, repair time, ENOF, MTBF, availability and reliability) at different spreads helps the maintenance managers to understand the dynamics of system behavior. From the above discussions it is inferred that the maintenance action for the system should be based on defuzzified MTBF rather than on crisp value because with the reduced MTBF values a safe interval between maintenance actions can be established and inspections (continuous or periodic) can be conducted to monitor the condition or status of various equipments constituting the system before it reaches the crisp value. It can also be observed that with increase in repair time the availability goes on decreasing.

5 Conclusion

Owing to its sound logic, effectiveness in quantifying the vagueness and imprecision in human judgment, the FM is successfully used to evaluate and assess the system failure behavior. The in-depth analysis of system with the help of RCA helps to identify the various failure modes and their causes, the information on which FMEA for the system can be build. The use of FDMS not only addresses the seriously debated disadvantages associated with traditional procedure for conducting FMEA but also integrates expert judgment, experience and expertise in more flexible and realistic manner. For example, as evident from the comparative results in Table V that a decision support system based on fuzzy set theory combine severity, occurrence and non-detectability (the three input parameters in FMEA) in more flexible manner as compared to traditional FMEA.

Further, the quantitative analysis conducted to analyze and predict the system behavior helps the maintenance managers to understand the behavioral dynamics of the system. For instance, the triangular fuzzy graphical representations of system parameters (Figure 14) provides a general idea to the maintenance managers and helps them to predict the reliability measure(s) for the system (depending upon the value of confidence factor, α).The dispersion represented with the TMFs takes care of inherent variation in human performance, vagueness in system performance due to age and adverse operating conditions. Thus, it becomes intitutive for the engineers to arrive at decisions.

It is concluded from the study that the application of inexact reasoning methods (particularly the FM) in system failure engineering help the system/reliability analysts to:

6 Managerial implications

The important managerial implication of the unified approach presented in the study is to enable the managers/practitioners/researchers to define, measure, and analyze the imprecise information associated with reliability and maintainability aspects of the production systems. Certain other implications are summarized as under.

The FM in system failure engineering:

Though, the results still depends upon the expert's judgment (especially when building a knowledge base for fuzzy inference system in FMEA) and the quality of the information obtained from various sources, as with any modeling framework one has to exercise great care to ensure that the data and inputs presented to the method are of good quality without which the results could be biased.

ImageEquation 1
Equation 1

ImageEquation 2
Equation 2

ImageEquation 3
Equation 3

ImageEquation 4
Equation 4

ImageEquation 5
Equation 5

ImageEquation 6
Equation 6

ImageEquation 7
Equation 7

ImageEquation 8
Equation 8

ImageEquation 9
Equation 9

ImageApplications of CMMS modules
Figure 1Applications of CMMS modules

Image(a) Static (b) dynamic Petrinets
Figure 2(a) Static (b) dynamic Petrinets

ImageFailure mode effect analysis
Figure 3Failure mode effect analysis

Image cut of a fuzzy set
Figure 4 α cut of a fuzzy set

ImageFuzzy reasoning mechanism
Figure 5Fuzzy reasoning mechanism

ImageWashing system
Figure 6Washing system

ImageRoot cause analysis (washing system)
Figure 7Root cause analysis (washing system)

ImageArchitecture of fuzzy decision making system (FDMS)
Figure 8Architecture of fuzzy decision making system (FDMS)

ImageFuzzy representation of MFs (a) , , and  (b) risk priority
Figure 9Fuzzy representation of MFs (a) O s, S, and O d (b) risk priority

ImageFIS outputs for WC and WC
Figure 10FIS outputs for WC33 and WC45

ImageWashing system (a) fault tree model (b) Petrinet model
Figure 11Washing system (a) fault tree model (b) Petrinet model

ImageFramework of quantitative analysis
Figure 12Framework of quantitative analysis

ImageInput fuzzy triangular number representation (filter)
Figure 13Input fuzzy triangular number representation (filter)

ImageFuzzy representations of system parameters (a) repair time and failure rate; (b) reliability and availability; and (c) meantime between failures and expected number of failures
Figure 14Fuzzy representations of system parameters (a) repair time and failure rate; (b) reliability and availability; and (c) meantime between failures and expected number of failures

ImageCommon CMMS packages
Table ICommon CMMS packages

ImageScale used to measure (probability of failure occurrence, severity and detection)
Table IIScale used to measure (probability of failure occurrence, severity and detection)

ImageDetailed FMEA (washing system)
Table IIIDetailed FMEA (washing system)

ImageIf-Then rules
Table IVIf-Then rules

ImageComparative results (traditional and fuzzy ranking)
Table VComparative results (traditional and fuzzy ranking)

Image(a) System data and (b) fuzzy expressions
Table VI(a) System data and (b) fuzzy expressions

Image(a-c) Computed system parameters
Table VII(a-c) Computed system parameters

ImageDefuzzified values at different spreads
Table VIIIDefuzzified values at different spreads

References

Adamyan, A., David, He. (2002), "Analysis of sequential failures for assessment of reliability and safety of manufacturing systems", Reliability Engineering & System Safety, Vol. 76 pp.227-36.

[Manual request] [Infotrieve]

Adamyan, A., David, H. (2004), "System failure analysis through counters of Petrinet models", Quality & Reliability Engineering International, Vol. 20 No.4, pp.317-35.

[Manual request] [Infotrieve]

Aksu, S., Aksu, S., Osman, T. (2006), "Reliability and availability of pod propulsion systems", Quality Reliability Engineering International, Vol. 22 No.4, pp.41-58.

[Manual request] [Infotrieve]

Aneziris, O.N., Papazoglou, I.A. (2004), "Fast Markovian method for dynamic safety analysis of process plants'", Journal of Loss Prevention in the Process Industries, Vol. 17 No.1, pp.1-8.

[Manual request] [Infotrieve]

Aven, T., Kvaløy, J.T. (2002), "Implementing the Bayesian paradigm in risk analysis", Reliability Engineering & System Safety, Vol. 78 pp.195-201.

[Manual request] [Infotrieve]

Bai, X., Asgarpoor, S. (2004), "Fuzzy based approaches to substation reliability evaluation", Electric Power Systems Research, Vol. 69 pp.197-204.

[Manual request] [Infotrieve]

Bowles, J.B. (2003), "An assessment of RPN prioritization in a failure modes effects and criticality analysis", Proceedings of the Annual Reliability and Maintainability Symposium, pp.380-6.

[Manual request] [Infotrieve]

Bowles, J.B., Pelaez, C.E. (1995), "Fuzzy logic prioritization of failures in a system failure mode, effects and criticality analysis", Reliability Engineering & System Safety, Vol. 50 pp.203-13.

[Manual request] [Infotrieve]

Boznos, D. (1998), "The use of CMMSs to support team-based maintenance", Cranfield University, Cranfield, .

[Manual request] [Infotrieve]

Cai, K.Y. (1996), "System failure engineering and fuzzy methodology. An introductory overview", Fuzzy Sets and Systems, Vol. 83 pp.113-33.

[Manual request] [Infotrieve]

Ciardo, G., German, R., Lindermann, C. (1994), "A characteristics of the stochastic process underlying stochastic petrinets", IEEE Transactions Software Engineering, Vol. 20 pp.506-15.

[Manual request] [Infotrieve]

Cizelj, R.J., Mavko, B., Kljenak, I. (2001), "Component reliability assessment using quantitative and qualitative data", Reliability Engineering & System Safety, Vol. 71 No.1, pp.81-95.

[Manual request] [Infotrieve]

Ebeling, C. (2001), An Introduction to Reliability and Maintainability Engineering, Tata McGraw-Hill, New York, NY, .

[Manual request] [Infotrieve]

Exton, T., Labib, A.W. (2002), "Spare parts decision analysis the missing link in CMMSs (Part II)", Journal of Maintenance & Asset Management, Vol. 17 No.1, pp.14-21.

[Manual request] [Infotrieve]

Fonseca, D.J., Knapp, G.M. (2001), "A fuzzy scheme for failure mode screening", Fuzzy Sets and Systems, Vol. 121 No.3, pp.453-7.

[Manual request] [Infotrieve]

German, R. (2000), Performance Analysis of Communication Systems Modeling with Non-Markovian Stochastic Petrinets, Wiley, New York, NY, .

[Manual request] [Infotrieve]

Guimarães, A.C.F., Lapa, C.M.F. (2007), "Fuzzy inference to risk assessment on nuclear engineering systems", Applied Soft Computing, Vol. 7 No.1, pp.17-28.

[Manual request] [Infotrieve]

Hauptmanns, U. (2004), "Semi-quantitative fault tree analysis for process plant safety using frequency and probability ranges", Journal of Loss Prevention in the Process Industry, Vol. 17 pp.339-45.

[Manual request] [Infotrieve]

IEC 50 (191) (1990), International Electro Technical Vocabulary (IEV), International Electro Technical Commission, Geneva, .

[Manual request] [Infotrieve]

Kokso, B. (1999), Fuzzy Engineering, Prentice-Hall, Englewood Cliffs, NJ, .

[Manual request] [Infotrieve]

Konstandinidou, M., Nivolianitou, Z., Kiranoudis, C., Markatos, N. (2006), "A fuzzy modeling application of CREAM methodology for reliability", Reliability Engineering & System Safety, Vol. 91 No.6, pp.706-16.

[Manual request] [Infotrieve]

Kumar, D. (1991), "Analysis and optimization of systems availability in sugar, paper and fertilizer industries", University of Roorkee, Roorkee, .

[Manual request] [Infotrieve]

Kumar, V., Aggarwal, K.K. (1993), "Petri net modeling and reliability evaluation of distributed processing systems", Reliability Engineering & System Safety, Vol. 41 pp.167-76.

[Manual request] [Infotrieve]

Labib, A.W. (2003), "Computerized maintenance management systems (CMMSs): a black hole or a black box?", Journal of Maintenance & Asset Management, Vol. 18 No.3, pp.16-21.

[Manual request] [Infotrieve]

Liao, S.H. (2005), "Expert system methodologies and applications – a decade review from 1995 to 2004", Expert Systems with Applications, Vol. 28 No.1, pp.93-103.

[Manual request] [Infotrieve]

Liu, T.S., Chiou, S.B. (1997), "The application of Petri nets to failure analysis", Reliability Engineering & System Safety, Vol. 57 pp.129-1428.

[Manual request] [Infotrieve]

Liu, J., Yang, J.B., Wang, J., Sii, W.J. (2005), "Engineering system safety analysis using fuzzy evidential reasoning approach", Quality Reliability Engineering International, Vol. 21 pp.387-411.

[Manual request] [Infotrieve]

Modarres, M., Kaminsky, M.P. (1999), Reliability Engineering and Risk Analysis, Marcel Dekker, New York, NY, .

[Manual request] [Infotrieve]

Mustapha, F., Sapun, S.M., Ismail, N., Mokhtar, A.S. (2004), "A computer based intelligent system for fault diagnosis of an aircraft engine", Engineering Computations, Vol. 21 No.1, pp.78-90.

[Manual request] [Infotrieve]

O'Connor, P.D.T. (2000), Practical Reliability Engineering, Heyden, London, .

[Manual request] [Infotrieve]

Peterson, J.L. (2000), Petri Net Theory and the Modeling of Systems, Prentice-Hall, Englewood Cliffs, NJ, .

[Manual request] [Infotrieve]

Popstojanova, K.G., Trivedi, K.S. (2001), "Architecture-based approach to reliability assessment of software systems", Performance Evaluation, Vol. 45 No.2/3, pp.179-204.

[Manual request] [Infotrieve]

Ross, T.J. (2000), Fuzzy Logic with Engineering Applications, McGraw-Hill, New York, NY, .

[Manual request] [Infotrieve]

Sahner, R., Trivedi, K. (1996), Performance and Reliability Analysis of Computer Systems, Kluwer, Hingham, MA, .

[Manual request] [Infotrieve]

Savoia, M. (2002), "Structural reliability analysis through fuzzy number approach, with application to stability", Computers & Structures, Vol. 80 No.12, pp.1087-102.

[Manual request] [Infotrieve]

Schoenig, R., Aubry, J.F., Cambois, T., Hutinet, T. (2006), "An aggregation method of Markov graphs for reliability analysis of hybrid systems", Reliability Engineering & System Safety, Vol. 91 No.2, pp.137-48.

[Manual request] [Infotrieve]

Sergaki, A., Kalaitzakis, K. (2002), "A fuzzy knowledge based method for maintenance planning in a power system", Reliability Engineering & System Safety, Vol. 77 pp.19-30.

[Manual request] [Infotrieve]

Sharma, R.K., Kumar, D., Kumar, P. (2005b), "Systematic failure mode and effect analysis using fuzzy linguistic modeling", International Journal of Quality & Reliability Management, Vol. 22 No.9, pp.886-1004.

[Manual request] [Infotrieve]

Sharma, R.K., Kumar, D., Kumar, P. (2005c), "FP approach to model and analyze system failure behavior", Proceedings of International Conference on Reliability and Safety Engineering, INCRESE, IIT Khargpur, India, pp.57-63.

[Manual request] [Infotrieve]

Sharma, R.K., Kumar, D., Kumar, P. (2007a), "A framework to implement QCS through process cost modeling", TQM Magazine, Vol. 19 No.1, pp.18-36.

[Manual request] [Infotrieve]

Shorrocks, P., Labib, A.W. (2000), "Towards a multimedia based decision support system for word class maintenance", Proceedings of the 14th ARTS IMechE, University of Manchester, Manchester, .

[Manual request] [Infotrieve]

Sii, H.S., Ruxton, S.H., Wang, J.T. (2001), "A fuzzy logic based approach to qualitative safety modeling for marine systems", Reliability Engineering & System Safety, Vol. 73 pp.19-34.

[Manual request] [Infotrieve]

Singh, C., Dhillion, B.S. (1991), Engineering Reliability: New Techniques and Applications, Wiley, New York, NY, .

[Manual request] [Infotrieve]

Sittithumwat, A., Soudi, F., Tomsovic, K. (2004), "Optimal allocation of distribution maintenance resources with limited information", Electric Power Systems Research, Vol. 68 No.3, pp.208-20.

[Manual request] [Infotrieve]

Siu, O.N., Kelly, D.N. (1998), "Bayesian parameter estimation in probabilistic risk assessment", Reliability Engineering & System Safety, Vol. 62 pp.89-116.

[Manual request] [Infotrieve]

Swanson, L. (1997), "Computerized maintenance management systems: a study of system design and use", Production & Inventory Management Journal, Vol. 34 pp.11-14.

[Manual request] [Infotrieve]

Tanaka, K. (2001), An Introduction to Fuzzy Logic for Practical Applications, Springer, New York, NY, .

[Manual request] [Infotrieve]

Tay, K.M., Lim, C.P. (2006), "Fuzzy FMEA with a guided rules reduction system for prioritization of failures", International Journal of Quality & Reliability Management, Vol. 23 No.8, pp.1047-66.

[Manual request] [Infotrieve]

Vaurio, J.K. (2005), "Uncertainties and quantification of common cause failure rates and probabilities for system analyses", Reliability Engineering & System Safety, Vol. 90 No.2/3, pp.186-95.

[Manual request] [Infotrieve]

Wang, W-Li., Pan, D., Chen, M-H. (2006), "Architecture-based software reliability modeling", Journal of Systems and Software, Vol. 79 pp.132-46.

[Manual request] [Infotrieve]

Xu, K., Tang, L.C., Xie, M. (2002), "Fuzzy assessment of FMEA for engine system", Reliability Engineering & System Safety, Vol. 75 pp.17-29.

[Manual request] [Infotrieve]

Yadav, O., Singh, N., Chinnam, R.B., Goel, P.S. (2003), "A fuzzy logic based approach to reliability improvement estimation during product development", Reliability Engineering & System Safety, Vol. 80 pp.63-74.

[Manual request] [Infotrieve]

Yang, Y.Q., Wang, S.Q., Dulaimi, M., Low, S.P. (2003), "A fuzzy quality function deployment system for build able design decision-makings", Automation in Construction, Vol. 12 No.4, pp.381-93.

[Manual request] [Infotrieve]

Zadeh, A. (1975), "The concept of linguistic variable and its application to approximate reasoning", Information Sciences, Vol. 8 pp.301-57.

[Manual request] [Infotrieve]

Zadeh, L.A. (1996), Fuzzy Sets, Fuzzy Logic, and Fuzzy Systems: Selected Papers, World Scientific, Singapore, .

[Manual request] [Infotrieve]

Zimmermann, H. (1996), Fuzzy Set Theory and its Applications, 3rd ed., Kluwer, London, .

[Manual request] [Infotrieve]

Further Reading

Knezevic, J., Odoom, E.R. (2001), "Reliability modeling of repairable systems using Petri nets and fuzzy lambda tau methodology", Reliability Engineering & System Safety, Vol. 73 No.1, pp.1-17.

[Manual request] [Infotrieve]

Corresponding author

Rajiv Kumar Sharma can be contacted at: rks_iitr@rediffmail.com