共查询到20条相似文献,搜索用时 31 毫秒
1.
Failure Detection and Diagnosis (FDD) using Discrete Event System (DES) framework is used for wide range of applications because of simplicity of both the model and associated algorithms. Initial research was focussed on permanent failures. Many systems exhibit temporary failures in the sense that system recovers to normal condition after failure. Contant et al. extended the event based DES framework, developed for permanent failure by Sampath et al. to handle temporary failures. Zad et al. developed a state based DES framework for permanent failures which has several advantages compared to the event based approach. In this paper, we extend the state based DES framework, developed for permanent failures, to handle temporary failures, maintaining the same order of complexity. The proposed DES framework has several advantages compared to that by Contant et al. namely, capability to detect failures that occur before starting execution of the diagnoser, same diagnoser for both temporary and permanent failures, etc. 相似文献
2.
E. García Moreno A. Correcher Salvador F. Morant Anglada E. Quiles Cucarella R. Blasco Giménez 《Discrete Event Dynamic Systems》2006,16(3):311-326
This paper studies modular decomposition as an approach for failure diagnosis based on Discrete Event Systems. This paper
also analyses the problem of coupling produced in the implementation of centralized modular diagnosers, as coupled diagnosers
cannot carry out their own diagnosis task, when there is a failure in another subsystem sharing a common energy or material
flow. In addition, we propose a method to avoid diagnoser coupling, by means of decoupling functions using non-local information
with respect to the coupled diagnoser and generated in the diagnoser where the failure has been isolated. 相似文献
3.
Hashtrudi Zad S. Kwong R.H. Wonham W.M. 《Automatic Control, IEEE Transactions on》2003,48(7):1199-1212
A state-based approach for online passive fault diagnosis in systems modeled as finite-state automata is presented. In this framework, the system and the diagnoser (the fault detection system) do not have to be initialized at the same time. Furthermore, no information about the state or even the condition (failure status) of the system before the initiation of diagnosis is required. The design of the fault detection system, in the worst case, has exponential complexity. A model reduction scheme with polynomial time complexity is introduced to reduce the computational complexity of the design. Diagnosability of failures is studied, and necessary and sufficient conditions for failure diagnosability are derived. 相似文献
4.
Diagnosability of discrete-event systems 总被引:8,自引:0,他引:8
Sampath M. Sengupta R. Lafortune S. Sinnamohideen K. Teneketzis D. 《Automatic Control, IEEE Transactions on》1995,40(9):1555-1575
Fault detection and isolation is a crucial and challenging task in the automatic control of large complex systems. We propose a discrete-event system (DES) approach to the problem of failure diagnosis. We introduce two related notions of diagnosability of DES's in the framework of formal languages and compare diagnosability with the related notions of observability and invertibility. We present a systematic procedure for detection and isolation of failure events using diagnosers and provide necessary and sufficient conditions for a language to be diagnosable. The diagnoser performs diagnostics using online observations of the system behavior; it is also used to state and verify off-line the necessary and sufficient conditions for diagnosability. These conditions are stated on the diagnoser or variations thereof. The approach to failure diagnosis presented in this paper is applicable to systems that fall naturally in the class of DES's; moreover, for the purpose of diagnosis, most continuous variable dynamic systems can be viewed as DES's at a higher level of abstraction 相似文献
5.
An awareness of failure type and location is an indispensable requirement for the establishment of adequate recovery strategies and the maintenance of Factory Automation and Process Control systems.The failure diagnosis methodology presented in this paper is based on Discrete Event Systems models and in the diagnoser concept, which enable the off-line and on-line analysis of systems failures. We present an approach for models and associated diagnosers based on a modular decomposition of the global system, with the aim of avoiding problems of exponential explosion in the number of states and computational complexity of the diagnosis process. 相似文献
6.
7.
Shengbing Jiang Zhongdong Huang Chandra V. Kumar R. 《Automatic Control, IEEE Transactions on》2001,46(8):1318-1321
Failure diagnosis in large and complex systems is a critical task. In the realm of discrete-event systems, Sampath et al. (1995) proposed a language based failure diagnosis approach. They introduced the diagnosability for discrete-event systems and gave a method for testing the diagnosability by first constructing a diagnoser for the system. The complexity of this method of testing diagnosability is exponential in the number of states of the system and doubly exponential in the number of failure types. We give an algorithm for testing diagnosability that does not construct a diagnoser for the system, and its complexity is of fourth order in the number of states of the system and linear in the number of the failure types 相似文献
8.
Active diagnosis of discrete-event systems 总被引:3,自引:0,他引:3
The need for accurate and timely diagnosis of system failures and the advantages of automated diagnostic systems are well appreciated. However, diagnosability considerations are often not explicitly taken into account in the system design. In particular, design of the controller and that of the diagnostic subsystem are decoupled, and this may significantly affect the diagnosability properties of a system. The authors present an integrated approach to control and diagnosis. More specifically, they present an approach for the design of diagnosable systems by appropriate design of the system controller. This problem, which they refer to as the active diagnosis problem, is studied in the framework of discrete-event systems (DESs); it is based on prior and new results on the theory of diagnosis for DESs and on existing results in supervisory control under partial observations. They formulate the active diagnosis problem as a supervisory control problem where the legal language is an “appropriate” regular sublanguage of the regular language generated by the system. They present an iterative procedure for determining the supremal controllable, observable, and diagnosable sublanguage of the legal language and for obtaining the supervisor that synthesizes this language. This procedure provides both a controller that ensures diagnosability of the closed-loop system and a diagnoser for online failure diagnosis. The procedure can be implemented using finite-state machines and is guaranteed to converge in a finite number of iterations. The authors illustrate their approach using a simple pump-valve system 相似文献
9.
Shigemasa Takai 《Automatica》2012,48(8):1913-1919
In this paper, we study robust failure diagnosis of discrete event systems. Given a set of possible models, each of which has its own nonfailure specification, we consider the existence of a single diagnoser such that, for all possible models, it detects any occurrence of a failure within a uniformly bounded number of steps. We call such a diagnoser a robust diagnoser. We introduce a notion of robust diagnosability, and prove that it serves as a necessary and sufficient condition for the existence of a robust diagnoser. We then present an algorithm for verifying the robust diagnosability condition. 相似文献
10.
11.
Identification and prioritization of failure modes in a system and planning for corrective actions are among the most important components of risk management in any organization. Meanwhile, conventional Failure Mode and Effects Analysis (FMEA) is one of the most commonly used methods for prioritization of the failures. Despite the widespread applications of this method in various industries, FMEA is associated with some shortcomings that can lead to unrealistic results. In this study, a proposed approach is presented in three phases to cover some of the shortcomings of the FMEA technique. In the first phase, FMEA is used to identify the failure modes and assign values to the Risk Priority Number (RPN) determinant factors. In the second phase, the Fuzzy Best-Worst Method (FBWM) based on the experts’ opinions is used to measure the weights of these factors. In the third phase, the outputs of the previous phases are used as a basis to prioritize the failures using the proposed Multi-Objective Optimization by Ratio Analysis based on the Z-number theory (Z-MOORA). In addition to assigning different weights to the RPN determinant factors and considering uncertainties of them, the Z-number theory is used in this approach to cover reliability in different failure modes. The proposed approach was implemented in the automotive spare parts industry, and the results indicate a full prioritization of the failures in comparison with other conventional methods such as FMEA and fuzzy MOORA. 相似文献
12.
In this paper, we consider distributed systems that can be modeled as finite state machines with known behavior under fault-free conditions, and we study the detection of a general class of faults that manifest themselves as permanent changes in the next-state transition functionality of the system. This scenario could arise in a variety of situations encountered in communication networks, including faults occurred due to design or implementation errors during the execution of communication protocols. In our approach, fault diagnosis is performed by an external observer/diagnoser that functions as a finite state machine and which has access to the input sequence applied to the system but has only limited access to the system state or output. In particular, we assume that the observer/diagnoser is only able to obtain partial information regarding the state of the given system at intermittent time intervals that are determined by certain synchronizing conditions between the system and the observer/diagnoser. By adopting a probabilistic framework, we analyze ways to optimally choose these synchronizing conditions and develop adaptive strategies that achieve a low probability of aliasing, i.e., a low probability that the external observer/diagnoser incorrectly declares the system as fault-free. An application of these ideas in the context of protocol testing/classification is provided as an example. 相似文献
13.
研究含有测量时滞的线性离散系统的故障诊断问题,提出一种测量时滞的无时滞转换方法和基于降维状态观测器而不利用残差体现故障的故障诊断方法.首先通过构造一个含有故障状态的增广系统和进行测量时滞的无时滞转换,将时滞系统的故障诊断问题转化为无时滞增广系统的状态观测问题;然后给出了其诊断误差能按预先指定的指数速率趋于零的故障诊断器的设计方法.仿真算例验证了该方法的可行性和有效性. 相似文献
14.
In this paper, we study the fault diagnosis problem for distributed discrete event systems. The model assumes that the system
is composed of distributed components which are modeled in labeled Petri nets and interact with each other via sets of common
resources (places). Further, a component’s own access to a common resource is an observable event. Based on the diagnoser
approach proposed by Sampath et al., a distributed fault diagnosis algorithm with communication is presented. The distributed
algorithm assumes that the local diagnosis process can exchange messages upon the occurrence of observable events. We prove
the distributed diagnosis algorithm is correct in the sense that it recovers the same diagnostic information as the centralized
diagnosis algorithm. Furthermore, we introduce the ordered binary decision diagrams (OBDD) in order to manage the state explosion
problem in state estimation of the system. 相似文献
15.
The problems of fault diagnosis and fault‐tolerant control are considered for systems with measurement delays. In contrast to the present fault diagnosis and fault‐tolerant control approaches, which consider only the input delay and/or state delay, the main contribution of this paper consists of proposing a new observer‐based reduced‐order fault diagnoser construction approach and a design approach to dynamic self‐restore fault‐tolerant control law for systems with measurement delays. First, the time‐delay system is transformed into a delay‐free system in form by a special functional‐based delay‐free transformation approach for measurement delays. Then, the fault diagnosis is realized online via the proposed reduced‐order fault diagnoser. Using the results of fault diagnosis, two dynamic self‐restore control laws are designed to make the system isolated from faults. A numerical example demonstrates the feasibility and validity of the proposed scheme. © 2012 John Wiley and Sons Asia Pte Ltd and Chinese Automatic Control Society 相似文献
16.
Thomas J. Hacker Fabian Romero Christopher D. Carothers 《Journal of Parallel and Distributed Computing》2009
Large supercomputers are built today using thousands of commodity components, and suffer from poor reliability due to frequent component failures. The characteristics of failure observed on large-scale systems differ from smaller scale systems studied in the past. One striking difference is that system events are clustered temporally and spatially, which complicates failure analysis and application design. Developing a clear understanding of failures for large-scale systems is a critical step in building more reliable systems and applications that can better tolerate and recover from failures. In this paper, we analyze the event logs of two large IBM Blue Gene systems, statistically characterize system failures, present a model for predicting the probability of node failure, and assess the effects of differing rates of failure on job failures for large-scale systems. The work presented in this paper will be useful for developers and designers seeking to deploy efficient and reliable petascale systems. 相似文献
17.
Rajiv Kumar Sharma Dinesh Kumar Pradeep Kumar 《International journal of systems science》2013,44(6):563-581
The main objective of the article is to permit the reliability analyst's/engineers/managers/practitioners to analyze the failure behavior of a system in a more consistent and logical manner. To this effect, the authors propose a methodological and structured framework, which makes use of both qualitative and quantitative techniques for risk and reliability analysis of the system. The framework has been applied to model and analyze a complex industrial system from a paper mill. In the quantitative framework, after developing the Petrinet model of the system, the fuzzy synthesis of failure and repair data (using fuzzy arithmetic operations) has been done. Various system parameters of managerial importance such as repair time, failure rate, mean time between failures, availability, and expected number of failures are computed to quantify the behavior in terms of fuzzy, crisp and defuzzified values. Further, to improve upon the reliability and maintainability characteristics of the system, in depth qualitative analysis of systems is carried out using failure mode and effect analysis (FMEA) by listing out all possible failure modes, their causes and effect on system performance. To address the limitations of traditional FMEA method based on risky priority number score, a risk ranking approach based on fuzzy and Grey relational analysis is proposed to prioritize failure causes. 相似文献
18.
Bahman Javadi Derrick Kondo Alexandru Iosup Dick Epema 《Journal of Parallel and Distributed Computing》2013
With the increasing presence, scale, and complexity of distributed systems, resource failures are becoming an important and practical topic of computer science research. While numerous failure models and failure-aware algorithms exist, their comparison has been hampered by the lack of public failure data sets and data processing tools. To facilitate the design, validation, and comparison of fault-tolerant models and algorithms, we have created the Failure Trace Archive (FTA)—an online, public repository of failure traces collected from diverse parallel and distributed systems. In this work, we first describe the design of the archive, in particular of the standard FTA data format, and the design of a toolbox that facilitates automated analysis of trace data sets. We also discuss the use of the FTA for various current and future purposes. Second, after applying the toolbox to nine failure traces collected from distributed systems used in various application domains (e.g., HPC, Internet operation, and various online applications), we present a comparative analysis of failures in various distributed systems. Our analysis presents various statistical insights and typical statistical modeling results for the availability of individual resources in various distributed systems. The analysis results underline the need for public availability of trace data from different distributed systems. Last, we show how different interpretations of the meaning of failure data can result in different conclusions for failure modeling and job scheduling in distributed systems. Our results for different interpretations show evidence that there may be a need for further revisiting existing failure-aware algorithms, when applied for general rather than for domain-specific distributed systems. 相似文献
19.
20.
Complex engineering systems have to be carefully monitored to meet demanding performance requirements, including detecting anomalies in their operations. There are two major monitoring challenges for these systems. The first challenge is that information collected from the monitored system is often partial and/or unreliable, in the sense that some occurred events may not be reported and/or may be reported incorrectly (e.g., reported as another event). The second is that anomalies often consist of sequences of event patterns separated in space and time. This paper introduces and analyzes a diagnoser algorithm that meets these challenges for detecting and counting occurrences of anomalies in engineering systems. The proposed diagnoser algorithm assumes that models are available for characterizing plant operations (via stochastic automata) and sensors (via probabilistic mappings) used for reporting partial and unreliable information. Methods for analyzing the effects of model uncertainties on the diagnoser performance are also discussed. In order to select configurations that reduce sensor costs, while satisfying diagnoser performance requirements, a sensor configuration selection algorithm developed in previous work is then extended for the proposed diagnoser algorithm. The proposed algorithms and methods are then applied to a multi-unit-operation system, which is derived from an actual facility application. Results show that the proposed diagnoser algorithm is able to detect and count occurrences of anomalies accurately and that its performance is robust to model uncertainties. Furthermore, the sensor configuration selection algorithm is able to suggest optimal sensor configurations with significantly reduced costs, while still yielding acceptable performance for counting the occurrences of anomalies. 相似文献