THE PROBLEM OF SYSTEM FAULT-TOLERANCE
System level self-diagnosis (SLSD) has been deeply investigated in literature. It aims at diagnosing systems composed by units, which are required to be able to test each other by exchanging information through available links. The article describes a simplified state-transition diagram model which gives a general impression of how checking, diagnosis and recovery can “conjointly” influence the system reliability and fault-tolerance. The model uses the integrated parameters and is very useful as a starting point and is a basis for further refinements.
SLSD; system reliability
Bianchini, R.; Buskens, R.: An adaptive distributed system-level diagnosis algorithm and its implementation. In the 21st International IEEE Symposium on Fault-Tolerant Computing. New York, 1991, 222-229.
Gostev, V.; Mashkov, V.; Mashkov, O.: Self-diagnosis of modular systems in random performance of elementary tests. Cybernetics and Computing Technology (Discrete Control System). No.105, 1997, 104-111.
Mashkov, V.: Identification of air-borne control computer systems technical state on the basis of cross-testing. Proc. 5th Mini Conf. on Vehicle System Dynamics. Budapest, 1996, 78-88.
Mashkov, V.; Barabash, O.: Self-checking of modular systems under random performance of elementary checks. Engineering Simulation. Vol.12, 1995, 433-445.
Palm, C.: Intersitatsschwankungen in Fernspreshverkehr. Ericsson Technics. 44, 1943.
Preparata, T.; Metze, G.; Chien, R.: On the connection assignment problem of diagnosable system. IEEE Transactions on Electronic Computers. Vol. EC-16, No.12, 1967, 848-854.