Experiences Analyzing Faults in a Hybrid Distributed System with Access Only to Sanitized Data


In this paper we report on a work in progress assessing the faults observed and reported in a distributed, safety-critical, largely embedded system with both electrical and mechanical components. We illustrate why standard software testing techniques are not sufficient and indicate some of the technical and non-technical problems encountered in examining the faults and the initial results obtained. While the application domain is elevator operation, the techniques described here are general enough to apply to many other domains. Much of the data analyzed here would be considered imprecise in the software industry if it were used in software testing or to help increase fault tolerance. The paper includes a discussion of the use of multiple views of data, assessment of missing data, and analysis of informal information to produce its conclusions about fault avoidance and fault tolerance.

Share and Cite:

R. Leach, "Experiences Analyzing Faults in a Hybrid Distributed System with Access Only to Sanitized Data," Journal of Software Engineering and Applications, Vol. 3 No. 5, 2010, pp. 446-454. doi: 10.4236/jsea.2010.35050.

Conflicts of Interest

The authors declare no conflicts of interest.


[1] Unnamed elevator company, Unnamed Service Database, 2008.
[2] A. Avizienis and J. P. Kelly, “Fault Tolerance by Design Diversity: Concepts and Experiments,” IEEE Computer, Vol. 17, No. 8, August 1984, pp. 67-80.
[3] B. Randell, “System Structure for Software Fault Tole- rance,” IEEE Transactions on Software Engineering, Vol. 11, No. 2, June 1975, pp. 220-232.
[4] R. Amuthakkannan, S. M. Kannan, K. Vijayalakshmi and N. Ramaraj, “Reliability Analysis of Programmable Mechatronics System Using Bayesian Approach,” Intern- ational Journal of Industrial and Systems Engineering, Vol. 4, No. 3, 2009, pp. 303-325.
[5] V. Dhudsia, “Guidelines for Equipment Reliability,” Technical Publication, Sematech, Inc, 1997. http://www. sematech.org/docubase/document/1014agen.pdf
[6] G. K. Fourlas, “An Approach towards Fault Tolerant Hybrid Control Systems,” Control & Automation Mediter- ranean Conference on MED, Corsica, 27-29 June 2007, pp. 1-6.
[7] J. D. Musa, A. Iannino and K. Okumoto, “Software Relia- bility: Measurement, Prediction, Application,” Mc-Graw- Hill, Inc. New York, 1987.
[8] R. Isermann, “Mechatronic Systems Fundamentals,” Springer, London. 2003.
[9] K. Lee, K. C. Kang, E. Koh, W. Chae, B. Kim and B. W. Choi, “Domain-Oriented Engineering of Elevator Control Software: A Product Line Practice,” Proceedings of the First Software Product Line Conference, Denver, August 2000, pp. 3-22.
[10] “Facility System Safety Guidebook,” NASA-STD-8719.7, National Aeronautics and Space Administration, 1996.
[11] “The use of Computers in Safety Critical Operations,” Final Report of the Study Group on the Safety of Ope- rational Computer Operations, Health and Safety Commission, UK. http://www.hse.gov.uk/nuclear/compu ters.pdf
[12] N. Leveson, “Software Safety: Why, What, and How,” ACM Computing Surveys, Vol. 18, No. 2, June 1986, pp. 125-163.
[13] D. E. Knuth, “Fundamental Algorithms,” The Art of Com- puter Programming, 3rd Edition, Addison-Wesley, Read- ing, Massachusetts, Vol. 1, 1973.

Copyright © 2020 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.