Abstract
With the proliferation of novel paradigms in distributed systems, including service-oriented computing, ubiquitous computing or self-organizing systems, an efficient distributed management system needs to work effectively even in face of incomplete management information, uncertain situations and dynamic changes. In this paper, Bayesian networks are proposed to model dependencies between managed objects in distributed systems management. Based on probabilistic backward inference mechanisms the so-called Strongest Dependency Route (SDR) algorithm is used to compute the set of most probable faults that may have caused an error or failure.
Chapter PDF
Similar content being viewed by others
References
K. Basye, T. Dean, J. S. Vitter. Coping with Uncertainty in Map Learning. Machine Learning 29(1): 65–88, 1997.
E. Charniak, R. P. Goldman. A Semantics for Probabilistic Quantifier-Free First-Order Languages, with Particular Application to Story Understanding. Proceedings of IJCAI-89, pp1074–1079, Morgan-Kaufmann, 1989.
G. Cooper. Computational complexity of probabilistic inference using Bayesian belief networks. Artificial Intelligence, 42:393–405, 1990.
C. Chao, D. Yang, A. Liu. A LAN fault diagnosis system. Computer Communications. Vol 24, pp1439–1451, 2001.
R. H. Deng, A. A. Lazar, and W. Wang. A probabilistic Approach to Fault Diagnosis in Linear Lightwave Networks. IEEE Journal on Selected Areas in Communications, Vol. 11, no. 9, pp. 1438–1448, December 1993.
M. Frontini, J. Griffin, and S. Towers. A knowledge-based system for fault localization in wide area networks. In Integrated Network Management, II. Amsterdam: North-Holland, 519–530, 1990.
J. Gao, G. Kar, P. Kermani. Approaches to Building Self Healing Systems using Dependency Analysis. IEEE/IFIP NOMS’04, April, 2004.
M. Gupta, A. Neogi, M. K. Agarwal and G. Kar. Discovering Dynamic Dependencies in Enterprise Environments for Problem Determination. DSOM 2003, LNCS 2867, ISBN 3-540-20314-1, pp221–233, 2003
C. S. Hood and C. Ji. Proactive network-fault detection. IEEE Transactions on Reliabilify, 46(3):333–341, September 1997.
D. Heckerman and M. P. Wellman. Bayesian networks. Communications of the ACM, 38(3):27–30, Mar. 1995.
C. A. Joseph, A. Sherzer, K. Muralidhar. Knowledge based fault management for OSI networks. Proceedings of 3 th International conference on Industrial and engineering applications of artificial intelligence and expert systems, pp. 61–69, June 1990.
A. Keller, U. Blumenthal, G. Kar. Classification and Computation of Dependencies for Distributed Management. Proceedings of 5 th IEEE Symposium on Computers and Communications. Antibes-Juan-les-Pins, France, July 2000.
I. Katzela and M. Schwarz. Schemes for fault identification in communication networks. IEEE Transactions on Networking, 3(6): 733–764, 1995.
F. L. Koch, and C. B. Westphall. Decentralized network management using distributed artificial intelligence. Journal of Network and Systems Management, Vol. 9, No. 4, Dec. 2001.
S. Klinger, S. Yemini, Y. Yemini, D. Ohsie, S. Stolfo. A coding approach to event correlation. Proceedings of the fourth international symposium on Integrated network management IV, p.266–277, January 1995.
C. Lo, S. H. Chen, B. Lin. Coding-based schemes for fault identification in communication networks. Int. Journal of Network Management 10(3): 157–164, 2000.
L. Lewis. A case-based reasoning approach to the resolution of faults in communication networks. In Integrated Network Management, III, 671–682. Elsevier Science Publishers B.V., Amsterdam, 1993.
S. L. Lauritzen and D. J. Spiegelhalter. Local computations with probabilities on graphical structures and their application to expert systems. Journal of the Royal Statistical Society, Series B 50:157–224, 1988.
R. M. Neal, Probabilistic inference using Markov chain Monte Carlo methods. Tech. Rep. CRG-TR93-1, University of Toronto, Department of Computer Science, 1993.
D. Nikovski. Constructing Bayesian networks for medical diagnosis from incomplete and partially correct statistics. IEEE Transactions on Knowledge and Data Engineering, Vol. 12, No. 4, pp. 509–516, July 2000.
J. Pearl. A constraint-propagation approach to probabilistic reasoning. Uncertainty in Artificial Intelligence. North-Holland, Amsterdam, pp357–369, 1986.
The International Engineering Consortium. Highly available embedded computer platforms become reality. http://www.iec.org/online/tutorials/acrobat/ha_embed.pdf.
H. J. Suermondt and G. F. Cooper. Probabilistic inference in multiply connected belief network using loop cutsets. International Journal of Approximate Reasoning, vol. 4, 1990, pp. 283–306.
M. Steinder and A. S. Sethi. Non-deterministic diagnosis of end-to-end service failures in a multi-layer communication system. In Proc. of ICCCN, Scottsdale, AR, 2001. pp. 374–379.
A. S. Weigend, and N. A. Gershenfeld. Time Series Prediction. Addison-Wesley, 1994.
W. Wiegerinck, H.J. Kappen, E.W.M.T ter Braak, W.J.P.P ter Burg, M.J. Nijman. Approximate inference for medical diagnosis. Pattern Recognition Letters, 20:1231–1239, 1999.
C. Wang, M. Schwartz. Fault detection with multiple observers. IEEE/ACM transactions on Networking. 1993; Vol 1: pp48–55
S. A. Yemini, S. Kliger, E. Mozes, Y. Yemini, and D. Ohsie. High speed and robust event correlation. IEEE Communications Magazine, 34(5):82–90, 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 International Federation for Information Processing
About this paper
Cite this paper
Ding, J., Krämer, B., Bai, Y., Chen, H. (2005). An Inference Algorithm for Probabilistic Fault Management in Distributed Systems. In: Gaïti, D., Galmés, S., Puigjaner, R. (eds) Network Control and Engineering for QoS, Security and Mobility, III. NetCon 2004. IFIP International Federation for Information Processing, vol 165. Springer, Boston, MA. https://doi.org/10.1007/0-387-23198-6_15
Download citation
DOI: https://doi.org/10.1007/0-387-23198-6_15
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-23197-6
Online ISBN: 978-0-387-23198-3
eBook Packages: Computer ScienceComputer Science (R0)