
DOFSON—A Distributed Fault Management System for Open System Networks

K Vijayananda, +1 more
01 May 1993, Vol. 39, Iss. 3, pp. 157-164
Abstract
This paper presents a distributed fault management system called DOFSON for open system networks. The proposed model overcomes the inadequacies of existing methodologies for detecting errors and facilitates on-line fault isolation in an operational network. Moreover, this distributed approach to fault management results in reduced network management compared to centralized models. DOFSON is modelled as a set of fault managers and agents residing in all the open systems in the network. Each fault manager is an application process responsible for managing faults in the network. Control is distributed among all the fault managers in the network. Management agents present in all the open systems provide the services required by each fault manager for performing its tasks. Software monitors detect errors and report them to the local fault manager. Faults are diagnosed and isolated by performing a series of diagnostic tests to check the functionality of the suspected components. A model case study discussing the detection...
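
The flow described in the abstract (a software monitor reports an error to its local fault manager, which then runs diagnostic tests on the suspected components) might be sketched roughly as follows. This is only an illustrative sketch, not the paper's implementation: the names `ErrorReport`, `FaultManager`, `diagnose`, and the component names are all hypothetical, and the cooperation between fault managers on different open systems is left out.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class ErrorReport:
    """Hypothetical report that a software monitor sends to its local fault manager."""
    source: str           # monitor that observed the error
    suspects: List[str]   # components suspected of causing the error


@dataclass
class FaultManager:
    """Hypothetical fault manager residing in one open system."""
    name: str
    # Maps a component name to a diagnostic test that returns True
    # when the component is functioning correctly.
    tests: Dict[str, Callable[[], bool]] = field(default_factory=dict)

    def diagnose(self, report: ErrorReport) -> List[str]:
        """Run the diagnostic test for each suspect and return those that fail."""
        faulty = []
        for component in report.suspects:
            test = self.tests.get(component)
            # Components without a registered test are skipped in this sketch.
            if test is not None and not test():
                faulty.append(component)
        return faulty


# Assumed example setup: two components, one with a simulated fault.
manager = FaultManager(
    name="fm-A",
    tests={
        "transport": lambda: True,
        "session": lambda: False,  # simulated faulty component
    },
)
report = ErrorReport(source="monitor-1", suspects=["transport", "session"])
print(manager.diagnose(report))
```

In this sketch, isolation simply means returning the suspects whose tests fail; a real system would also need the management agents and inter-manager coordination that the abstract mentions.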

