Chapter 6. Event management products and best practices 219
Often during a network failure, the path from the NetView server to portions of
the network is broken. Prior to router fault isolation, NetView attempted to poll the
devices in the unreachable part of the network and generated down traps when
they did not answer. This resulted in many segment, node, and interface down
traps, particularly in networks with a large number of nodes on the far sides of
routers. When the failure was corrected, NetView generated numerous up traps
for each device it could again successfully reach.
This plethora of events had several drawbacks:
Increased the difficulty of determining the original cause of the network failure
Slowed network traffic considerably with the large number of status polls to
the occluded area
Created performance problems and unreliable status reports if the events
were forwarded to the IBM Tivoli Enterprise Console and IBM Tivoli
Enterprise Data Warehouse
The RFI function rectifies these problems. When NetView detects a node or
interface is down, RFI first checks the status and accessibility of the router
interfaces connected to the subnet on which the node or interface resides. During
the router check, each interface and its subnet are analyzed. An unresponsive
interface triggers checks of the interface and any connecting routers.
RFI generates appropriate Router Down or Router Marginal traps for conditions
detected. It also simplifies the notification action by issuing one summary alert
identifying the router nearest the fault.
When active, the Router Fault Isolation feature generates the events shown in
Table 6-4 to alert users to important status changes.
Table 6-4 Router fault isolation events
Event Network status
Router Marginal At least one router interface is down. At least one other
interface on that router is up.
Router Down All interfaces are not responding, but at least one connected
subnet is reachable. (The router is not in an occluded region.)
Router Unreachable The network management workstation cannot query the
router because it is an occluded region.
Router Up All the interfaces have responded successfully. This event is
issued on initial discovery and following a recovery from one
or more interfaces being down.