OcNOS-DC : System Management Guide : System Management Configuration Guide : Software Monitoring and Reporting
Software Monitoring and Reporting
Overview
OcNOS provides a mechanism (called “watchdogging”) to monitor all OcNOS modules and provides the following functions.
1. Periodic heart beat check.
2. Automatic restarts of a module upon a hung state or crash detection.
3. Upon hanging or crashing of a module, a crash report (including system states) is logged.
4. A proprietary SNMP trap is sent to the trap manager, if configured, after a fault is detected in a protocol module. Similarly a trap is sent when the module recovers.
By default, the software watchdog is enabled and the keep-alive time interval is 60 seconds. All OcNOS processes periodically send keep-alive messages to a monitoring module at the configured keep-alive time interval.
This functionality can be disabled for a particular module or all OcNOS modules by using CLI commands. In order to permanently disable software monitoring functionality, the user has to disable the watchdog feature. If, however, software watchdogging is disabled the monitoring module doesn’t take any action upon a hang or crash of any OcNOS module.
Configuration
 
#configure terminal
Enter configure mode.
(config)#feature software-watchdog
Enable software watchdog for all OcNOS modules — This is the default.
(config)#no software-watchdog imi
To disable software watchdog for only imi modules.
(config)#software-watchdog keep-alive-time 100
The keep-alive time interval in seconds. Default is 60 seconds and applies to all OcNOS modules.
#show software-watchdog status
Display the keep-alive time interval and list of OcNOS process names with watchdog status for each OcNOS modules.
Validation
#show software-watchdog status
Software Watchdog timeout in seconds : 100
Process name Watchdog status
============ ===============
nsm Enabled
ripd Enabled
ospfd Enabled
isisd Enabled
hostpd Enabled
ldpd Enabled
rsvpd Enabled
mribd Enabled
pimd Enabled
authd Enabled
mstpd Enabled
imi Disabled
onmd Enabled
HSL Enabled
oam Enabled
vlogd Enabled
vrrpd Enabled
ndd Enabled
ribd Enabled
bgpd Enabled
l2mribd Enabled
lagd Enabled
sflow Enabled