Alarm Manager
  • 21 Aug 2024

Overview

Alarm Manager is the fault management function that detects faults on Network Elements. It sends the resulting alarm notifications to a fault management system.

With the Alarm Manager feature, you can:

  • Receive events (see table below) indicating changes in state

  • Forward events related to the creation, destruction and state change of Network Interfaces to the Alarm Manager

  • Forward events related to the creation, destruction and state change of MEP objects to the Alarm Manager

  • Forward events related to the creation, destruction and state change of Platform Monitor Alarms

  • Set alarm notifications for SNMP traps, LEDs, and Syslog

  • Import or export Alarm Manager configurations

  • Retrieve alarm summary and alarm inventory reports

  • Retain a limited history log of alarm occurrences

  • Manage alarms, their reporting state, and their severity through CLI, NETCONF, or SNMP

Terms and Definitions

This table lists terms and definitions related to the Alarm Manager:

| Term | Definition |
|---|---|
| Alarm | An alarm signifies an undesirable state in a resource that requires corrective action. |
| Alarm Instance | The alarm state for a specific resource and alarm type; for example, ("GigabitEthernet0/15", "link-alarm"). An entry in the alarm list. |
| Alarm Inventory | A list of all possible alarm types on a system. |
| Alarm Type | An alarm type identifies a possible unique alarm state for a resource. Alarm types are names to identify the state such as "link-alarm", "jitter violation", and "high-disk-utilization". |
| Event | Signals a change of state of an entity. |
| Fault | A persistent abnormal condition affecting a device or a group of devices in a network. A fault is the underlying cause of an undesired behavior. There is no trivial one-to-one mapping between faults and alarms. One fault may result in several alarms in case the system lacks root-cause and correlation capabilities. An alarm might not have an underlying fault as a cause. |
| Fault Management | A set of functions, which enables the detection, isolation and correction of abnormal operation of the telecommunication network and its environment. |
| Resource | A fine-grained identification of the alarming resource such as, for example, an interface and a process. Typically, a resource is a managed object within the system. |
| Resource Address | Logical or physical address, within the system, of the Resource. |
| Standing Condition | See Alarm Type. Standing Condition is used as a synonym of Alarm Type in this text. |
| Transient Condition | A detected fault condition whose duration is not long enough to be considered a Standing Condition. |
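
The Alarm Instance definition, a (resource, alarm type) pair that forms one entry in the alarm list, can be illustrated with a small data structure. This is a sketch only: the names `AlarmInstance`, `AlarmList`, `raise_alarm`, and `clear_alarm` are hypothetical and are not part of the Alarm Manager interface, and the severity strings are placeholders.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass(frozen=True)
class AlarmInstance:
    """One alarm list entry: the alarm state for a (resource, alarm type) pair."""
    resource: str    # e.g. "GigabitEthernet0/15"
    alarm_type: str  # e.g. "link-alarm"


@dataclass
class AlarmList:
    """Hypothetical alarm list keyed by alarm instance (illustration only)."""
    active: Dict[AlarmInstance, str] = field(default_factory=dict)

    def raise_alarm(self, resource: str, alarm_type: str, severity: str) -> None:
        # Raising the same (resource, alarm type) again updates the existing
        # entry instead of creating a duplicate.
        self.active[AlarmInstance(resource, alarm_type)] = severity

    def clear_alarm(self, resource: str, alarm_type: str) -> None:
        # Clearing an alarm that is not active is a no-op.
        self.active.pop(AlarmInstance(resource, alarm_type), None)


alarms = AlarmList()
alarms.raise_alarm("GigabitEthernet0/15", "link-alarm", "major")
alarms.raise_alarm("GigabitEthernet0/15", "link-alarm", "major")
alarms.raise_alarm("GigabitEthernet0/15", "high-disk-utilization", "minor")
```

Because the key is the pair rather than the resource alone, one resource can hold several alarm instances at once, one per alarm type.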

Alarm Types

The following tables provide a description of all reported standing conditions (alarm types) and condition changes.

Platform Monitor Alarms

| Row | Condition | Description | Filtering Expected | Reporting* | Severity* | Service Affecting* |
|---|---|---|---|---|---|---|
| 1 | Equipment not present | The hardware resource is not present or unreachable | Yes | Disabled | Major | No |
| 2 | Stall alarm | The FAN speed is below the manufacturer's recommended number of RPM | Yes | Disabled | Major | No |
| 3 | Aging-alarm | The FAN speed is below the manufacturer's thresholds | Yes | Disabled | Minor | No |
| 4 | High Voltage Alarm | The input or output voltage exceeds the manufacturer's established threshold | Yes | Disabled | Minor | No |
| 5 | Low Voltage Alarm | The input or output voltage is below the manufacturer's established threshold | Yes | Disabled | Minor | No |
| 6 | Temperature Fault Alarm | Excessive temperature conditions caused by loss of fan cooling or excessive ambient temperature | Yes | Disabled | Major | No |
| 7 | Power Fault Alarm | Power supply failure | Yes | Disabled | Major | No |
| 8 | High Temperature Alarm | Temperature exceeding the high temperature threshold | Yes | Disabled | Critical (Note: For TRX, Critical means Major.) | No |
| 9 | Low Temperature Alarm | Temperature below the low temperature threshold | Yes | Disabled | Major (Note: For TRX, Major means Minor.) | No |
| 10 | High Temperature Warning | Temperature exceeding the high warning temperature threshold | Yes | Disabled | Minor | No |
| 11 | Low Temperature Warning | Temperature below the low warning temperature threshold | Yes | Disabled | Minor | No |
| 12 | High Voltage Warning | The input or output voltage exceeds the manufacturer's established warning threshold | Yes | Disabled | Minor | No |
| 13 | Low Voltage Warning | The input or output voltage is below the manufacturer's established warning threshold | Yes | Disabled | Minor | No |
| 14 | High Tx Bias Current Alarm | The Tx Bias Current exceeds the manufacturer's defined threshold | Yes | Disabled | Minor | No |
| 15 | Low Tx Bias Current Alarm | The Tx Bias Current is below the manufacturer's defined threshold | Yes | Disabled | Minor | No |
| 16 | High Tx Bias Current Warning | The Tx Bias Current exceeds the manufacturer's warning threshold | Yes | Disabled | Minor | No |
| 17 | Low Tx Bias Current Warning | The Tx Bias Current is below the manufacturer's warning threshold | Yes | Disabled | Minor | No |
| 18 | High TX Power Alarm | The Optical Tx Power exceeds the manufacturer's defined threshold | Yes | Disabled | Minor | No |
| 19 | Low TX Power Alarm | The Optical Tx Power is below the manufacturer's defined threshold | Yes | Disabled | Minor | No |
| 20 | High TX Power Warning | The Optical Tx Power exceeds the manufacturer's warning threshold | Yes | Disabled | Minor | No |
| 21 | Low TX Power Warning | The Optical Tx Power is below the manufacturer's warning threshold | Yes | Disabled | Minor | No |
| 22 | High RX Power Alarm | The Optical Rx Power exceeds the manufacturer's defined threshold | Yes | Disabled | Minor | No |
| 23 | Low RX Power Alarm | The Optical Rx Power is below the manufacturer's defined threshold | Yes | Disabled | Minor | No |
| 24 | High RX Power Warning | The Optical Rx Power exceeds the manufacturer's warning threshold | Yes | Disabled | Minor | No |
| 25 | Low RX Power Warning | The Optical Rx Power is below the manufacturer's warning threshold | Yes | Disabled | Minor | No |

*Default configuration
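
The two TRX notes on rows 8 and 9 mean that the default severity of a temperature alarm depends on the kind of resource reporting it. The following Python sketch shows one way to express that lookup. It is an illustration only: the function `default_severity` and the `is_trx` flag are hypothetical names, and how a resource is recognized as TRX is left to the caller, since the table does not define it.

```python
# Default severities copied from rows 8 to 11 of the Platform Monitor Alarms table.
DEFAULT_SEVERITY = {
    "High Temperature Alarm": "Critical",
    "Low Temperature Alarm": "Major",
    "High Temperature Warning": "Minor",
    "Low Temperature Warning": "Minor",
}

# Per-condition overrides taken from the two TRX notes.
TRX_SEVERITY = {
    "High Temperature Alarm": "Major",  # "For TRX, Critical means Major."
    "Low Temperature Alarm": "Minor",   # "For TRX, Major means Minor."
}


def default_severity(condition: str, is_trx: bool = False) -> str:
    """Return the default severity for a temperature condition.

    `is_trx` is a hypothetical flag supplied by the caller; conditions
    without a TRX note keep their table default.
    """
    if is_trx and condition in TRX_SEVERITY:
        return TRX_SEVERITY[condition]
    return DEFAULT_SEVERITY[condition]
```

For example, `default_severity("High Temperature Alarm", is_trx=True)` returns `"Major"`, while the same call without the flag returns `"Critical"`.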

These Event Types are generated by the platform's management system:

  • Network Interface add

  • Network Interface delete

  • MEP add

  • MEP delete

Resource List

| Resource Address | Resource Description | Conditions being monitored (and their row number) |
|---|---|---|
| Board[id]/Fan-tray[id] | Fan; mechanical air cooling device | Equipment not present (1), Stall alarm (2), Aging-alarm (3) |
| Board[id]/PSU-tray[id] OR Board[id]/PSU-tray[id]/Feed[id] | Power Supply Unit (PSU). Note: Element Feed[id] reflects the specific case of Assurance Sensor F25 AC (1 PSU with 1 feed) and Assurance Sensor F25 DC (1 PSU with 2 feeds). | Equipment not present (1), High Voltage Alarm (4), Low Voltage Alarm (5), Temperature Fault Alarm (6), Power Fault Alarm (7) |
| Board[id]/HwCompFpga/TempSensor[id] | Field-Programmable Gate Array (FPGA) component in F100 and F25 | Equipment not present (1), High Temperature Alarm (8), Low Temperature Alarm (9), High Temperature Warning (10), Low Temperature Warning (11) |
| Board[id]/HwCompAsic/TempSensor[id] | Application-Specific Integrated Circuit (ASIC) in F100 and F25 | Equipment not present (1), High Temperature Alarm (8), Low Temperature Alarm (9), High Temperature Warning (10), Low Temperature Warning (11) |
| Board[id]/HwCompCpu/TempSensor[id] | Central Processing Unit (CPU) in F100 and F25 | Equipment not present (1), High Temperature Alarm (8), Low Temperature Alarm (9), High Temperature Warning (10), Low Temperature Warning (11) |
| Board[id]/TransceiverSlot[id] OR Board[id]/TransceiverSlot[id]/Lane[id] | Transceiver; optical or copper pluggable device used to provide Ethernet connectivity. Note: Element Lane[id] reflects 100G transceivers, which split alarm information into four lanes (alarm rows 14 to 25). | Equipment not present (1), High Voltage Alarm (4), Low Voltage Alarm (5), High Temperature Alarm (8), Low Temperature Alarm (9), High Temperature Warning (10), Low Temperature Warning (11), High Voltage Warning (12), Low Voltage Warning (13), High Tx Bias Current Alarm (14), Low Tx Bias Current Alarm (15), High Tx Bias Current Warning (16), Low Tx Bias Current Warning (17), High TX Power Alarm (18), Low TX Power Alarm (19), High TX Power Warning (20), Low TX Power Warning (21), High RX Power Alarm (22), Low RX Power Alarm (23), High RX Power Warning (24), Low RX Power Warning (25) |
| Board[id]/SynchModule[i] | Hardware synchronization module (SYNCHMOD) | Equipment not present (1) |
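
Resource addresses in this list are slash-separated paths whose elements may carry a bracketed identifier, such as Board[id]/TransceiverSlot[id]/Lane[id]. As a rough illustration of how a client might split such an address, here is a minimal Python sketch. The function names are hypothetical, the concrete identifier values in the usage example are invented, and the accepted character set is an assumption based only on the addresses shown in this list.

```python
import re
from typing import List, Optional, Tuple

# One path element: a name, optionally followed by a bracketed identifier.
# Elements such as "HwCompFpga" carry no identifier.
_ELEMENT = re.compile(r"^(?P<name>[A-Za-z][A-Za-z0-9-]*)(?:\[(?P<id>[^\[\]]+)\])?$")


def parse_resource_address(address: str) -> List[Tuple[str, Optional[str]]]:
    """Split a resource address into (element name, identifier) pairs."""
    parts = []
    for element in address.split("/"):
        match = _ELEMENT.match(element)
        if match is None:
            raise ValueError(f"malformed element {element!r} in {address!r}")
        parts.append((match.group("name"), match.group("id")))
    return parts


def resource_kind(address: str) -> str:
    """Return the name of the last path element, e.g. 'Lane' or 'Fan-tray'."""
    return parse_resource_address(address)[-1][0]
```

With this sketch, `parse_resource_address("Board[1]/HwCompFpga/TempSensor[2]")` yields `[("Board", "1"), ("HwCompFpga", None), ("TempSensor", "2")]`, and `resource_kind("Board[1]/TransceiverSlot[3]/Lane[4]")` yields `"Lane"`, which distinguishes a per-lane alarm from a per-transceiver one.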

Interface and MEP Alarms

| Row | Condition | Description | Filtering Expected | Reporting* | Severity* | Service Affecting* |
|---|---|---|---|---|---|---|
| 1 | LinkDown | Link is down | Yes | Enabled | Major | Yes |
| 2 | DefXConnCCM | The MEP has received at least one CCM from either another MAID or a lower MD Level whose CCM Interval has not yet timed out | No | Disabled | Major | Yes |
| 3 | DefErrorCCM | The MEP has received at least one invalid CCM whose CCM Interval has not yet timed out | No | Disabled | Major | Yes |
| 4 | DefRemoteCCM | The MEP is not receiving valid CCMs from at least one of the remote MEPs | No | Disabled | Major | Yes |
| 5 | DefRdiCCM | A remote MEP has reported the RDI bit in its last CCM | No | Disabled | Minor | No |
| 6 | dLOC | Loss of continuity defect | No | Disabled | Major | Yes |
| 7 | dUNL | Unexpected MEL defect | No | Disabled | Major | Yes |
| 8 | dMMG | Mismerge defect | No | Disabled | Major | Yes |
| 9 | dUNM | Unexpected MEP defect | No | Disabled | Minor | No |
| 10 | dUNP | Unexpected periodicity defect | No | Disabled | Minor | No |
| 11 | dRDI | Remote defect indicator | No | Disabled | Minor | No |
| 12 | dCSF-LOS | Client signal fail defect-LOS | No | Disabled | Major | Yes |
| 13 | dCSF-FDI | Client signal fail defect-FDI | No | Disabled | Major | Yes |
| 14 | dCSF-RDI | Client signal fail defect-RDI | No | Disabled | Major | No |

*Default configuration
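
Most Interface and MEP conditions are service affecting by default yet have reporting disabled, so an operator may want to list them before deciding which to enable. The Python sketch below encodes the default Reporting, Severity, and Service Affecting values from the table and selects those conditions. The `AlarmDefault` type and the `unreported_service_affecting` function are hypothetical helpers for illustration, not an Alarm Manager API.

```python
from typing import List, NamedTuple


class AlarmDefault(NamedTuple):
    condition: str
    reporting_enabled: bool
    severity: str
    service_affecting: bool


# Default values transcribed from the Interface and MEP Alarms table.
DEFAULTS = [
    AlarmDefault("LinkDown", True, "Major", True),
    AlarmDefault("DefXConnCCM", False, "Major", True),
    AlarmDefault("DefErrorCCM", False, "Major", True),
    AlarmDefault("DefRemoteCCM", False, "Major", True),
    AlarmDefault("DefRdiCCM", False, "Minor", False),
    AlarmDefault("dLOC", False, "Major", True),
    AlarmDefault("dUNL", False, "Major", True),
    AlarmDefault("dMMG", False, "Major", True),
    AlarmDefault("dUNM", False, "Minor", False),
    AlarmDefault("dUNP", False, "Minor", False),
    AlarmDefault("dRDI", False, "Minor", False),
    AlarmDefault("dCSF-LOS", False, "Major", True),
    AlarmDefault("dCSF-FDI", False, "Major", True),
    AlarmDefault("dCSF-RDI", False, "Major", False),
]


def unreported_service_affecting(defaults: List[AlarmDefault]) -> List[str]:
    """Conditions that affect service by default but are not reported by default."""
    return [d.condition for d in defaults
            if d.service_affecting and not d.reporting_enabled]
```

Calling `unreported_service_affecting(DEFAULTS)` returns the eight MEP conditions from DefXConnCCM through dCSF-FDI that are marked service affecting; LinkDown is excluded because its reporting is enabled by default.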

Resource List

| Resource Address | Resource Description | Conditions being monitored (and their row number) |
|---|---|---|
| Board[id]/Bridge[id]/Interface[id] | Network Interface; logical point of interconnection | LinkDown (1) |
| Board[id]/Bridge[id]/If[id]/Vlan[id]/MDL[id]/MEP[id] | Maintenance Association Endpoint (MEP) | DefXConnCCM (2), DefErrorCCM (3), DefRemoteCCM (4), DefRdiCCM (5), dLOC (6), dUNL (7), dMMG (8), dUNM (9), dUNP (10), dRDI (11), dCSF-LOS (12), dCSF-FDI (13), dCSF-RDI (14) |

© 2024 Cisco and/or its affiliates. All rights reserved.
 
For more information about trademarks, please visit: Cisco trademarks
For more information about legal terms, please visit: Cisco legal terms

For legal information about Accedian Skylight products, please visit: Accedian legal terms and trademarks


