

Title page

Alcatel-Lucent LTE Evolved Packet Core (EPC)
9471 Wireless Mobility Manager (WMM) | Release WM7.0.0
Alarm Dictionary
9YZ-05481-0005-RKZZA
Issue 1 | August 2013

Alcatel-Lucent Proprietary
Use pursuant to applicable agreements

Legal notice


Alcatel, Lucent, Alcatel-Lucent and the Alcatel-Lucent logo are trademarks of Alcatel-Lucent. All other trademarks are the property of their respective
owners.

The information presented is subject to change without notice. Alcatel-Lucent assumes no responsibility for inaccuracies contained herein.
Copyright 2013 Alcatel-Lucent. All rights reserved.
Contains proprietary/trade secret information which is the property of Alcatel-Lucent and must not be made available to, or copied or used by anyone outside
Alcatel-Lucent without its written authorization.
Not to be used or disclosed except in accordance with applicable agreements.

Notice

Every effort has been made to ensure that the information contained in this document was accurate at the time of printing. However, information is subject to
change.

PICMG, AdvancedTCA, and ATCA are registered trademarks of the PCI Industrial Computer Manufacturers Group.

Conformance statements

Refer to Appendix A, Compliance Summary.

Limited warranty

This system is a single homogeneous system consisting of component parts designed to operate in the manner that the switch is configured when provided to
the customer. Changes to system level configurations set "at the factory" can affect the availability, throughput, standards compliance, and stability of the
product and result in expanded unplanned downtimes as unforeseen issues arise with untested configuration settings. Changes from factory settings can result
in violation of warranty and maintenance agreements with Alcatel-Lucent and should not be performed without the express written consent of
Alcatel-Lucent.

Licenses

Refer to the 9471 WMM Technical Description for a complete licensing statement.

Technical support

For technical support, contact your local customer support team. Reach them on the web at http://alcatel-lucent.com/support or at the telephone number
listed under the Technical Assistance Center menu at http://www.alcatel-lucent.com/contact.

Contents

About this document


Purpose .......... xiii
Reason for reissue .......... xiii
Intended audience .......... xiv
How to use this document .......... xiv
Conventions used .......... xiv
Related information .......... xv
To obtain technical support, documentation, and training or submit feedback .......... xvi
How to comment .......... xvi

1 About alarm management

Overview .......... 1-1
Alarm groups description .......... 1-2
Network event categories description .......... 1-6

2 MME Alarms

Overview .......... 2-1
LSS_cmasFailure .......... 2-4
LSS_cmasReceiveFailure .......... 2-5
LSS_cmasSendFailure .......... 2-6
LSS_cpiGTPcResponseTOGn .......... 2-7
LSS_cpiGTPcResponseTOS3 .......... 2-9
LSS_cpiGTPcResponseTOSv .......... 2-11
LSS_cpiHOFailuresTo3G2GOverGn .......... 2-13
LSS_cpiHOfailuresFromGERANoverS3 .......... 2-15
LSS_cpiHOfailuresFromUTRANoverS3 .......... 2-17
LSS_cpiHOfailuresRAUto2G3GOverS3 .......... 2-19
LSS_cpiHOfailuresRAUto2G3GnewSgwOverS3 .......... 2-21
LSS_cpiHOfailuresRAUto2G3GsameSgwOverS3 .......... 2-23
LSS_cpiHOfailuresToGERANoverS3 .......... 2-25
LSS_cpiHOfailuresToUTRANoverS3 .......... 2-27
LSS_cpiMAFCommunicationFailureRate .......... 2-29
LSS_cpiMBMSSessionStartM3FailureRate .......... 2-31
LSS_cpiMBMSSessionStartSmFailureRate .......... 2-33
LSS_cpiMBMSSessionStopM3FailureRate .......... 2-35
LSS_cpiMBMSSessionStopSmFailureRate .......... 2-37
LSS_cpiMBMSSessionUpdateM3FailureRate .......... 2-39
LSS_cpiMBMSSessionUpdateSmFailureRate .......... 2-41
LSS_cpiMafAttachFailuresSysRelated .......... 2-43
LSS_cpiMafAttachWithPGWreselection .......... 2-44
LSS_cpiMafAttachWithSGWreselection .......... 2-45
LSS_cpiMafEIRfailuresS13 .......... 2-46
LSS_cpiMafExtServiceReqFailuresSysRelated .......... 2-47
LSS_cpiMafExtServiceRequestFailures .......... 2-49
LSS_cpiMafFailuresOverSGs .......... 2-51
LSS_cpiMafHLRAuthFail .......... 2-52
LSS_cpiMafHSSreselection .......... 2-53
LSS_cpiMafPDNconnWithPGWreselection .......... 2-54
LSS_cpiMafServiceReqFailuresSysRelated .......... 2-55
LSS_cpiMafTauFailuresInterMme .......... 2-56
LSS_cpiMafTauFailuresInterMmeInterSgw .......... 2-57
LSS_cpiMafTauFailuresInterSgw .......... 2-59
LSS_cpiNoPSHOFailuresOverSv .......... 2-61
LSS_cpiPSHOFailuresOverSv .......... 2-63
LSS_cpiS3TauFailures .......... 2-65
LSS_cpiS3TauFailuresInterSgw .......... 2-67
LSS_cpiS3TauFailuresIntraSGW .......... 2-69
LSS_cpiStopWarnMsgDeliveryS1MMEFailureRate .......... 2-71
LSS_cpiStopWarnMsgDeliverySBcFailureRate .......... 2-73
LSS_cpiUECapacityUsage .......... 2-75
LSS_cpiWarnMsgDeliveryS1MMEFailureRate .......... 2-76
LSS_cpiWarnMsgDeliverySBcFailureRate .......... 2-78
LSS_dataMismatch .......... 2-80
LSS_excessiveExternalLinksDown .......... 2-83
LSS_externalLinkConfigurationLimit .......... 2-84
LSS_externalLinkDown .......... 2-85
LSS_failedAttachReqsRateExceeded .......... 2-86
LSS_failedAuthRequestsHSSRateExceeded .......... 2-88
LSS_failedAuthRequestsUERateExceeded .......... 2-90
LSS_failedCrDedBearerReqsRateExceeded .......... 2-91
LSS_failedDeactDedBearerReqsRateExceeded .......... 2-93
LSS_failedHRPDhandoverRateExceeded .......... 2-94
LSS_failedMobileTermLocRequestRateExceeded .......... 2-95
LSS_failedNetwrkInducedLocRequestRateExceeded .......... 2-97
LSS_failedNumHOFwdRelocRateExceeded .......... 2-99
LSS_failedNumHOPathSwNewSgwRateExceeded .......... 2-100
LSS_failedNumHOPathSwSameSgwRateExceeded .......... 2-101
LSS_failedNumHORequiredRateExceeded .......... 2-102
LSS_failedS1MMEconnEstRateExceeded .......... 2-103
LSS_failedServiceReqsRateExceeded .......... 2-104
LSS_failedTAURateExceeded .......... 2-106
LSS_failedUpdBearerReqsRateExceeded .......... 2-108
LSS_failedUpdDedBearerReqsRateExceeded .......... 2-109
LSS_ggsnDnsError .......... 2-110
LSS_internalCommunicationFailure .......... 2-111
LSS_ippuBusError .......... 2-112
LSS_ippuResourceReset .......... 2-114
LSS_liNearingCapacityLimit .......... 2-115
LSS_maxDurationExpiredOnHRPDhandover .......... 2-116
LSS_mmeDnsError .......... 2-117
LSS_noResetAckReceived .......... 2-118
LSS_numTOS10gtpcRateExceeded .......... 2-119
LSS_numTOS11gtpcRateExceeded .......... 2-120
LSS_numTOS3gtpcRateExceeded .......... 2-121
LSS_pathAvailability .......... 2-122
LSS_pgwDnsError .......... 2-123
LSS_provisioningError .......... 2-124
LSS_sgsnDnsError .......... 2-125
LSS_taiFqdnError .......... 2-126

3 SGSN Alarms

Overview .......... 3-1
LSS_cdrStorageSpaceThreshold .......... 3-3
LSS_cgfNotResponding .......... 3-4
LSS_cgfServiceNotSupported .......... 3-5
LSS_cgfSystemFailure .......... 3-6
LSS_cgfVersionNotSupported .......... 3-7
LSS_cpiGTPcResponseTOGn .......... 3-8
LSS_cpiGTPcResponseTOS3 .......... 3-10
LSS_cpiUECapacityUsage .......... 3-12
LSS_excessiveExternalLinksDown .......... 3-13
LSS_externalLinkDown .......... 3-14
LSS_ggsnDnsError .......... 3-15
LSS_internalCommunicationFailure .......... 3-16
LSS_ippuBusError .......... 3-17
LSS_ippuResourceReset .......... 3-19
LSS_liNearingCapacityLimit .......... 3-20
LSS_msThreshold .......... 3-21
LSS_noResetAckReceived .......... 3-22
LSS_nseBandwidthThreshold .......... 3-23
LSS_pathAvailability .......... 3-24
LSS_pdpThreshold .......... 3-25
LSS_sgsnDnsError .......... 3-26

4 BASE_ATCA Alarms

Overview .......... 4-1
ATCA_AggregatePowerSensor .......... 4-6
ATCA_AggregateTemperatureSensor .......... 4-7
ATCA_BoardPower .......... 4-8
ATCA_CPLDState .......... 4-9
ATCA_DS75Temperature .......... 4-11
ATCA_ExhaustTemp .......... 4-13
ATCA_FPGATemp .......... 4-15
ATCA_FanSpeed .......... 4-17
ATCA_FanTrayPresence .......... 4-18
Contents

....................................................................................................................................................................................................................................
ATCA_FanTraysFRU ........................................................................................................................................................... 4-19
4-19

ATCA_FilterPresence ........................................................................................................................................................... 4-21


4-21

ATCA_I2CLocalBus ............................................................................................................................................................. 4-22


4-22

ATCA_IPMBLink .................................................................................................................................................................. 4-23


4-23

ATCA_InletTemp ................................................................................................................................................................... 4-24


4-24

ATCA_LM75Temperature .................................................................................................................................................. 4-26


4-26

ATCA_LM83Temperature .................................................................................................................................................. 4-28


4-28

ATCA_LMeUC75Temperature ......................................................................................................................................... 4-30


4-30

ATCA_LMeUC75Top-Rig ................................................................................................................................................. 4-32


4-32

ATCA_LocalTemperature ................................................................................................................................................... 4-34


4-34

ATCA_MMCTemp ................................................................................................................................................................ 4-35


4-35

ATCA_OcteonTemperature ................................................................................................................................................ 4-37


4-37

ATCA_OutletTemp 4-38
ATCA_PayloadCurrent 4-40
ATCA_PayloadVoltage 4-42
ATCA_PowerOk 4-44
ATCA_ShelfFRUs 4-45
ATCA_UnexpectedDeact 4-47
ATCA_m48vSensor 4-48
LSS_cardConnectionLost 4-49
LSS_cardError 4-51
LSS_cpiAlrmCritical 4-52
LSS_cpiAlrmMajor 4-53
LSS_cpiAlrmMinor 4-54
LSS_cpiAlrmWarning 4-55
LSS_cpiAsrtEsc 4-56
LSS_cpiAsrtNonEsc 4-58
LSS_cpiAsrtNonEscCritical 4-60
LSS_cpiAsrtNonEscMajor 4-62
LSS_cpiAsrtNonEscMinor 4-64
LSS_cpiAudErrCount 4-66
LSS_cpiAudManAct 4-68
LSS_cpiAudNewEvent 4-70
LSS_cpiExceptionService 4-72
LSS_cpiFileSysUsage 4-74
LSS_cpiMemAllocFail 4-75
LSS_cpiReinitServiceSelf 4-76
LSS_cpuOverload 4-78
LSS_databaseConnectionLost 4-79
LSS_databaseReplicationLinkDown 4-80
LSS_databaseSizeExhausted 4-81
LSS_dbHighCpuUtilization 4-82
LSS_dbOffline 4-83
LSS_dbStatusUnexpected 4-84
LSS_degradedResource 4-85
LSS_degrow 4-126
LSS_diskGoingDown 4-127
LSS_diskSector 4-128
LSS_dnsThreshold 4-129
LSS_ethernetError 4-130
LSS_ethernetLinkDown 4-131
LSS_externalConnectivity 4-133
LSS_fru 4-134
LSS_grow 4-135
LSS_hostDown 4-136
LSS_memoryOverload 4-137
LSS_nodeGroupOOS 4-138
LSS_nodeOOS 4-139
LSS_numberOfTuplesInUse 4-140
LSS_osSecInfoModificationDetected 4-141
LSS_osSecInformationMissing 4-142
LSS_osSecUnexpectedInformation 4-143
LSS_patch 4-144
LSS_pktCorruptionDetectedViaRCCLANCheck 4-145
LSS_platformCommandFailure 4-146
LSS_pmDataNotCollected 4-147
LSS_processDown 4-148
LSS_processNotStarted 4-149
LSS_remoteQueryServerFailure 4-152
LSS_remotedbLinkDown 4-153
LSS_restore 4-154
LSS_serviceOnewayCommunication 4-155
LSS_sheddingOverload 4-156
LSS_shmcEthernetError 4-157
LSS_simxml 4-158
LSS_softwareAllocatedResourceOverload 4-159
LSS_softwareComponentStandbyNotReady 4-160
LSS_svcdegrow 4-161
LSS_svcgrow 4-162
LSS_swVersionMismatch 4-163
LSS_tftpDownloadCorrupt 4-164
LSS_threadsExhausted 4-166
LSS_upgrade 4-167
LSS_virtualClusterDown 4-168
RALARM_Loop 4-169
RALARM_Power 4-170
SYS_BackupFailure 4-171
SYS_CPM_USERDATA_INCONSITENCY 4-172
SYS_CPM_USERDATA_RESTORED 4-173
SYS_Configuration 4-174
SYS_EventQueueCapacity 4-176
SYS_ICMPFailure 4-177
SYS_IPsecConfig 4-178
SYS_LinkDown 4-179
SYS_NotifyDisabled 4-180
SYS_NotifyLocked 4-181
SYS_RADIUS_TO_LDAP_FAILURE 4-182
SYS_ROOT_ACCESS_DENIED 4-183
SYS_ROOT_FTP_VIOLATION 4-184
SYS_ROOT_LOGIN_VIOLATION 4-185
SYS_ROOT_SSH_LOGIN_VIOLATION 4-186
SYS_SNETrapOverload 4-187
SYS_SNMPAuthenticationFailure 4-188
SYS_SNMPFailure 4-189
SYS_SU_TO_ROOT_FAILURE 4-190
SYS_SYSTEMTrapOverload 4-191
SYS_SetupAAAFailure 4-192
SYS_TestAlarm 4-193
SYS_ThresholdCrossed 4-194
SYS_UndiscoveredObject 4-195
SYS_WriteAAAFailure 4-196

A Compliance Summary
9471 WMM compliance summary A-1

B References
Revision history B-1

Index

About this document

Purpose
This document is used to interpret alarms on the Alcatel-Lucent 9471 WMM.

Reason for reissue


The following was updated in this release:

Reason for Change            Location
New MME alarms               LSS_cmasFailure (p. 2-4)
                             LSS_cmasReceiveFailure (p. 2-5)
                             LSS_cmasSendFailure (p. 2-6)
                             LSS_cpiMafAttachWithPGWreselection (p. 2-44)
                             LSS_cpiMafAttachWithSGWreselection (p. 2-45)
                             LSS_cpiMafHSSreselection (p. 2-53)
                             LSS_cpiMafPDNconnWithPGWreselection (p. 2-54)
                             LSS_excessiveExternalLinksDown (p. 2-83)
                             LSS_externalLinkConfigurationLimit (p. 2-84)
New ATCA platform alarms     ATCA_LMeUC75Top-Rig (p. 4-32)
                             LSS_nodeGroupOOS (p. 4-138)
                             LSS_nodeOOS (p. 4-139)
                             LSS_threadsExhausted (p. 4-166)
Modified ATCA_BASE alarms    LSS_diskSector (p. 4-128) (updated fault clearance procedure)
                             SYS_Configuration (p. 4-174) (changed 'ntp' to 'ntp_server' in
                             fault clearance commands)

Intended audience
This document is for service provider personnel who support the 9471 WMM.

How to use this document


The WMM application is built on a common platform used by many different
applications. The WMM does not use all of the capabilities of the platform; therefore,
some base ATCA alarms may not be applicable. In addition, some alarms refer to
functionality that may not apply to the WMM, such as the following: CDR,
SS7, FS5K, FS GUI, NGSS, TL1, and CPSB.

Conventions used
The following conventions are used throughout this information product:
Typographic conventions
This information product presents different types of information in different typefaces to
emphasize the nature of the information:
Literal input: Keystrokes that you are to enter character by character exactly as shown
in the text appear in monospace bold type. For example:
Enter the following command:
apappsconfig
Variable user input: Input values that vary from one execution or instance to another
appear in monospace bold italic type. For example:
cd directory
where
directory = the directory to change to.
Literal output: The names of files, directories, forms, messages, and other information
that a system outputs exactly as shown in the text appear in monospace regular
type. For example:
RST SPA=cnam REQUEST ACKNOWLEDGED
Variable system output: Values that vary from one instance to another in system
output appear in monospace italic type. For example:
RST SPA=SPA_NAME REQUEST COMPLETED
where
SPA_NAME = the name of the Service Package Application (SPA) that is successfully
restored.
The names of keys on a terminal keyboard are indicated by bold letters. For example:

Press the F4 (Enter Query) function key.
The Ctrl (Control) key is signified by the caret (^) symbol. When the ^ symbol
precedes the name of another key (as in ^e), press the Ctrl key and the other key
simultaneously.
Actions for user input
In this information product, the following words specify what actions to perform to input
data or execute commands:
The word enter means to key in the specified keystrokes (such as a command) and
then press the Enter or Return key. For example:
Enter the following command:
apappconfig
The word type means to key in the specified keystrokes (such as a value in the field of
a form) without pressing the Enter or Return key. For example:
In the IP address field, type the IP address of the host server.

Related information
The following documents contain information related to this product:

Document Number         Document Title
9YZ-05481-0001-DEZZA    9471 WMM Technical Description
9YZ-05481-0002-REZZA    9471 WMM Operations, Administration & Maintenance
9YZ-05481-0003-USZZA    9471 WMM Security Management
                        Note: Restricted Document only available through the OLCS website.
9YZ-05481-0004-RJZZA    9471 WMM Software Update
9YZ-05481-0006-RKZZA    9471 WMM Observation Counters
9YZ-05481-0008-RJZZA    9471 WMM Site Preparation
9YZ-05481-0012-REZZA    9471 WMM CALEA/LI Management
                        Note: Restricted Document only available through the OLCS website.

To obtain technical support, documentation, and training or submit feedback
The Online Customer Support (OLCS) web site (http://support.alcatel-lucent.com)
provides access to technical support, related documentation, related training, and
feedback tools. The site also provides account registration for new users.

How to comment
To comment on this document, go to the Online Comment Form
(http://infodoc.alcatel-lucent.com/comments/) or e-mail your comments to the
Comments Hotline (comments@alcatel-lucent.com).

1 About alarm management

Overview
Purpose
This chapter provides a description of alarm groups and network event categories.

Contents

Alarm groups description 1-2


Network event categories description 1-6


Alarm groups description


Overview
Alarm categories can be viewed in the MI GUI under the Fault Management --->
Alarms menu. The alarm types, alarm severities, and probable causes are defined in
CCITT (now ITU-T) Recommendation X.733.
Although alarm details are available in X.733, the WMM uses the 3GPP standards,
which are based on X.733.

Introduction
The alarm log contains all propagated events that have one of the following severities:
Warning, Minor, Major, Indeterminate, or Critical. All other events appear in the event log.
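
The rule above can be sketched in a few lines of Python. This is only an illustration of the stated rule, not WMM or MI-Agent code: the function name `target_log` and the use of plain strings for severities are assumptions made for the example.

```python
# Severities that place a propagated event in the alarm log,
# as listed in the Introduction text.
ALARM_LOG_SEVERITIES = frozenset(
    {"Warning", "Minor", "Major", "Indeterminate", "Critical"}
)


def target_log(severity: str) -> str:
    """Return the log a propagated event appears in (hypothetical helper)."""
    if severity in ALARM_LOG_SEVERITIES:
        return "alarm log"
    # Every other severity is treated as an ordinary event.
    return "event log"
```

For example, `target_log("Major")` selects the alarm log, while an event with Informational severity falls through to the event log.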

Alarm types
The Alarm Type field is shown as a column in the alarms window on the MI GUI, and as
eventType on the properties of an alarm.

Alarm Type                                      Explanation
Communication alarm                             Associated with the procedures and/or processes required to convey
                                                information from one point to another.
Quality of Service Alarm                        Associated with degradation in the quality of a service.
Processing Error alarm                          Associated with a software or processing fault.
Equipment alarm                                 Associated with an equipment fault.
Environmental alarm                             Associated with a condition relating to an enclosure in which the
                                                equipment resides.
Integrity violation                             For security alarms: associated with duplicate information,
                                                information missing, information modification detected, information
                                                out of sequence, or unexpected information.
Operational violation                           For security alarms: associated with denial of service, out of
                                                service, procedural error, or unspecified reason.
Physical violation alarm                        For security alarms: associated with cable tamper, intrusion
                                                detection or unspecified reason.
Security service or mechanism violation alarm   For security alarms: associated with authentication failure, breach
                                                of confidentiality, non-repudiation failure, unauthorized access
                                                attempt, or unspecified reason.
Time domain violation alarm                     For security alarms: associated with delayed information, key
                                                expired or out of hours activity.

Alarm severity
The Severity field appears as a column in the alarms window on the MI GUI, and as a
field in the alarm details.

Severity        Explanation
Critical        The Critical severity level indicates that a service-affecting condition has
                occurred and immediate corrective action is required. For example, this
                severity is reported when a managed object is out of service and its
                capability must be restored.
Major           The Major severity level indicates that a service-affecting condition has
                developed and urgent corrective action is required. For example, this
                severity is reported when a severe degradation in the capability of the
                managed object exists and its full capability must be restored.
Minor           The Minor severity level indicates the existence of a non-service-affecting
                fault condition and that corrective action should be taken to prevent a more
                serious (for example, service-affecting) fault. For example, this severity is
                reported when the detected alarm condition is not currently degrading the
                capacity of the managed object.
Warning         The Warning severity level indicates the detection of a potential or
                impending service-affecting fault, before any significant effects have been
                felt. To prevent a more serious service-affecting fault, further action should
                be taken to diagnose and correct the problem.
Indeterminate   The Indeterminate severity level indicates that the severity level cannot be
                determined by the sending element.
Cleared         The Cleared severity level indicates the clearing of one or more previously
                reported alarms. This alarm clears all alarms for the managed object that
                have the same alarm type, probable cause, and specific problems. Multiple
                associated notifications can be cleared by using the correlated
                notifications parameter.

When using filters, a !clear severity is also available. This severity can be used to show
all active (not cleared) alarms in the system.
Alarms with Informational severity appear in the event log and are described in the
events part of this document.
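
As an illustration of how a !clear filter differs from a filter on a single severity, the following Python sketch selects alarms from an in-memory list. The `Alarm` class, the `filter_by_severity` function, and the sample severities are hypothetical and are not part of the MI GUI or any WMM interface; only the meaning of !clear (all alarms that are not Cleared) comes from the text above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alarm:
    name: str
    severity: str  # for example "Critical", "Major", or "Cleared"


def filter_by_severity(alarms, severity_filter):
    """Select alarms by severity (hypothetical helper).

    The special filter "!clear" keeps every alarm whose severity is
    not Cleared, that is, all active alarms. Any other filter value
    is matched against the severity name, ignoring case.
    """
    if severity_filter == "!clear":
        return [a for a in alarms if a.severity != "Cleared"]
    wanted = severity_filter.lower()
    return [a for a in alarms if a.severity.lower() == wanted]


# Sample data; the severities assigned here are invented for the example.
sample = [
    Alarm("LSS_cpuOverload", "Major"),
    Alarm("LSS_diskSector", "Cleared"),
    Alarm("SYS_LinkDown", "Critical"),
]
active = filter_by_severity(sample, "!clear")
```

With this sample data, `active` holds the LSS_cpuOverload and SYS_LinkDown entries, because only LSS_diskSector has the Cleared severity.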

Cause of alarms
The Probable Cause field is shown as a column in the alarms window on the MI GUI,
and as the stringProbableCause field in the properties of an alarm. It provides an indication
of the most likely cause of the alarm and the related problem.
Two examples of probable causes available on the MI-Agent are explained in the table
that follows. Descriptions are taken from the ITU-T X.733 specification. For a
complete description of the probable causes, refer to X.733 and the other standards listed
under Reference.

Probable Cause                        Explanation
Communication Protocol Error          A communication protocol has been violated
Configuration or Customizing Error    A system or device generation or customizing parameter has
                                      been specified incorrectly, or is inconsistent with the
                                      actual configuration

Alarm categories
The alarm category is shown in the Alarm Summary View, as a field on the alarm
details, and can optionally be added as a column to the alarms view window. This
category is used to group all alarms on a system in addition to the provided severity. The
different category types can also be used as criteria within custom views.

Category Category related to:


CP Hosts The hosts of the WMM
CP HW The hardware of the WMM
CP Services The services on top of the WMM
Topology The physical architecture of the network

Reference
The following specifications apply to alarms and events:
Telecommunication management; Fault Management; Part 2: Alarm Integration
Reference Point (IRP): Information Service (IS), 3GPP TS 32.111-2
Telecommunication management; Fault Management; Part 3: Alarm Integration
Reference Point (IRP): Common Object Request Broker Architecture (CORBA)
Solution Set (SS), 3GPP TS 32.111-3
Generic network information model, ITU-T M.3100
Information technology - Open Systems Interconnection - Structure of management
information: Definition of management information, ITU-T X.721
Information technology - Open Systems Interconnection - Systems Management:
Alarm reporting function, ITU-T X.733
Information technology - Open Systems Interconnection - Systems Management:
Security alarm reporting function, ITU-T X.736


Network event categories description


Introduction
Network events provide detail and history about activities that generate alarms. Events
can be used to correlate multiple alarms to a particular event. The same tasks that can be
performed on alarms can be performed on events (for example, search, creating custom
views, and filtering).
Events can be viewed in the MI GUI under Fault Management -> Events. Events
received by the MI-Agent are logged in the events log. All events with the severities
Warning, Minor, Major, Indeterminate, and Critical are propagated to an alarm in the
alarm log. Events with other severities (Info or State) are not propagated to the
alarm log and stay only in the event log.
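The propagation rule described above can be sketched in a few lines of Python. This is an illustration only: the function name and log labels are hypothetical, and treating any severity not named in the text (for example, Cleared) as event-log only is an assumption.

```python
# Severities that the text says are propagated from the event log to the alarm log.
ALARM_SEVERITIES = {"Warning", "Minor", "Major", "Indeterminate", "Critical"}


def route_event(severity):
    """Return the logs in which an event of the given severity ends up."""
    logs = ["event log"]  # every event received by the MI-Agent is logged here
    if severity in ALARM_SEVERITIES:
        logs.append("alarm log")  # propagated to an alarm
    return logs
```

For example, route_event("Major") lists both logs, while route_event("Info") and route_event("State") list only the event log.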

Severity description
Events use all the standard severities used for alarms. Event-specific severities are
described in the table that follows.

Severity Explanation
Info Used for alarms that do not require action to be taken.
Because this severity does not meet the X.733 standard, these alarms are not
propagated to the alarm log and show only in the event log.
State Indicates that a state change happened to a managed object on the MI-Agent.
The state follows the X.731 standard; refer to that standard for more
information.

Event details
Details about specific events can be retrieved by viewing the details for the event. The
details and the Message field provide information to understand the cause of the event.

Custom Views
Certain types of events can be filtered out of the extensive event log by using Custom
Views on the MI GUI menu bar. One example is the custom view for informational
alarms, which can be viewed separately in a subfolder under Events.

2 MME Alarms

Overview
Purpose
This chapter contains alarms that are specific to the MME.

Contents

LSS_cmasFailure
LSS_cmasReceiveFailure
LSS_cmasSendFailure
LSS_cpiGTPcResponseTOGn
LSS_cpiGTPcResponseTOS3
LSS_cpiGTPcResponseTOSv
LSS_cpiHOFailuresTo3G2GOverGn
LSS_cpiHOfailuresFromGERANoverS3
LSS_cpiHOfailuresFromUTRANoverS3
LSS_cpiHOfailuresRAUto2G3GOverS3
LSS_cpiHOfailuresRAUto2G3GnewSgwOverS3
LSS_cpiHOfailuresRAUto2G3GsameSgwOverS3
LSS_cpiHOfailuresToGERANoverS3
LSS_cpiHOfailuresToUTRANoverS3
LSS_cpiMAFCommunicationFailureRate
LSS_cpiMBMSSessionStartM3FailureRate
LSS_cpiMBMSSessionStartSmFailureRate
LSS_cpiMBMSSessionStopM3FailureRate
LSS_cpiMBMSSessionStopSmFailureRate

LSS_cpiMBMSSessionUpdateM3FailureRate
LSS_cpiMBMSSessionUpdateSmFailureRate
LSS_cpiMafAttachFailuresSysRelated
LSS_cpiMafAttachWithPGWreselection
LSS_cpiMafAttachWithSGWreselection
LSS_cpiMafEIRfailuresS13
LSS_cpiMafExtServiceReqFailuresSysRelated
LSS_cpiMafExtServiceRequestFailures
LSS_cpiMafFailuresOverSGs
LSS_cpiMafHLRAuthFail
LSS_cpiMafHSSreselection
LSS_cpiMafPDNconnWithPGWreselection
LSS_cpiMafServiceReqFailuresSysRelated
LSS_cpiMafTauFailuresInterMme
LSS_cpiMafTauFailuresInterMmeInterSgw
LSS_cpiMafTauFailuresInterSgw
LSS_cpiNoPSHOFailuresOverSv
LSS_cpiPSHOFailuresOverSv
LSS_cpiS3TauFailures
LSS_cpiS3TauFailuresInterSgw
LSS_cpiS3TauFailuresIntraSGW
LSS_cpiStopWarnMsgDeliveryS1MMEFailureRate
LSS_cpiStopWarnMsgDeliverySBcFailureRate
LSS_cpiUECapacityUsage
LSS_cpiWarnMsgDeliveryS1MMEFailureRate
LSS_cpiWarnMsgDeliverySBcFailureRate
LSS_dataMismatch
LSS_excessiveExternalLinksDown
LSS_externalLinkConfigurationLimit
LSS_externalLinkDown
LSS_failedAttachReqsRateExceeded
LSS_failedAuthRequestsHSSRateExceeded
LSS_failedAuthRequestsUERateExceeded
LSS_failedCrDedBearerReqsRateExceeded
LSS_failedDeactDedBearerReqsRateExceeded
LSS_failedHRPDhandoverRateExceeded
LSS_failedMobileTermLocRequestRateExceeded
LSS_failedNetwrkInducedLocRequestRateExceeded
LSS_failedNumHOFwdRelocRateExceeded
LSS_failedNumHOPathSwNewSgwRateExceeded
LSS_failedNumHOPathSwSameSgwRateExceeded
LSS_failedNumHORequiredRateExceeded
LSS_failedS1MMEconnEstRateExceeded
LSS_failedServiceReqsRateExceeded
LSS_failedTAURateExceeded
LSS_failedUpdBearerReqsRateExceeded
LSS_failedUpdDedBearerReqsRateExceeded
LSS_ggsnDnsError
LSS_internalCommunicationFailure
LSS_ippuBusError
LSS_ippuResourceReset
LSS_liNearingCapacityLimit
LSS_maxDurationExpiredOnHRPDhandover
LSS_mmeDnsError
LSS_noResetAckReceived
LSS_numTOS10gtpcRateExceeded
LSS_numTOS11gtpcRateExceeded
LSS_numTOS3gtpcRateExceeded
LSS_pathAvailability
LSS_pgwDnsError
LSS_provisioningError
LSS_sgsnDnsError
LSS_taiFqdnError


LSS_cmasFailure
Description
This alarm indicates that there is a software failure in the S1mme or sbc modules, related to
CMAS.

Default severity
CRITICAL, MAJOR, MINOR

Root Cause
Software failure in either the S1mme or sbc modules. Severity of the alarm is
controlled by provisioning global parameters.

Fault clearance procedure

1 Contact Alcatel-Lucent Customer Support.

END OF STEPS


LSS_cmasReceiveFailure
Description
This alarm indicates that the MME failed to receive an acknowledgement to a CMAS
message.

Default severity
CRITICAL, MAJOR, MINOR

Root Cause
Possible link failure on the S1mme interface. Severity of the alarm is controlled by
provisioning global parameters.

Fault clearance procedure

1 Verify that the S1mme links are up.

2 Alarm can only be cleared manually by running "alarm_cli --clear alarmName=LSS_cmasReceiveFailure"
from the active MI.

3 If condition persists, contact Alcatel-Lucent Customer Support.

END OF STEPS


LSS_cmasSendFailure
Description
This alarm indicates that there is a failure in sending a CMAS message over the S1mme or
sbc interfaces.

Default severity
CRITICAL, MAJOR, MINOR

Root Cause
Possible link failure on the S1mme or sbc interfaces. Severity of the alarm is
controlled by provisioning global parameters.

Fault clearance procedure

1 Verify that the S1mme and sbc links are up.

2 Alarm can only be cleared manually by running "alarm_cli --clear alarmName=LSS_cmasSendFailure"
from the active MI.

3 If condition persists, contact Alcatel-Lucent Customer Support.

END OF STEPS


LSS_cpiGTPcResponseTOGn
Description
The raised alarm, LSS_cpiGTPcResponseTOGn, indicates that the value of
VS.cpiGTPcResponseTOGn has exceeded a threshold in the last 15-minute interval. This
counter monitors the percentage of GTP Requests sent over a Gn interface for which no
Response is received by the WMM. The Gn interface connects the WMM with one or
more SGSNs. The calculated percentage is compared against provisioned thresholds for
Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10
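
The default criteria above can be read as a simple banding of the rate value. The following Python sketch is illustrative only: the function and parameter names are hypothetical, the defaults mirror the values listed above, and the thresholds actually in effect are whatever has been provisioned on the system.

```python
def cpi_severity(rate, minor=5.0, major=10.0, critical=15.0):
    """Map a CPI rate value to a severity band.

    Each lower bound is exclusive and each upper bound is inclusive,
    matching the default criteria (for example, 10 < rate <= 15 is Major).
    Returns None when no threshold is met.
    """
    if rate > critical:
        return "CRITICAL"
    if rate > major:
        return "MAJOR"
    if rate > minor:
        return "MINOR"
    return None
```

With these defaults a rate value of exactly 15 is still Major, and a rate value of exactly 5 meets no threshold.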

Root Cause
Failure to receive GTP responses from an SGSN could be due to any of the following
reasons:
Errors or problems at the far end SGSN
Network problems between the WMM and the SGSN
Internal errors at the WMM

Fault clearance procedure

1 Check neighboring SGSNs for error conditions or ongoing problems. Verify network
connectivity and proper configuration between WMM and SGSNs. If SGSNs and
network connectivity are verified, examine all the GTP failure counters to determine if
one failure cause predominates, and check fs.log to determine if errors related to the Gn
interface have been reported. Contact next level of support.

END OF STEPS


LSS_cpiGTPcResponseTOS3
Description
The raised alarm, LSS_cpiGTPcResponseTOS3, indicates that the GTP response failure rate
met a threshold in the last 5-minute interval. This failure rate monitors the percentage
of GTP Requests sent over an S3 interface for which no Response is received by the
MME. The S3 interface connects the MME with one or more SGSNs. The calculated
percentage is compared against provisioned thresholds for Minor, Major, and Critical
alarm conditions.
Notes:
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.
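
The two notes above can be sketched as a small state tracker. This is an illustration, not the WMM implementation: the class and method names are hypothetical, and the behavior when the severity changes between intervals (a new alarm is raised at the new severity) is an assumption, since the text only covers the same-severity and clear cases.

```python
class CpiAlarmTracker:
    """Tracks the last severity raised for each (CPI, component) pair."""

    def __init__(self):
        self._raised = {}

    def evaluate(self, cpi, component, severity):
        """Process one interval's result and return 'raise', 'clear', or None.

        severity is 'MINOR', 'MAJOR', 'CRITICAL', or None when no threshold is met.
        """
        key = (cpi, component)
        previous = self._raised.get(key)
        if severity is None:
            if previous is None:
                return None  # nothing raised, nothing to clear
            del self._raised[key]
            return "clear"  # no threshold met in a following interval
        if severity == previous:
            return None  # same severity is raised only once
        self._raised[key] = severity  # assumption: a changed severity is raised anew
        return "raise"
```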

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Failure to receive GTP responses from an SGSN could be due to any of the following
reasons:
Errors or problems at the far end SGSN
Network problems between the MME and the SGSN
Internal errors at the MME

Fault clearance procedure

1 Check neighboring SGSNs for error conditions or ongoing problems. Verify network
connectivity and proper configuration between MME and SGSNs. If SGSNs and network
connectivity are verified, examine all the GTP failure counters to determine if one failure
cause predominates, and check fs.log to determine if errors related to the S3 interface
have been reported. Contact next level of support.

END OF STEPS


LSS_cpiGTPcResponseTOSv
Description
The raised alarm, LSS_cpiGTPcResponseTOSv, indicates that the GTPc Response Timeout
over Sv CPI (requests sent over an Sv interface for which no response is received) has met
a threshold. The CPI is calculated every 5 minutes using this formula:
VS.NbrTO_SvGtpc / VS.TotalReqSent_SvGtpc
Notes:
The thresholds are configurable on the MI CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5
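
As a worked example of the formula in the Description, the Python sketch below computes the CPI from the two counters. The function and argument names are hypothetical. The formula as written yields a ratio; the sketch multiplies by 100 on the assumption that the rate values in the criteria above are percentages, and it returns 0 when no requests were sent, which is also an assumption.

```python
def sv_timeout_rate(nbr_to, total_req_sent):
    """Compute VS.NbrTO_SvGtpc / VS.TotalReqSent_SvGtpc as a percentage."""
    if total_req_sent == 0:
        return 0.0  # assumption: no requests sent means no timeout rate to report
    return 100.0 * nbr_to / total_req_sent


# Sample counter values, made up for the example.
rate = sv_timeout_rate(nbr_to=12, total_req_sent=400)
```

With 12 timeouts out of 400 requests the rate is 3.0, which falls in the default Minor band (2 < Rate value <= 5) if the thresholds are indeed percentages.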

Root Cause
Failure to receive GTP responses from an MSC could be due to any of the following
reasons:
Errors or problems at the far end MSC
Network problems between the MME and the MSC
Internal errors at the MME

Fault clearance procedure

1 Check neighboring MSC(s) for error conditions or ongoing problems. Verify network
connectivity and proper configuration between MME and MSC(s). If MSC(s) and
network connectivity are verified, examine all the GTP failure counters to determine if
one failure cause predominates, and check fs.log to determine if errors related to the Sv
interface have been reported. Contact next level of support.

END OF STEPS


LSS_cpiHOFailuresTo3G2GOverGn
Description
The raised alarm, LSS_cpiHOFailuresTo3G2GOverGn, indicates that the value of
VS.cpiHOFailuresto3G2GOverGn has exceeded a threshold in the last 15-minute interval.
This counter monitors the failure rate of attempted handovers from E-UTRAN to a
UTRAN/GERAN SGSN using the Gn interface. This includes Routing Area Update
procedures. The failure rate is compared against provisioned thresholds for Minor, Major,
and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Attempted handovers from E-UTRAN to a UTRAN/GERAN SGSN via the Gn interface
may fail for any of the following reasons:
Protocol Errors on the Gn interface with the SGSN
Gn link inhibited due to link lock or link disabled due to dependency on the parent
managed object
The UE Context requested by an SGSN is not available in the MME
UE failed security validation at the MME
Failure to complete Routing Area Update procedure at the SGSN
Network connectivity problems between MME and SGSN
MME failure to release resources at completion of Routing Area Update procedure
Internal MME error

Fault clearance procedure

1 For failures attributed to the SGSN, check the target UTRAN/GERAN network for
errors related to inter-system mobility procedures.
For failures attributed to the MME, check the Gn link status and MME service status.
Check fs.log for error indications related to Gn interface procedures; contact the next
level of support if internal MME errors are indicated.

END OF STEPS


LSS_cpiHOfailuresFromGERANoverS3
Description
The raised alarm, LSS_cpiHOfailuresFromGERANoverS3, indicates that the value of
VS.cpiHOfailuresFromGERANoverS3 has exceeded a threshold in the last 5-minute
interval. This counter monitors the failure rate of attempted handovers from GERAN to an
E-UTRAN SGSN using the S3 interface. The failure rate is compared against provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Attempted handovers from GERAN to E-UTRAN SGSN via the S3 interface may fail for
any of the following reasons:
Protocol Errors on the S3 interface with the SGSN
S3 link inhibited due to link lock or link disabled due to dependency on the parent
managed object
The UE Context requested by an SGSN is not available in the MME
UE failed security validation at the MME
Failure to complete Routing Area Update procedure at the SGSN
Network connectivity problems between MME and SGSN
MME failure to release resources at completion of Routing Area Update procedure
Internal MME error

Fault clearance procedure

1 For failures attributed to the SGSN, check the target UTRAN/GERAN network for
errors related to inter-system mobility procedures.
For failures attributed to the MME, check the S3 link status and MME service status.
Check fs.log for error indications related to S3 interface procedures; contact the next level
of support if internal MME errors are indicated.

END OF STEPS


LSS_cpiHOfailuresFromUTRANoverS3
Description
The raised alarm, LSS_cpiHOfailuresFromUTRANoverS3, indicates that the value of
VS.cpiHOfailuresFromUTRANoverS3 has exceeded a threshold in the last 5 minute
interval. This counter monitors the failure rate of attempted handovers from a UTRAN
SGSN to E-UTRAN using the S3 interface. The failure rate is compared against provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5
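
The default criteria above can be read as a simple lookup from rate value to severity. The following is a minimal sketch, assuming the rate value is already available as a number. The function name classify_severity and the threshold tuple are illustrative and are not part of the WMM software. Thresholds are passed as a parameter because the text above states that they are provisioned.

```python
from typing import Optional, Tuple

# Default thresholds from the criteria above: (minor, major, critical).
DEFAULT_THRESHOLDS: Tuple[float, float, float] = (2.0, 5.0, 10.0)


def classify_severity(rate: float,
                      thresholds: Tuple[float, float, float] = DEFAULT_THRESHOLDS
                      ) -> Optional[str]:
    """Map a failure-rate value to a severity, or None if no threshold is exceeded.

    Hypothetical helper for illustration only. The boundaries follow the
    documented ranges, for example Major is 5 < rate <= 10.
    """
    minor, major, critical = thresholds
    if rate > critical:
        return "CRITICAL"
    if rate > major:
        return "MAJOR"
    if rate > minor:
        return "MINOR"
    return None
```

Under these defaults, a rate value of exactly 10 maps to MAJOR, because Critical requires a value strictly greater than 10, and a rate value of exactly 2 raises no alarm.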

Root Cause
Attempted handovers from a UTRAN SGSN to E-UTRAN via the S3 interface may fail for
any of the following reasons:
Protocol Errors on the S3 interface with the SGSN
S3 link inhibited due to link lock or link disabled due to dependency on the parent
managed object
The UE Context requested by an SGSN is not available in the MME
UE failed security validation at the MME
Failure to complete Routing Area Update procedure at the SGSN
Network connectivity problems between MME and SGSN
MME failure to release resources at completion of Routing Area Update procedure
Internal MME error

Fault clearance procedure
1 For failures attributed to the SGSN, check the target UTRAN/GERAN network for
errors related to inter-system mobility procedures.
For failures attributed to the MME, check the S3 link status and MME service status.
Check fs.log for error indications related to S3 interface procedures. Contact the next
level of support if internal MME errors are indicated.

END OF STEPS


LSS_cpiHOfailuresRAUto2G3GOverS3
Description
The raised alarm, LSS_cpiHOfailuresRAUto2G3GOverS3, indicates that the failure rate of
attempted Routing Area Update (RAU) procedures from E-UTRAN to a
UTRAN/GERAN SGSN using the S3 interface has exceeded a threshold in the last 5
minute interval. Failures encountered at any point during the RAU procedure are
included, both prior to and after SGW change determination. The failure rate is
compared against provisioned thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Attempted Routing Area Update procedures from E-UTRAN to a UTRAN/GERAN
SGSN using the S3 interface may fail for any of the following reasons:
Protocol Errors on the S3 interface with the SGSN
S3 link inhibited due to link lock or link disabled due to dependency on the parent
managed object
Internal MME resource overload
The UE Context requested by an SGSN is not available in the MME
UE failed security validation at the MME
Failure to complete Routing Area Update procedure at the SGSN
Network connectivity problems between MME and SGSN
MME failure to release resources at completion of Routing Area Update procedure
Internal MME error

Fault clearance procedure
1 For failures attributed to the SGSN, check the target UTRAN/GERAN network for errors
related to inter-system mobility procedures.

2 For failures attributed to the MME, check the S3 link status and MME service status.

3 Check fs.log for error indications related to S3 interface procedures. Contact
Alcatel-Lucent Customer Support if internal MME errors are indicated.

END OF STEPS


LSS_cpiHOfailuresRAUto2G3GnewSgwOverS3
Description
The raised alarm, LSS_cpiHOfailuresRAUto2G3GnewSgwOverS3, indicates that the
value of VS.cpiHOfailuresRAUto2G3GnewSgwOverS3 has exceeded a threshold in the
last 5 minute interval. This counter monitors the failure rate of attempted RAU-based
handovers from E-UTRAN to a UTRAN/GERAN SGSN using the S3 interface with
SGW Relocation. These handovers are Routing Area Update procedures. The failure rate is compared
against provisioned thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Attempted RAU-based handovers from E-UTRAN to a UTRAN/GERAN SGSN via the
S3 interface may fail for any of the following reasons:
Protocol Errors on the S3 interface with the SGSN
S3 link inhibited due to link lock or link disabled due to dependency on the parent
managed object
The UE Context requested by an SGSN is not available in the MME
UE failed security validation at the MME
Failure to complete Routing Area Update procedure at the SGSN
Network connectivity problems between MME and SGSN
MME failure to release resources at completion of Routing Area Update procedure
Internal MME error

Fault clearance procedure
1 For failures attributed to the SGSN, check the target UTRAN/GERAN network for
errors related to inter-system mobility procedures.
For failures attributed to the MME, check the S3 link status and MME service status.
Check fs.log for error indications related to S3 interface procedures. Contact the next
level of support if internal MME errors are indicated.

END OF STEPS


LSS_cpiHOfailuresRAUto2G3GsameSgwOverS3
Description
The raised alarm, LSS_cpiHOfailuresRAUto2G3GsameSgwOverS3, indicates that the
value of VS.cpiHOfailuresRAUto2G3GsameSgwOverS3 has exceeded a threshold in the
last 5 minute interval. This counter monitors the failure rate of attempted handovers from
E-UTRAN to a UTRAN/GERAN SGSN using the S3 interface without SGW Relocation.
These handovers are Routing Area Update procedures. The failure rate is compared against provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Attempted RAU-based handovers from E-UTRAN to a UTRAN/GERAN SGSN via the
S3 interface may fail for any of the following reasons:
Protocol Errors on the S3 interface with the SGSN
S3 link inhibited due to link lock or link disabled due to dependency on the parent
managed object
The UE Context requested by an SGSN is not available in the MME
UE failed security validation at the MME
Failure to complete Routing Area Update procedure at the SGSN
Network connectivity problems between MME and SGSN
MME failure to release resources at completion of Routing Area Update procedure
Internal MME error

Fault clearance procedure
1 For failures attributed to the SGSN, check the target UTRAN/GERAN network for
errors related to inter-system mobility procedures.
For failures attributed to the MME, check the S3 link status and MME service status.
Check fs.log for error indications related to S3 interface procedures. Contact the next
level of support if internal MME errors are indicated.

END OF STEPS


LSS_cpiHOfailuresToGERANoverS3
Description
The raised alarm, LSS_cpiHOfailuresToGERANoverS3, indicates that the value of
VS.cpiHOfailuresToGERANoverS3 has exceeded a threshold in the last 5 minute
interval. This counter monitors the failure rate of attempted handovers from E-UTRAN to
a GERAN SGSN using the S3 interface. The failure rate is compared against provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Attempted handovers from E-UTRAN to a GERAN SGSN via the S3 interface may fail
for any of the following reasons:
Protocol Errors on the S3 interface with the SGSN
S3 link inhibited due to link lock or link disabled due to dependency on the parent
managed object
The UE Context requested by an SGSN is not available in the MME
UE failed security validation at the MME
Failure to complete Routing Area Update procedure at the SGSN
Network connectivity problems between MME and SGSN
MME failure to release resources at completion of Routing Area Update procedure
Internal MME error

Fault clearance procedure
1 For failures attributed to the SGSN, check the target UTRAN/GERAN network for
errors related to inter-system mobility procedures.
For failures attributed to the MME, check the S3 link status and MME service status.
Check fs.log for error indications related to S3 interface procedures. Contact the next
level of support if internal MME errors are indicated.

END OF STEPS


LSS_cpiHOfailuresToUTRANoverS3
Description
The raised alarm, LSS_cpiHOfailuresToUTRANoverS3, indicates that the value of
VS.cpiHOfailuresToUTRANoverS3 has exceeded a threshold in the last 5 minute
interval. This counter monitors the failure rate of attempted handovers from E-UTRAN to
a UTRAN SGSN using the S3 interface. The failure rate is compared against provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Attempted handovers from E-UTRAN to a UTRAN SGSN via the S3 interface may fail
for any of the following reasons:
Protocol Errors on the S3 interface with the SGSN
S3 link inhibited due to link lock or link disabled due to dependency on the parent
managed object
The UE Context requested by an SGSN is not available in the MME
UE failed security validation at the MME
Failure to complete Routing Area Update procedure at the SGSN
Network connectivity problems between MME and SGSN
MME failure to release resources at completion of Routing Area Update procedure
Internal MME error

Fault clearance procedure
1 For failures attributed to the SGSN, check the target UTRAN/GERAN network for
errors related to inter-system mobility procedures.
For failures attributed to the MME, check the S3 link status and MME service status.
Check fs.log for error indications related to S3 interface procedures. Contact the next
level of support if internal MME errors are indicated.

END OF STEPS


LSS_cpiMAFCommunicationFailureRate
Description
The raised alarm, LSS_cpiMAFCommunicationFailureRate, indicates that the MAF
communication failure rate has met a threshold, on a per MAF service basis, in the last
5 minutes. The failure rate is calculated from the measurement counts
VS.TotalMsgsRcvdFromMAF and VS.TotalMsgsSentToMAF in every interval of 5 minutes.
On the MI GUI, the alarm resource indicates which MAF service in the MAF pool has the
problem.
Notes:
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the subsequent intervals.
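
The two notes above describe a raise-once and clear behavior that can be sketched as a small state tracker. This is a minimal illustration under stated assumptions, not the WMM implementation: the class name AlarmTracker, the action strings, and the rule that a change of severity replaces the previously active level are all assumptions made for the example.

```python
from typing import Dict, Optional, Tuple

Key = Tuple[str, str]  # (CPI name, component); both are illustrative identifiers


class AlarmTracker:
    """Track the active severity per (CPI, component) across 5-minute intervals."""

    def __init__(self) -> None:
        self._active: Dict[Key, str] = {}

    def evaluate(self, cpi: str, component: str,
                 severity: Optional[str]) -> Optional[str]:
        """Return "RAISE <severity>", "CLEAR", or None when nothing changes.

        severity is the level met in the interval just ended, or None if no
        threshold was met in that interval.
        """
        key = (cpi, component)
        current = self._active.get(key)
        if severity is None:
            if current is None:
                return None
            # No threshold met in this interval: the alarm clears.
            del self._active[key]
            return "CLEAR"
        if severity == current:
            # Same severity already raised for this CPI and component.
            return None
        # Assumption: a new severity replaces the previously active level.
        self._active[key] = severity
        return f"RAISE {severity}"
```

In this sketch, two consecutive intervals at MINOR produce a single raise, and a later interval in which no threshold is met produces one clear.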

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 95%
Major Alarm: 90% < CPI value <= 95%
Minor Alarm: 80% < CPI value <= 90%

Root Cause
This is a safety net alarm to provide notification in the event that MAF processing of
message traffic has significantly dropped (e.g. hung processes) and other monitoring
mechanisms have not identified and corrected the problem.
Communication issues between MIF and MAF services may also cause this alarm.

Fault clearance procedure


1 Check the overload status of the MAF service firing this alarm.

2 Check if there is any hung process in the MAF service firing this alarm.

3 If the MAF service is duplex, try to switch the active MAF service.

4 Contact Alcatel-Lucent Technical Support if the problem persists.

END OF STEPS


LSS_cpiMBMSSessionStartM3FailureRate
Description
The raised alarm LSS_cpiMBMSSessionStartM3FailureRate indicates meeting a
threshold of the MBMS Session Start M3 Failure Rate CPI, which is calculated every 5
minutes using this formula:
100 - (100 * ((VS.NbrSuccessMBMSsessionStartM3 + VS.AbortMBMSsessionStopM3)
/ VS.AttMBMSsessionStartM3))
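
The formula above can be checked with a short calculation. The following is a minimal sketch, assuming the three counter values for one 5-minute interval are already available as integers. The function name and parameter names are illustrative, and returning None when there were no attempts is an assumption, since the source does not state how an interval without attempts is handled.

```python
from typing import Optional


def mbms_session_start_m3_failure_rate(successes: int,
                                       aborts: int,
                                       attempts: int) -> Optional[float]:
    """Return the failure-rate CPI in percent, or None when there were no attempts.

    successes -> VS.NbrSuccessMBMSsessionStartM3
    aborts    -> the abort counter named in the formula above
    attempts  -> VS.AttMBMSsessionStartM3
    """
    if attempts == 0:
        # Assumption: with no attempts the CPI is undefined, so nothing is reported.
        return None
    return 100.0 - (100.0 * ((successes + aborts) / attempts))
```

For example, with 200 attempts, 170 successes, and 10 aborts, the formula gives 100 - (100 * (180 / 200)) = 10.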
Notes:
The thresholds are configurable on the CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Attempted MBMS Session Start procedures between the MME and MCEs using the M3
interface may fail for any of the following reasons:
Protocol Errors on an M3 interface with an MCE
MCE failure response
MME timeout awaiting MCE response
Internal MME resource overload or exhaustion
Internal MME error

Fault clearance procedure


1 For failures attributed to the MCE, check the MCE/eNB and network connections for
errors.

2 For failures attributed to the MME, check the M3 link status and MME service status.

3 Check fs.log for error indications related to M3 interface procedures. Contact
Alcatel-Lucent Customer Support if internal MME errors are indicated.

END OF STEPS


LSS_cpiMBMSSessionStartSmFailureRate
Description
The raised alarm LSS_cpiMBMSSessionStartSmFailureRate indicates meeting a
threshold of the MBMS Session Start Sm Failure Rate CPI, which is calculated every 5
minutes using this formula:
100 - (100 * (VS.NbrSuccessMBMSsessionStartSm / VS.AttMBMSsessionStartSm))
Notes:
The thresholds are configurable on the CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Attempted MBMS Session Start procedures between MBMS-GW and the MME using the
Sm interface may fail for any of the following reasons:
Protocol Errors on the Sm interface with the MBMS-GW
Sm link inhibited due to link lock or link disabled due to dependency on the parent
managed object
MBMS functionality disabled for the PLMN specified in the TMGI
Internal MME resource overload or exhaustion
Internal MME error

Fault clearance procedure


1 For failures attributed to the MBMS-GW, check the MBMS-GW and network for errors.

2 For failures attributed to the MME, check the Sm link status and MME service status.

3 Check fs.log for error indications related to Sm interface procedures. Contact
Alcatel-Lucent Customer Support if internal MME errors are indicated.

END OF STEPS


LSS_cpiMBMSSessionStopM3FailureRate
Description
The raised alarm LSS_cpiMBMSSessionStopM3FailureRate indicates meeting a
threshold of the MBMS Session Stop M3 Failure Rate CPI, which is calculated every 5
minutes using this formula:
100 - (100 * ((VS.NbrSuccessMBMSsessionStopM3 + VS.AbortMBMSsessionStopM3) /
VS.AttMBMSsessionStopM3))
Notes:
The thresholds are configurable on CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%
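
As a rough illustration of the formula and default thresholds above, the following Python sketch computes the MBMS Session Stop M3 Failure Rate CPI from the three counters and maps the result to a default severity. The function names, and the choice to report 0.0 for an interval with no attempts, are assumptions made for this example and do not describe the MME implementation.

```python
from typing import Optional


def session_stop_m3_failure_rate(successes: int, aborts: int, attempts: int) -> float:
    """Return the failure-rate CPI in percent for one 5-minute interval.

    Mirrors the documented formula:
        100 - (100 * ((NbrSuccess + Abort) / Att))
    Assumption: an interval with no attempts is reported as 0.0.
    """
    if attempts <= 0:
        return 0.0
    return 100.0 - (100.0 * ((successes + aborts) / attempts))


def default_severity(cpi_percent: float) -> Optional[str]:
    """Map a CPI value to the default severity, or None if no threshold is met."""
    if cpi_percent > 15.0:
        return "CRITICAL"
    if cpi_percent > 10.0:
        return "MAJOR"
    if cpi_percent > 5.0:
        return "MINOR"
    return None
```

For example, with 200 attempts, 170 successes, and 6 aborts in the interval, the CPI is 100 - (100 * (176 / 200)) = 12, which falls in the default Major band (10% < CPI value <= 15%).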

Root Cause
Attempted MBMS Session Stop procedures between the MME and MCEs using the
M3 interface may fail for any of the following reasons:
Protocol Errors on an M3 interface with an MCE
Internal MME resource overload or exhaustion
Internal MME error

Fault clearance procedure


1 For failures attributed to the MCE, check the MCE/eNB and network connections for
errors.

2 For failures attributed to the MME, check the M3 link status and MME service status.

3 Check fs.log for error indications related to M3 interface procedures. Contact
Alcatel-Lucent Customer Support if internal MME errors are indicated.

END OF STEPS


LSS_cpiMBMSSessionStopSmFailureRate
Description
The raised alarm LSS_cpiMBMSSessionStopSmFailureRate indicates meeting a
threshold of the MBMS Session Stop Sm Failure Rate CPI, which is calculated every 5
minutes using this formula:
100 - (100 * (VS.NbrSuccessMBMSsessionStopSm / VS.AttMBMSsessionStopSm))
Notes:
The thresholds are configurable on CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.
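
The raise-once and clear behavior described in the notes above can be sketched as a small state tracker. This is only an illustration: the class and method names are hypothetical, "one of the following intervals" is read here as the first later interval in which no threshold is met, and the handling of a severity change (the new severity replaces the old one) is an assumption that this document does not confirm.

```python
from typing import Dict, Optional, Tuple

AlarmKey = Tuple[str, str]  # (CPI name, component); both names are illustrative


class CpiAlarmState:
    """Tracks the severity currently raised for each CPI and component."""

    def __init__(self) -> None:
        self._raised: Dict[AlarmKey, str] = {}

    def update(self, cpi: str, component: str, severity: Optional[str]) -> Optional[str]:
        """Apply one interval's result and return the action taken, or None.

        severity is None when no threshold was met in the interval.
        """
        key = (cpi, component)
        current = self._raised.get(key)
        if severity is None:
            if current is None:
                return None  # nothing raised, nothing to clear
            del self._raised[key]
            return f"CLEAR {current}"
        if severity == current:
            return None  # same severity is raised only once
        # Assumption: a change of severity replaces the previous alarm.
        self._raised[key] = severity
        return f"RAISE {severity}"
```

Feeding the tracker MINOR, MINOR, and then None for the same CPI and component yields one raise, no action, and one clear.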

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Attempted MBMS Session Stop procedures between MBMS-GW and the MME using the
Sm interface may fail for any of the following reasons:
Protocol Errors on the Sm interface with the MBMS-GW
Sm link inhibited due to link lock or link disabled due to dependency on the parent
managed object
MBMS bearer context not found
Internal MME error

Fault clearance procedure


1 For failures attributed to the MBMS-GW, check the MBMS-GW and network for errors.

2 For failures attributed to the MME, check the Sm link status and MME service status.

3 Check fs.log for error indications related to Sm interface procedures. Contact
Alcatel-Lucent Customer Support if internal MME errors are indicated.

END OF STEPS


LSS_cpiMBMSSessionUpdateM3FailureRate
Description
The raised alarm LSS_cpiMBMSSessionUpdateM3FailureRate indicates meeting a
threshold of the MBMS Session Update M3 Failure Rate CPI, which is calculated every 5
minutes using this formula:
100 - (100 * ((VS.NbrSuccessMBMSsessionUpdateM3 + VS.AbortMBMSsessionStopM3) /
VS.AttMBMSsessionUpdateM3))
Notes:
The thresholds are configurable on CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Attempted MBMS Session Update procedures between the MME and MCEs using the
M3 interface may fail for any of the following reasons:
Protocol Errors on an M3 interface with an MCE
MCE failure response
MME timeout awaiting MCE response
Internal MME resource overload or exhaustion
Internal MME error

Fault clearance procedure


1 For failures attributed to the MCE, check the MCE/eNB and network connections for
errors.

2 For failures attributed to the MME, check the M3 link status and MME service status.

3 Check fs.log for error indications related to M3 interface procedures. Contact
Alcatel-Lucent Customer Support if internal MME errors are indicated.

END OF STEPS


LSS_cpiMBMSSessionUpdateSmFailureRate
Description
The raised alarm LSS_cpiMBMSSessionUpdateSmFailureRate indicates meeting a
threshold of the MBMS Session Update Sm Failure Rate CPI, which is calculated every 5
minutes using this formula:
100 - (100 * (VS.NbrSuccessMBMSsessionUpdateSm /
VS.AttMBMSsessionUpdateSm))
Notes:
The thresholds are configurable on CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Attempted MBMS Session Update procedures between MBMS-GW and the MME using
the Sm interface may fail for any of the following reasons:
Protocol Errors on the Sm interface with the MBMS-GW
Sm link inhibited due to link lock or link disabled due to dependency on the parent
managed object
MBMS bearer context not found
Internal MME error

Fault clearance procedure


1 For failures attributed to the MBMS-GW, check the MBMS-GW and network for errors.

2 For failures attributed to the MME, check the Sm link status and MME service status.

3 Check fs.log for error indications related to Sm interface procedures. Contact
Alcatel-Lucent Customer Support if internal MME errors are indicated.

END OF STEPS


LSS_cpiMafAttachFailuresSysRelated
Description
The raised alarm, LSS_cpiMafAttachFailuresSysRelated, indicates meeting/exceeding a
threshold of the rate of system-related failures for Attach procedures, which is calculated
every 5 minutes, using the formula:
VS.NbrAttachFailureSysRelated_sum / VS.AttAttachRequests
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10
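
The formula above is printed as a plain ratio, while the default thresholds (5, 10, 15) are given without units. The following Python sketch computes the ratio as written and, separately, scales it by 100. Treating the thresholds as percentages is an assumption made for this example only, and the function names are hypothetical.

```python
def attach_sys_failure_ratio(sys_related_failures: int, attach_attempts: int) -> float:
    """Ratio as printed: NbrAttachFailureSysRelated_sum / AttAttachRequests.

    Assumption: an interval with no Attach attempts is reported as 0.0.
    """
    if attach_attempts <= 0:
        return 0.0
    return sys_related_failures / attach_attempts


def as_percent(ratio: float) -> float:
    """Scale a ratio to percent (assumed unit of the default thresholds)."""
    return 100.0 * ratio
```

With 12 system-related failures in 150 Attach attempts the ratio is 0.08. Under the percent assumption this is 8, which lies in the default Minor band (5 < Rate value <= 10).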

Root Cause
Possible reasons for failure:
Failures at the eNB elements
Failures at the SGW elements
Failures at the MME elements

Fault clearance procedure


1
Verify that the S1, S6a, and S11 links are in-service/normal, using link_cli.
Verify that there are no overload alarms on the MME.
Contact Alcatel-Lucent Customer Support.

END OF STEPS


LSS_cpiMafAttachWithPGWreselection
Description
The raised alarm LSS_cpiMafAttachWithPGWreselection indicates meeting a threshold of the rate
of PGW reselection during Attach procedures CPI, which is calculated every 5 minutes
using this formula:
VS.AttachWithPGWreselection/VS.AttAttachRequests
Notes:
The thresholds are configurable on MI CPI GUI.
An alarm with the same severity will be raised only once for the same CPI and
component.
The alarm will be cleared if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 50%
Major Alarm: 30% < CPI value <= 50%
Minor Alarm: 15% < CPI value <= 30%
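
Because the thresholds are configurable, a set of thresholds can be modeled as a small value object whose defaults match the table above. The following Python sketch is illustrative only: the class name is hypothetical, and the ordering check (minor < major < critical) is an assumption about what a sensible configuration looks like, not a description of what the GUI enforces.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CpiThresholds:
    """Severity thresholds in percent; defaults follow the table above."""

    minor: float = 15.0
    major: float = 30.0
    critical: float = 50.0

    def __post_init__(self) -> None:
        # Assumed sanity check: thresholds must be strictly increasing.
        if not (0.0 <= self.minor < self.major < self.critical <= 100.0):
            raise ValueError("expected 0 <= minor < major < critical <= 100")

    def severity(self, cpi_percent: float) -> Optional[str]:
        """Return the severity for a CPI value, or None if no threshold is met."""
        if cpi_percent > self.critical:
            return "CRITICAL"
        if cpi_percent > self.major:
            return "MAJOR"
        if cpi_percent > self.minor:
            return "MINOR"
        return None
```

With the defaults, a CPI value of exactly 50% is still Major, because the Critical band starts strictly above 50%.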

Root Cause
Timeout or a Reject received from the SGW.

Fault clearance procedure


1 Contact Alcatel-Lucent customer support.

END OF STEPS


LSS_cpiMafAttachWithSGWreselection
Description
The raised alarm LSS_cpiMafAttachWithSGWreselection indicates meeting a threshold of the rate
of SGW reselection during Attach procedures CPI, which is calculated every 5 minutes
using this formula:
VS.AttachWithSGWreselection/VS.AttAttachRequests
Notes:
The thresholds are configurable on MI CPI GUI.
An alarm with the same severity will be raised only once for the same CPI and
component.
The alarm will be cleared if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 50%
Major Alarm: 30% < CPI value <= 50%
Minor Alarm: 15% < CPI value <= 30%

Root Cause
Timeout or a Reject received from the SGW.

Fault clearance procedure


1 Contact Alcatel-Lucent customer support.

END OF STEPS


LSS_cpiMafEIRfailuresS13
Description
The raised alarm, LSS_cpiMafEIRfailuresS13, indicates that the value of
VS.LSS_cpiMafEIRfailuresS13 has exceeded a threshold in the last 5-minute interval.
This counter monitors the percentage of unsuccessful EquipmentCheckRequests (ECRs)
relative to the number of ECRs attempted. The calculated percentage is compared against
provisioned thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Failure of ECR responses from the S13 EIR interface could be due to any of the following
reasons:
Network problems between MME and HSS (EIR).
Errors or problems at far end HSS (EIR).
Internal errors at the MME.
Parsing/decoding errors in the ECR response.
IMEI not found in EIR DB

Fault clearance procedure


1 Verify the far end HSS (EIR) is functioning properly. Check fs.log for any ECR/ECA/S13
related errors to aid in determining the cause. Contact next level of support.

END OF STEPS
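
The fs.log check in the step above can be done with any text search tool. As one possible approach, the following Python sketch filters log lines that mention ECR, ECA, or S13 as standalone tokens. The layout of fs.log is not described in this document, so the sketch assumes only that it is a line-oriented text file. The function name and the matching rule are hypothetical.

```python
import re
from typing import Iterable, List

# Match ECR, ECA, or S13 when not embedded in a longer alphanumeric word.
S13_PATTERN = re.compile(
    r"(?<![A-Za-z0-9])(?:ECR|ECA|S13)(?![A-Za-z0-9])", re.IGNORECASE
)


def find_s13_related(lines: Iterable[str]) -> List[str]:
    """Return the lines that mention ECR, ECA, or S13, without trailing newlines."""
    return [line.rstrip("\n") for line in lines if S13_PATTERN.search(line)]
```

The function accepts any iterable of strings, so an open file object for a copy of fs.log can be passed to it directly.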


LSS_cpiMafExtServiceReqFailuresSysRelated
Description
The raised alarm LSS_cpiMafExtServiceReqFailuresSysRelated indicates meeting a
threshold of the Extended Service Request System Related Failure CPI, which is
calculated every 5 minutes using this formula:
100 * (VS.NbrFailedExtSvcRequestsSysRelated_sum / VS.AttExtServiceRequests)
Notes:
The thresholds are configurable on CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Failure could be due to any of the following reasons:
ENB returns UE Context Modification Failure (includes lack of resources, collision
with other procedure or protocol errors)
ENB returns Initial Context Setup Failure (includes lack of resources, collision with
other procedure or protocol errors)
ENB returns failure with cause relating to invalid/mismatched eRAB Id or mmeS1AP
Id sent by MME
MME fails to process Extended Service Request due to a system related failure on
MME
SGW failure on Modify Bearer Request during CSFB call

Fault clearance procedure

1 Verify that S1, S6a, S11, and SGs links are Unlocked/Enabled using link_cli.

2 Verify that there are no overload alarms on the MME.

3 Contact Customer Support.

END OF STEPS


LSS_cpiMafExtServiceRequestFailures
Description
The raised alarm LSS_cpiMafExtServiceRequestFailures indicates meeting a threshold of
the Extended Service Request Failure CPI, which is calculated every 5 minutes using this
formula:
100 - (100 * (VS.NbrSuccessExtServiceRequests / VS.AttExtServiceRequests))
Notes:
The thresholds are configurable on CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%
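
The two Extended Service Request CPIs use different formulas over the same attempt counter: the overall failure CPI above is derived from successes, while the system-related CPI (LSS_cpiMafExtServiceReqFailuresSysRelated) is derived directly from system-related failures. The following Python sketch places the two side by side. The function names and the zero-attempt handling are assumptions for illustration.

```python
def ext_service_request_failure_rate(successes: int, attempts: int) -> float:
    """Overall failure CPI in percent: 100 - (100 * (NbrSuccess / Att)).

    Assumption: an interval with no attempts is reported as 0.0.
    """
    if attempts <= 0:
        return 0.0
    return 100.0 - (100.0 * (successes / attempts))


def ext_service_request_sys_failure_rate(sys_failures: int, attempts: int) -> float:
    """System-related failure CPI in percent: 100 * (NbrFailedSysRelated_sum / Att).

    Assumption: an interval with no attempts is reported as 0.0.
    """
    if attempts <= 0:
        return 0.0
    return 100.0 * (sys_failures / attempts)
```

For example, with 400 attempts, 340 successes, and 24 system-related failures, the overall CPI is 15 and the system-related CPI is 6. If system-related failures are counted among the overall failures, which the Root Cause list for this alarm suggests, the remaining 9 points come from other causes.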

Root Cause
Failure could be due to any of the following reasons:
Extended Service Request rejected by MME due to protocol errors
For Mobile Terminated CSFB call, UE included rejection in CSFB Response IE of
Extended Service Request
Extended Service Request rejected by MME due to access restrictions (PLMN, TA,
EPS service, non-EPS service not allowed)
Extended Service Request rejected by MME due to roaming restrictions
Extended Service Request rejected by MME due to TA not available
For SGs based CSFB call, Extended Service Request rejected by MME due to TAI not
mapped to LAI or mapped to LAI not supporting CSFB
Extended Service Request rejected by MME due to UE implicitly detached
Extended Service Request rejected by MME due to problems with SGs link to MSC
Extended Service Request rejected by MME due to congestion
Extended Service Request aborted on MME due to collision with other procedure
pending for UE
MME did not receive UE Context Release Request from ENB after successful
processing of Extended Service Request
CSFB call failed due to MME, ENB or SGW related System Failure. For more details
see the Extended Service Request System Failures description

Fault clearance procedure


1 Verify MME provisioning data, especially PLMN, TAI-LAI-Mapping, LAI tables.

2 Verify that S1, S6a, S11, and SGs links are Unlocked/Enabled using link_cli.

3 Verify that there are no overload alarms on the MME.

4 Contact Customer Support.

END OF STEPS


LSS_cpiMafFailuresOverSGs
Description
The raised alarm, LSS_cpiMafFailuresOverSGs, indicates meeting/exceeding a threshold
of the rate of failure for handling messages from the SGs interface, which is calculated
every 5 minutes, using the formula:
VS.NbrFailedSGsSignalingProcedures / VS.AttSGsSignalingProcedures
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10
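
The rate formula and the default severity criteria above can be sketched in a few lines of Python. This is an illustration only, not the MME implementation. It assumes that "Rate value" is the failure ratio expressed as a percentage, that an interval with no attempts yields a rate of zero, and that the strict inequalities in the Severity Details apply. The function and constant names are hypothetical.

```python
from typing import Optional

# Default thresholds quoted in the Severity Details above.
# Assumption: "Rate value" is the failure ratio expressed as a percentage.
DEFAULT_THRESHOLDS = (("CRITICAL", 15.0), ("MAJOR", 10.0), ("MINOR", 5.0))


def failure_rate_percent(failed: int, attempted: int) -> float:
    """Failed / attempted, scaled to percent.

    Assumption: an interval with no attempts is treated as a rate of 0.0.
    """
    if attempted <= 0:
        return 0.0
    return 100.0 * failed / attempted


def severity_for(rate: float, thresholds=DEFAULT_THRESHOLDS) -> Optional[str]:
    """Return the highest severity whose lower bound the rate strictly exceeds."""
    for name, lower_bound in thresholds:
        if rate > lower_bound:
            return name
    return None  # no threshold met, so no alarm


if __name__ == "__main__":
    # Hypothetical counter values for one 5-minute interval:
    # VS.NbrFailedSGsSignalingProcedures = 12, VS.AttSGsSignalingProcedures = 100
    rate = failure_rate_percent(failed=12, attempted=100)
    print(rate, severity_for(rate))
```

With the hypothetical counters shown, the rate is 12 percent, which falls in the default Major band (10 < Rate value <= 15).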

Root Cause
Possible reasons for failure:
Possible problems with the SGs links
Internal Failure

Fault clearance procedure

1 Verify that the SGs links are in-service/normal, using link_cli.
2 Contact Alcatel-Lucent Customer Support.

END OF STEPS


LSS_cpiMafHLRAuthFail
Description
The raised alarm, LSS_cpiMafHLRAuthFail, indicates meeting/exceeding a threshold of
the rate of failure for handling Authentication failure messages from the HLR, which is
calculated every 5 minutes, using the formula:
100 * (1 - (VS.NbrSuccessAuthRequestsHLR / VS.AttAuthRequestsHLR))
Notes:
THIS ALARM IS RESERVED FOR FUTURE USE.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Possible reasons for failure:
Possible problems with the Gr link
Protocol errors reported from the far end (such as unknown subscriber, unexpected
data value, missing data, and system failure)
Internal Failure

Fault clearance procedure

1 Verify that the Gr link is in-service/normal, using link_cli.
2 Contact Alcatel-Lucent Customer Support.

END OF STEPS


LSS_cpiMafHSSreselection
Description
The raised alarm, LSS_cpiMafHSSreselection, indicates meeting a threshold of the CPI
for the rate of HSS reselection during Authentication or Update Location procedures,
which is calculated every 5 minutes using this formula:
VS.HssReselectionAtt/(VS.AttAuthRequestsHSS+VS.AttUpdateLocationRequest)
Notes:
The thresholds are configurable on MI CPI GUI.
An alarm with the same severity will be raised only once for the same CPI and
component.
The alarm will be cleared if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 50%
Major Alarm: 30% < CPI value <= 50%
Minor Alarm: 15% < CPI value <= 30%
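
The notes and default thresholds above describe a small state machine: the CPI is evaluated each interval, an alarm of a given severity is raised only once, and the alarm clears when no threshold is met. The Python sketch below models that behavior under stated assumptions. It is not the MME implementation; the class and function names are hypothetical, and how the system handles a change from one severity to another is assumed here to be a single new raise.

```python
from typing import Optional, Tuple

# Default CPI thresholds (percent) from the Severity Details above.
THRESHOLDS = (("CRITICAL", 50.0), ("MAJOR", 30.0), ("MINOR", 15.0))


def hss_reselection_cpi(reselections: int, auth_att: int, ul_att: int) -> float:
    """VS.HssReselectionAtt / (VS.AttAuthRequestsHSS + VS.AttUpdateLocationRequest), in percent.

    Assumption: an interval with no attempts yields a CPI of 0.0.
    """
    attempts = auth_att + ul_att
    return 100.0 * reselections / attempts if attempts > 0 else 0.0


def classify(cpi: float) -> Optional[str]:
    """Map a CPI value to the default severity band, or None if no threshold is met."""
    for name, lower_bound in THRESHOLDS:
        if cpi > lower_bound:
            return name
    return None


class CpiAlarmState:
    """Tracks the severity currently raised for one CPI and component (hypothetical model)."""

    def __init__(self) -> None:
        self.active: Optional[str] = None

    def update(self, cpi: float) -> Optional[Tuple[str, Optional[str]]]:
        """Return ("RAISE", severity), ("CLEAR", old severity), or None for no event."""
        severity = classify(cpi)
        if severity == self.active:
            return None  # an alarm with the same severity is raised only once
        previous, self.active = self.active, severity
        if severity is None:
            return ("CLEAR", previous)  # no threshold met in this interval
        return ("RAISE", severity)  # assumption: a severity change is one new raise


if __name__ == "__main__":
    state = CpiAlarmState()
    # Hypothetical CPI values for five consecutive 5-minute intervals.
    for cpi in (35.0, 40.0, 60.0, 10.0, 10.0):
        print(cpi, state.update(cpi))
```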

Root Cause
The following responses from the HSS may trigger this alarm:
Timeout on a request to HSS
HSS response code with error - TOO BUSY
HSS response code with error - RESOURCES EXCEEDED
HSS response code with error - UNABLE TO DELIVER
HSS response code with error - OUT OF SPACE

Fault clearance procedure

1 Contact Alcatel-Lucent Customer Support.

END OF STEPS


LSS_cpiMafPDNconnWithPGWreselection
Description
The raised alarm, LSS_cpiMafPDNconnWithPGWreselection, indicates meeting a threshold
of the CPI for the rate of PGW reselection during PDN connectivity procedures, which is
calculated every 5 minutes using this formula:
VS.PdnConnPgwReselection/VS.AttPDNConnReq
Notes:
The thresholds are configurable on MI CPI GUI.
An alarm with the same severity will be raised only once for the same CPI and
component.
The alarm will be cleared if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 50%
Major Alarm: 30% < CPI value <= 50%
Minor Alarm: 15% < CPI value <= 30%

Root Cause
Timeout or a Reject received from the SGW.

Fault clearance procedure

1 Contact Alcatel-Lucent Customer Support.

END OF STEPS


LSS_cpiMafServiceReqFailuresSysRelated
Description
The raised alarm, LSS_cpiMafServiceReqFailuresSysRelated, indicates
meeting/exceeding a threshold of the rate of system-related failures for UE Service
Request procedures, which is calculated every 5 minutes, using the formula:
VS.NbrServiceReqFailureSysRelated_sum / VS.AttServiceRequests
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Possible reasons for failure:
Failures at the eNB elements
Failures at the SGW elements
Failures at the MME elements

Fault clearance procedure

1 Verify that the S1, S6a, and S11 links are in-service/normal, using link_cli.
2 Verify that there are no overload alarms on the MME.
3 Contact Alcatel-Lucent Customer Support.

END OF STEPS


LSS_cpiMafTauFailuresInterMme
Description
The raised alarm, LSS_cpiMafTauFailuresInterMme, indicates meeting/exceeding a
threshold of the rate of failure of Tracking Area Update procedures involving MME
relocation, which is calculated every 5 minutes, using the formula:
(VS.TauInterMmeAtt - VS.TauInterMmeSucc) / VS.TauInterMmeAtt
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Possible reasons for failure:
Possible problems with the eNB or the MME
Old MME does not respond
MME is not available to provide service to the UE in the new Tracking Area

Fault clearance procedure

1 Verify that the eNB and MME links are in-service/normal, using link_cli.
2 Contact Alcatel-Lucent Customer Support to determine the status of the serving eNB
and the MME groups serving the eNB that is involved in the TAU procedure.

END OF STEPS


LSS_cpiMafTauFailuresInterMmeInterSgw
Description
The raised alarm, LSS_cpiMafTauFailuresInterMmeInterSgw, indicates
meeting/exceeding a threshold of the rate of failure of Tracking Area Update procedures
involving MME relocation and SGW relocation, which is calculated every 5 minutes,
using the formula:
(VS.TauInterMmeInterSgwAtt - VS.TauInterMmeInterSgwSucc) / VS.TauInterMmeInterSgwAtt
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Possible reasons for failure:
Possible problems with the HSS, eNB, MME, or SGW
MME is not available to provide service to the UE in the new Tracking Area
RF problems may prevent the UE from sending or receiving messages
SGW is not available to provide service to the UE in the new Tracking Area
because of SGW failure (no response from the SGW, or the SGW rejects the Create
Session/Modify Bearer Requests)
HSS failure/error response during Update Location Request
UE not allowed service due to UE subscription information

Fault clearance procedure

1 Verify that the HSS, eNB, SGW, and MME links are in-service/normal, using link_cli.
2 Verify UE subscription information in HSS.
3 Contact Alcatel-Lucent Customer Support to determine the status of the serving eNB,
MME groups, and SGW Pools serving the eNB that are involved in the TAU procedure.

END OF STEPS


LSS_cpiMafTauFailuresInterSgw
Description
The raised alarm, LSS_cpiMafTauFailuresInterSgw, indicates meeting/exceeding a
threshold of the rate of failure of Tracking Area Update procedures involving SGW
relocation, which is calculated every 5 minutes, using the formula:
(VS.TauInterSgwAtt - VS.TauInterSgwSucc) / VS.TauInterSgwAtt
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Possible reasons for failure:
RF problems may prevent the UE from sending or receiving messages
Possible problems with the eNB or the SGW
SGW is not available to provide service to the UE in the new Tracking Area
because of SGW failure (no response from the SGW, or the SGW rejects the Create
Session/Modify Bearer Requests)
Internal failure

Fault clearance procedure

1 Verify that the eNB and SGW links are in-service/normal, using link_cli.
2 Contact Alcatel-Lucent Customer Support to determine the status of the serving eNB
and the SGW Pools serving the eNB that is involved in the TAU procedure.

END OF STEPS


LSS_cpiNoPSHOFailuresOverSv
Description
The raised alarm LSS_cpiNoPSHOFailuresOverSv indicates meeting a threshold of the
Hand Down to UTRAN/GERAN via the Sv interface without PSHO Failure Rate CPI,
which is calculated every 5 minutes using this formula:
1 - ( ( VS.NbrSuccessCSHOSv + VS.NbrCSHOSvAbort_Other + VS.NbrCSHOSvAbort_Canceled )
/ VS.AttCSHOSv )
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5
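
In this formula, aborted handovers (the Abort_Other and Abort_Canceled counters) are added to the successes, so they do not count as failures. The Python sketch below works through the calculation with the default thresholds above. It is an illustration only, assuming that "Rate value" is a percentage and that an interval with no attempts yields a rate of zero; the function names are hypothetical.

```python
from typing import Optional

# Default thresholds from the Severity Details above.
# Assumption: "Rate value" is the failure rate expressed in percent.
SV_THRESHOLDS = (("CRITICAL", 10.0), ("MAJOR", 5.0), ("MINOR", 2.0))


def sv_handover_failure_rate(attempted: int, succeeded: int,
                             aborted_other: int, aborted_canceled: int) -> float:
    """Failure rate per the formula above, in percent.

    Aborted handovers are grouped with successes, so only the remaining
    attempts count as failures.
    Assumption: an interval with no attempts yields 0.0.
    """
    if attempted <= 0:
        return 0.0
    not_failed = succeeded + aborted_other + aborted_canceled
    # Algebraically equal to 100 * (1 - not_failed / attempted); written this
    # way to avoid floating-point rounding right at a threshold boundary.
    return 100.0 * (attempted - not_failed) / attempted


def sv_severity(rate: float) -> Optional[str]:
    """Map a rate to the default severity band, or None if no threshold is met."""
    for name, lower_bound in SV_THRESHOLDS:
        if rate > lower_bound:
            return name
    return None


if __name__ == "__main__":
    # Hypothetical counters for one 5-minute interval:
    # VS.AttCSHOSv = 200, VS.NbrSuccessCSHOSv = 180,
    # VS.NbrCSHOSvAbort_Other = 6, VS.NbrCSHOSvAbort_Canceled = 4
    rate = sv_handover_failure_rate(200, 180, 6, 4)
    print(rate, sv_severity(rate))
```

With these hypothetical counters, 10 of 200 attempts are failures, giving a rate of 5 percent, which sits at the top of the default Minor band (2 < Rate value <= 5).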

Root Cause
Attempted handovers of circuit services only to UTRAN/GERAN via the Sv interface
may fail for any of the following reasons:
Sv interface problems - MME cannot communicate with the MSC
Handover preparation is rejected by the target UTRAN/GERAN network
UE fails to complete handover to the target radio access network due to RF conditions
Subscriber provisioning prohibits handover to UTRAN/GERAN
Internal MME error

Fault clearance procedure

1 Check counters and alarms related to the Sv interface. Verify network connectivity and
proper configuration between the MME and MSC(s).
2 Check the target UTRAN/GERAN network for configuration problems that could
cause the handover preparation attempts to be rejected.
3 Check the source E-UTRAN network and target UTRAN/GERAN network for
handover failure conditions.
4 Check fs.log for error indications related to Sv interface procedures. Contact next
level of support if internal MME errors are indicated.

END OF STEPS


LSS_cpiPSHOFailuresOverSv
Description
The raised alarm LSS_cpiPSHOFailuresOverSv indicates meeting a threshold of the
Hand Down to UTRAN/GERAN via the Sv interface with PSHO Failure Rate CPI, which
is calculated every 5 minutes using this formula:
1 - ( ( VS.NbrSuccessPSHOSv + VS.NbrPSHOSvAbort_Other + VS.NbrPSHOSvAbort_Canceled )
/ VS.AttPSHOSv )
Notes:
The thresholds are configurable on MI CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Attempted SRVCC handovers of circuit and packet services to UTRAN/GERAN via the Sv
interface may fail for any of the following reasons:
Sv interface problems - MME cannot communicate with the MSC
Handover preparation is rejected by the target UTRAN/GERAN network
UE fails to complete handover to the target radio access network due to RF conditions
Subscriber provisioning prohibits handover to UTRAN/GERAN
Internal MME error

Fault clearance procedure

1 Check counters and alarms related to the Sv interface. Verify network connectivity and
proper configuration between the MME and MSC(s).
2 Check the target UTRAN/GERAN network for configuration problems that could
cause the handover preparation attempts to be rejected.
3 Check the source E-UTRAN network and target UTRAN/GERAN network for
handover failure conditions.
4 Check fs.log for error indications related to Sv interface procedures. Contact next
level of support if internal MME errors are indicated.

END OF STEPS


LSS_cpiS3TauFailures
Description
The raised alarm, LSS_cpiS3TauFailures, indicates meeting/exceeding a threshold of the
rate of failure of Tracking Area Update procedures from an SGSN to the MME over an S3
link. This alarm is calculated every 5 minutes using the formula:
(VS.TauAttS3 - VS.TauSuccS3) / VS.TauAttS3
Notes:
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR, WARNING

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Possible failure reasons:
Possible problems with the eNB or the MME.
RF problems prevent UE from sending/receiving messages.
SGW is not available to provide service to the UE in the new Tracking Area.
SGSN does not respond.
Link failures to the SGSN, SGW, eNB and/or HSS.
Internal MME errors.

Fault clearance procedure

1 Verify that the eNB, SGW, HSS, and SGSN S3 links are in-service/normal, using link_cli.
2 Verify the operational status of the SGSN and that the SGSN is responding to messages
over the S3 link.
3 Verify the operational status of the DNS server and that the DNS entries for the SGW are
correct.
4 Contact Alcatel-Lucent Customer Support to determine the status of the serving eNB, the
HSS, and the SGW serving the eNB that is involved in the TAU procedure.

END OF STEPS


LSS_cpiS3TauFailuresInterSgw
Description
The raised alarm, LSS_cpiS3TauFailuresInterSgw, indicates meeting/exceeding a
threshold of the rate of failure of Tracking Area Update procedures from an SGSN to the
MME over an S3 link that involves a change of serving SGW. This alarm is calculated
every 5 minutes using the formula:
(VS.TauInterSgwAttS3 - VS.TauInterSgwSuccS3) / VS.TauInterSgwAttS3
Notes:
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR, WARNING

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Possible failure reasons:
Possible problems with the eNB or the MME.
RF problems prevent UE from sending/receiving messages.
SGW is not available to provide service to the UE in the new Tracking Area.
SGSN does not respond.
Link failures to the SGSN, SGW, eNB and/or HSS.
Internal MME errors.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the eNB, SGW, HSS, and SGSN S3 links are in-service/normal, using link_cli.

....................................................................................................................................................................................................................................
9471 WMM Alarms Alcatel-Lucent Proprietary 2-67
9YZ-05481-0005-RKZZA Release Use pursuant to applicable agreements
WM7.0.0
Issue 1 August 2013
MME Alarms LSS_cpiS3TauFailuresInterSgw

....................................................................................................................................................................................................................................
...................................................................................................................................................................................................

2 Verify the operational status of the SGSN and that the SGSN is responding to messages
over the S3 link.
...................................................................................................................................................................................................

3 Verify the operational status of the DNS server and that the DNS entries for the SGW are
correct.
...................................................................................................................................................................................................

4 Contact Alcatel-Lucent Customer Support to determine the status of the serving eNB, the
HSS, and the SGW serving the eNB that is involved in the TAU procedure.
END OF STEPS


LSS_cpiS3TauFailuresIntraSGW
Description
The raised alarm, LSS_cpiS3TauFailuresIntraSGW, indicates that the failure rate of Tracking
Area Update procedures from an SGSN to the MME over an S3 link that do not involve a change
of serving SGW has met or exceeded a threshold. This alarm is calculated every 5 minutes
using the formula:
(VS.TauIntraSgwAttS3 - VS.TauIntraSgwSuccS3) / VS.TauIntraSgwAttS3
Notes:
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR, WARNING

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Possible failure reasons:
Possible problems with the eNB or the MME.
RF problems prevent UE from sending/receiving messages.
SGW is not available to provide service to the UE in the new Tracking Area.
SGSN does not respond.
Link failures to the SGSN, SGW, eNB and/or HSS.
Invalid DNS entries for the SGW (if SGW Discovery is active/enabled).
Internal MME errors.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the eNB, SGW, HSS, and SGSN S3 links are in-service/normal, using link_cli.

...................................................................................................................................................................................................

2 Verify the operational status of the SGSN and that the SGSN is responding to messages
over the S3 link.
...................................................................................................................................................................................................

3 Verify the operational status of the DNS server and that the DNS entries for the SGW are
correct.
...................................................................................................................................................................................................

4 Contact Alcatel-Lucent Customer Support to determine the status of the serving eNB, the
HSS, and the SGW serving the eNB that is involved in the TAU procedure.
END OF STEPS


LSS_cpiStopWarnMsgDeliveryS1MMEFailureRate
Description
The raised alarm LSS_cpiStopWarnMsgDeliveryS1MMEFailureRate indicates meeting a
threshold of the Stop Warning Message Delivery S1MME Failure Rate CPI, which is
calculated every 5 minutes using this formula:
100 - (100 * (VS.NbrSuccessStopWarnMsgDeliveryS1MME /
VS.AttStopWarnMsgDeliveryS1MME))
Notes:
The thresholds are configurable in the CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.
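The raise-once and clear behavior described in the notes can be sketched as a small state tracker. This is an illustration only: the class and method names are hypothetical, and replacing the active severity when a different threshold is met is an assumption, since the source does not describe how severity changes are reported.

```python
from typing import Dict, List, Optional, Tuple

Key = Tuple[str, str]  # (CPI name, component)


class CpiAlarmTracker:
    """Tracks the active severity for each (CPI, component) pair."""

    def __init__(self) -> None:
        self._active: Dict[Key, str] = {}

    def evaluate(self, cpi: str, component: str,
                 severity: Optional[str]) -> List[str]:
        """Process one 5-minute interval and return the events to emit."""
        key = (cpi, component)
        current = self._active.get(key)
        if severity is None:
            # No threshold met in this interval: clear any active alarm.
            if current is None:
                return []
            del self._active[key]
            return [f"CLEAR {current}"]
        if severity == current:
            # Same severity already raised for this CPI and component.
            return []
        # Assumption: a different severity replaces the active one.
        self._active[key] = severity
        return [f"RAISE {severity}"]


tracker = CpiAlarmTracker()
cpi_name = "LSS_cpiStopWarnMsgDeliveryS1MMEFailureRate"
events = [tracker.evaluate(cpi_name, "component-1", sev)  # hypothetical component
          for sev in ["MINOR", "MINOR", "MAJOR", None]]
print(events)
```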

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Possible failure reasons:
Possible problems with the MME.
Link failure to target eNBs.
The MME_TAI table does not contain the correct TAIs.
Internal MME errors.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the S1MME links are in-service/normal, using link_cli.


...................................................................................................................................................................................................

2 Verify the operational status of the eNBs and that the eNBs are responding to messages
over the S1MME link.
...................................................................................................................................................................................................

3 Contact Alcatel-Lucent Customer Support to determine the status of the eNBs that are
involved in the Stop Warning Message procedure.
END OF STEPS


LSS_cpiStopWarnMsgDeliverySBcFailureRate
Description
The raised alarm LSS_cpiStopWarnMsgDeliverySBcFailureRate indicates meeting a
threshold of the Stop Warning Message Delivery SBc Failure Rate CPI, which is
calculated every 5 minutes using this formula:
100 - (100 * (VS.NbrSuccessStopWarnMsgDeliverySBc / VS.AttStopWarnMsgDeliverySBc))
Notes:
The thresholds are configurable in the CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Possible failure reasons:
Possible problems with the CBC or the MME.
Link failure to the CBC.
Link failure to target eNBs.
The MME_TAI table does not contain the correct TAIs.
Internal MME errors.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the SBc links are in-service/normal, using link_cli.

...................................................................................................................................................................................................

2 Verify that the S1MME links are in-service/normal, using link_cli.


...................................................................................................................................................................................................

3 Verify the operational status of the CBC.


...................................................................................................................................................................................................

4 Verify the operational status of the eNBs and that the eNBs are responding to messages
over the S1MME link.
...................................................................................................................................................................................................

5 Contact Alcatel-Lucent Customer Support to determine the status of the CBC and eNBs
that are involved in the Stop Warning Message procedure.
END OF STEPS


LSS_cpiUECapacityUsage
Description
The raised alarm, LSS_cpiUECapacityUsage, indicates that the UE capacity utilization rate of
a board met a threshold in the last 5 minutes. The utilization rate is calculated for every
5-minute interval using this formula:
( Maximum number of registered UEs on a board / UE capacity of a single board ) * 100%
Notes:
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the subsequent intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 99%
Major Alarm: 95% < CPI value <= 99%
Minor Alarm: 90% < CPI value <= 95%
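The per-board evaluation can be sketched as follows. This is a minimal example under stated assumptions: the function names, the board names, and the capacity figure are hypothetical, and the thresholds are the defaults listed above rather than values read from the system.

```python
from typing import Dict, Optional


def ue_capacity_usage_percent(max_registered_ues: int, board_capacity: int) -> float:
    """(maximum registered UEs on a board / UE capacity of a board) * 100."""
    if board_capacity <= 0:
        raise ValueError("board_capacity must be positive")
    return max_registered_ues * 100.0 / board_capacity


def capacity_severity(usage_percent: float) -> Optional[str]:
    """Default severity for a utilization value, or None below all thresholds."""
    if usage_percent > 99.0:
        return "CRITICAL"
    if usage_percent > 95.0:
        return "MAJOR"
    if usage_percent > 90.0:
        return "MINOR"
    return None


def evaluate_boards(peaks: Dict[str, int],
                    board_capacity: int) -> Dict[str, Optional[str]]:
    """Evaluate each board separately, since the CPI is computed per board."""
    return {
        board: capacity_severity(ue_capacity_usage_percent(peak, board_capacity))
        for board, peak in peaks.items()
    }


BOARD_CAPACITY = 100_000  # hypothetical capacity, not a documented value
peaks = {"board-1": 96_500, "board-2": 42_000}  # hypothetical 5-minute peaks
print(evaluate_boards(peaks, BOARD_CAPACITY))
```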

Root Cause
The alarm is raised when the maximum number of registered UEs on a single board crosses
the predefined threshold.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check how many boards the WMM has and consider installing more boards to increase
the WMM capacity.
END OF STEPS


LSS_cpiWarnMsgDeliveryS1MMEFailureRate
Description
The raised alarm LSS_cpiWarnMsgDeliveryS1MMEFailureRate indicates meeting a
threshold of the Warning Message Delivery S1MME Failure Rate CPI, which is
calculated every 5 minutes using this formula:
100 - (100 * (VS.NbrSuccessWarnMsgDeliveryS1MME /
VS.AttWarnMsgDeliveryS1MME))
Notes:
The thresholds are configurable in the CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Possible failure reasons:
Possible problems with the MME.
Link failure to target eNBs.
The MME_TAI table does not contain the correct TAIs.
Internal MME errors.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the S1MME links are in-service/normal, using link_cli.


...................................................................................................................................................................................................

2 Verify the operational status of the eNBs and that the eNBs are responding to messages
over the S1MME link.
...................................................................................................................................................................................................

3 Contact Alcatel-Lucent Customer Support to determine the status of the eNBs that are
involved in the Write Replace Warning Message procedure.
END OF STEPS


LSS_cpiWarnMsgDeliverySBcFailureRate
Description
The raised alarm LSS_cpiWarnMsgDeliverySBcFailureRate indicates meeting a threshold
of the Warning Message Delivery SBc Failure Rate CPI, which is calculated every 5
minutes using this formula:
100 - (100 * (VS.NbrSuccessWarnMsgDeliverySBc / VS.AttWarnMsgDeliverySBc))
Notes:
The thresholds are configurable in the CPI GUI.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Possible failure reasons:
Possible problems with the CBC or the MME.
Link failure to the CBC.
Link failure to target eNBs.
The MME_TAI table does not contain the correct TAIs.
Internal MME errors.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the SBc links are in-service/normal, using link_cli.


...................................................................................................................................................................................................

2 Verify that the S1MME links are in-service/normal, using link_cli.

...................................................................................................................................................................................................

3 Verify the operational status of the CBC.


...................................................................................................................................................................................................

4 Verify the operational status of the eNBs and that the eNBs are responding to messages
over the S1MME link.
...................................................................................................................................................................................................

5 Contact Alcatel-Lucent Customer Support to determine the status of the CBC and eNBs
that are involved in the Write Replace Warning Message procedure.
END OF STEPS


LSS_dataMismatch
Description
A data mismatch has been detected, which indicates that there has been an error in
provisioning. The additionalText field of the event provides the details of the data
mismatch. Currently supported data mismatches are listed in the table below:

MH_SH_PROVISIONING (WMM link: S1mme, S6a, S13, SGs)
    An interface profile has been associated with an SCTP profile that indicates either
    single-homed or multi-homed, but the network interface types (ni-types) associated
    with a network interface do not match that configuration. For example, for SGs the
    single-homed ni-type is SGS and the multi-homed ni-types are SGS_1 and SGS_2. If the
    SCTP profile indicates single-homed, an SGs network interface must have an ni-type of
    SGS; if the SCTP profile indicates multi-homed, an SGs interface must have ni-types of
    SGS_1 and SGS_2. The additionalText field indicates the type of homing provisioned
    and the inconsistency found.

MH_PROVISIONED_IPS (Link)
    An SCTP multi-homed link has been provisioned with remote addresses that do not match
    the addresses that were learned from the remote end in the INIT-ACK message. There is
    a provisioning mistake either on the WMM or on the remote end of the connection. The
    additionalText field of the event indicates the provisioned (LCL) IP addresses and
    the learned remote (RMT) addresses that caused the discrepancy. The state of each IP
    address is indicated after the IP address, e.g. 1.2.3.4(STATE), where STATE is one of:
    R (reachable), U (unreachable), C (unconfirmed).

This alarm must be manually cleared after the provisioned data is corrected.
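
When investigating an MH_PROVISIONED_IPS event, the address(STATE) tokens can be extracted and the two address sets compared with a short script. This is only a sketch: the source documents the token form 1.2.3.4(STATE) but not the overall layout of the additionalText field, so the sample strings, the function names, and the restriction to IPv4 addresses are all assumptions.

```python
import re
from typing import Dict, Set

STATE_NAMES = {"R": "reachable", "U": "unreachable", "C": "unconfirmed"}

# Matches an IPv4 address followed by a one-letter state, e.g. 1.2.3.4(R).
TOKEN = re.compile(r"(\d{1,3}(?:\.\d{1,3}){3})\(([RUC])\)")


def parse_addresses(text: str) -> Dict[str, str]:
    """Return {ip: state name} for every address(STATE) token found in text."""
    return {ip: STATE_NAMES[state] for ip, state in TOKEN.findall(text)}


def address_mismatch(provisioned: Set[str], learned: Set[str]) -> Dict[str, Set[str]]:
    """List the addresses that appear on only one side."""
    return {
        "provisioned_only": provisioned - learned,
        "learned_only": learned - provisioned,
    }


# Hypothetical fragments; the real additionalText layout may differ.
lcl = parse_addresses("10.0.0.1(R) 10.0.0.2(U)")
rmt = parse_addresses("10.0.0.1(R) 10.0.0.3(C)")
print(address_mismatch(set(lcl), set(rmt)))
```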

Default severity
WARNING

Root Cause
The following table indicates the cause for the events referred to by their identifier in the
previous table:

MH_SH_PROVISIONING
    An SCTP profile associated with an interface indicates multi-homing or single-homing,
    but the ni-types associated with the interface do not match the indicated type of
    homing.

MH_PROVISIONED_IPS
    The SCTP IP addresses provisioned for an interface do not match the IP addresses
    learned from the remote end in the SCTP INIT-ACK message.
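
The MH_SH_PROVISIONING condition amounts to comparing the homing type of the SCTP profile with the set of ni-types on the interface. The sketch below encodes only the SGs values given in this entry; the function name and the table structure are hypothetical, and the ni-types for other link types are not listed in the source, so they are deliberately left out.

```python
from typing import Dict, FrozenSet, Iterable

# Only the SGs ni-types are documented in this entry; other link types
# would need their own values, which are not given here.
EXPECTED_NI_TYPES: Dict[str, Dict[str, FrozenSet[str]]] = {
    "SGs": {
        "single-homed": frozenset({"SGS"}),
        "multi-homed": frozenset({"SGS_1", "SGS_2"}),
    },
}


def homing_matches(link_type: str, homing: str, ni_types: Iterable[str]) -> bool:
    """True if the provisioned ni-types match the homing type of the SCTP profile."""
    expected = EXPECTED_NI_TYPES[link_type][homing]
    return frozenset(ni_types) == expected


print(homing_matches("SGs", "single-homed", ["SGS"]))
print(homing_matches("SGs", "multi-homed", ["SGS"]))
```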

Fault clearance procedure
...................................................................................................................................................................................................

1 The following table indicates the recovery procedure for the events referred to by their
identifier in the previous table:

MH_SH_PROVISIONING
    Correct the SCTP profile for this link type to indicate the correct homing type (SH
    or MH), or configure the correct ni-types for the link, so that they match the SCTP
    profile.

MH_PROVISIONED_IPS
    Correct the provisioning of the locally provisioned remote IPs to match the learned
    remote IPs, or change the provisioning on the remote end to match the locally
    provisioned remote IPs.

END OF STEPS


LSS_excessiveExternalLinksDown
Description
An excessive number of links of a given type (e.g. s1mme, s11, etc.) are down. This is
usually due to a network connectivity problem rather than a problem with the individual
links between the WMM and the external entity. Once this alarm is triggered, the WMM stops
reporting alarms and status for links of the given type. Once the network problem is
resolved and the number of links down is no longer excessive, this alarm clears and the
status of all links of the given type is updated. This alarm is raised when at least 100
links of a given type are down. This alarm clears when 95 or fewer links are down.
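
The different raise and clear counts form a hysteresis band, which keeps the alarm from toggling when the number of links down hovers near 100. A minimal sketch of that logic, using the two counts stated above (the function name and the sample sequence are hypothetical):

```python
RAISE_AT = 100  # raised when at least 100 links of a type are down
CLEAR_AT = 95   # cleared when 95 or fewer links are down


def next_alarm_state(active: bool, links_down: int) -> bool:
    """Return whether the alarm is active after observing links_down."""
    if not active:
        return links_down >= RAISE_AT
    # Once raised, the alarm stays active until the count drops to CLEAR_AT.
    return links_down > CLEAR_AT


state = False
history = []
for down in [90, 100, 97, 96, 95, 99]:  # hypothetical link-down counts
    state = next_alarm_state(state, down)
    history.append(state)
print(history)
```

Note that a count of 99 after the clear does not raise the alarm again; only a return to 100 or more does.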

Default severity
CRITICAL

Root Cause
The possible causes of this alarm are:
1. A large number of network entities are out-of-service or undergoing initialization.
2. Packet or HeartBeat message loss due to network issues.
3. Provisioning data is incorrect on the MME for network entities the MME communicates
with.
4. A software failure prevents communication from being established between the MME and
other network entities.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that there are no errors within the IP network.


...................................................................................................................................................................................................

2 If the network entity data is provisioned on the MME, verify that the data is correct.
...................................................................................................................................................................................................

3 Verify that the network entity with which the MME fails to communicate is in service.
END OF STEPS


LSS_externalLinkConfigurationLimit
Description
The maximum number of links for a given link type has been reached. When this limit is
reached, it is not possible to create any new links of the given link type. Every 15
minutes, a check is performed in an attempt to recover any links that have not been used
or have been disabled due to lack of far-end response. A configurable parameter, TdynMO,
is used to control the aging algorithm for link recovery.

Default severity
MAJOR

Root Cause
This alarm is raised when too many links of a given type are in use.

Fault clearance procedure


...................................................................................................................................................................................................

1 Wait at least one TdynMO time interval to allow the system to recover inactive or
disabled links.
...................................................................................................................................................................................................

2 If the system does not recover any links after the TdynMO time interval, contact
Alcatel-Lucent Customer Support.
END OF STEPS

LSS_externalLinkDown
Description
Communication between the WMM and another network entity cannot be established.

Default severity
CRITICAL, MAJOR

Root Cause
The possible causes of this alarm are:
1. The remote network entity is out of service or undergoing initialization.
2. Packet or HeartBeat message loss due to network issues.
3. Provisioned data on the WMM is incorrect for the network entities that the WMM
communicates with.
4. A software failure prevents communication from being established between the WMM
and other network entities.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the network entity with which the WMM fails to communicate is in service.
...................................................................................................................................................................................................

2 Determine that there are no errors within the IP network.


...................................................................................................................................................................................................

3 If the network entity data is provisioned on WMM, verify the data is correct.
...................................................................................................................................................................................................

4 If multiple links that terminate on the MIF (X1_1 or X2) are down, try switching the MIF
to its hot-standby mate.
...................................................................................................................................................................................................

5 If multiple links that terminate on the MPH (non-X1_1 and non-X2) are down, try
switching the MPH to its hot-standby mate.
END OF STEPS

LSS_failedAttachReqsRateExceeded
Description
The LSS_failedAttachReqsRateExceeded alarm indicates that the value of the
VS.cpiAttachFailures measurement, which monitors the Attach request failure CPI,
exceeded a threshold in the last 15-minute interval. The measurement is the computed
failure rate for the UE Attach procedure, which is compared against the provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10
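
The default criteria above amount to a simple threshold lookup. The following is a minimal sketch rather than the WMM implementation; the function name `classify_rate` and its parameter names are hypothetical, and the default values are the ones listed above.

```python
from typing import Optional


def classify_rate(rate: float,
                  minor: float = 5.0,
                  major: float = 10.0,
                  critical: float = 15.0) -> Optional[str]:
    """Map a failure-rate value to an alarm severity.

    Hypothetical helper: each threshold is exclusive on its lower bound,
    matching the "5 < Rate value <= 10" style of the default criteria.
    """
    if rate > critical:
        return "CRITICAL"
    if rate > major:
        return "MAJOR"
    if rate > minor:
        return "MINOR"
    return None  # no threshold met for this interval
```

Because the thresholds are provisioned values, a caller would pass the configured limits instead of relying on the defaults, for example `classify_rate(rate, minor=2, major=5, critical=10)` for an alarm whose default criteria use those limits.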

Root Cause
Possible reasons for failure:
Internal error
Procedure collision with ongoing HSS or SGW procedure
Invalid data in Attach Request message (includes protocol failures or invalid message content)
UE authentication failure due to failed validation of the RES returned in the Authentication Response message, or Authentication Failure received from the UE
HSS failure in response to AIR
HSS failure in response to ULR
NAS message timeout (messages include Authentication Response, Security Mode Complete, Attach Complete)
eNB returns Initial Context Setup Failure
Timeout occurs while waiting for Initial Context Setup Response
Double S1 connection
Unexpected S1 release
Bad NAS ESM Information Response
Bad NAS message in ESM container (PDN Connectivity Request, Activate Default Bearer Response)
SGW failure in response to Create Session Request
UE returns Activate Default Bearer Reject
SGW failure in response to Modify Bearer Request

Fault clearance procedure


...................................................................................................................................................................................................

1
Verify that the eNB, HSS and SGW links are in-service/normal, using link_cli.
If the links look normal, and the alarm persists, contact Alcatel-Lucent Customer
Support.

END OF STEPS

LSS_failedAuthRequestsHSSRateExceeded
Description
The LSS_failedAuthRequestsHSSRateExceeded alarm indicates that the value of the
VS.cpiHSSauthFailures measurement, which monitors failed HSS Authentication
requests, exceeded a threshold in the last 15-minute interval. The measurement is the
computed failure rate for the Authentication procedure between the MME and the HSS,
which is compared against the provisioned thresholds for Minor, Major, and Critical
alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
MAJOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Major Alarm: Rate value > 10

Root Cause
The number of failed Authentication Information Requests (AIR, the message that the
MME sends to the HSS to request authentication vectors) exceeded the failure limit. The
AIR could have failed for any of the following reasons:
Internal database error
Internal error sending messages between proxies/managers
The AIR message could not be sent to the HSS
The HSS did not respond
The response from the HSS was empty
The response from the HSS could not be decoded
The response from the HSS had a failure result code or experimental result code
The response from the HSS included more authentication vectors than were requested

Fault clearance procedure
...................................................................................................................................................................................................

1 Clearance options include:


Verify communication between the MME and the HSS (ping).
If the HSS (S6a) link looks normal (using link_cli), and the alarm persists, contact
Alcatel-Lucent Customer Support.

END OF STEPS

LSS_failedAuthRequestsUERateExceeded
Description
The LSS_failedAuthRequestsUERateExceeded alarm indicates that the value of the
VS.cpiUEauthFailures measurement, which monitors failed UE Authentication requests,
exceeded a threshold in the last 15-minute interval. The measurement is the computed
failure rate for the Authentication procedure between the MME and the UE, which is
compared against the provisioned thresholds for Minor, Major, and Critical alarm
conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
MAJOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Major Alarm: Rate value > 10

Root Cause
Failure could be due to any of the following reasons:
Internal error
Double S1 connection
Unexpected S1 release
HSS failure in response to AIR
UE authentication failure due to failed validation of the RES returned in the Authentication Response message, or Authentication Failure received from the UE

Fault clearance procedure


...................................................................................................................................................................................................

1
Verify HSS and UE authentication data, using ueadmin_cli.
If the authentication data looks good, and the alarm persists, contact Alcatel-Lucent
Customer Support.

END OF STEPS

LSS_failedCrDedBearerReqsRateExceeded
Description
The LSS_failedCrDedBearerReqsRateExceeded alarm indicates that the value of the
VS.cpiCreateDedicatedBearerFailures measurement, which monitors failed Create
Dedicated Bearer requests, exceeded a threshold in the last 15-minute interval. The
measurement is the computed failure rate for the Create Dedicated Bearer procedure,
which is compared against the provisioned thresholds for Minor, Major, and Critical
alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Failure could be due to any of the following reasons:
Internal error
Collision with another EMM or BSM procedure
No resource available (currently MME only supports one dedicated bearer)
Bad S11 message (Create Bearer Request)
UE returns failure on Activate Dedicated Bearer Request
eNB returns failure on E-RAB Setup Request
Timeout waiting for the response from the UE or eNB

Fault clearance procedure
...................................................................................................................................................................................................

1
Contact Alcatel-Lucent Customer Support.

END OF STEPS

LSS_failedDeactDedBearerReqsRateExceeded
Description
The LSS_failedDeactDedBearerReqsRateExceeded alarm indicates that the value of the
VS.cpiDeactivateDedBearerFailures measurement, which monitors failed Deactivate
Dedicated Bearer requests, exceeded a threshold in the last 15-minute interval. The
measurement is the computed failure rate for the Deactivate Dedicated Bearer procedure,
which is compared against the provisioned thresholds for Minor, Major, and Critical
alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Failure could be due to any of the following reasons:
Internal error
Invalid data in Delete Bearer Request message (includes protocol failures or invalid message content)
The response to the SGW could not be encoded
SGW failure on Delete Bearer Request

Fault clearance procedure


...................................................................................................................................................................................................

1
Contact Alcatel-Lucent Customer Support.

END OF STEPS

LSS_failedHRPDhandoverRateExceeded
Description
The LSS_failedHRPDhandoverRateExceeded alarm indicates that the value of the
VS.cpiHRPDHoFailures measurement, which monitors failed HRPD Handover requests,
exceeded a threshold in the last 15-minute interval. The measurement is the computed
failure rate for the Handover to HRPD procedure, which is compared against the
provisioned thresholds for Minor, Major, and Critical alarm conditions.
Notes:
THIS ALARM IS RESERVED FOR FUTURE USE.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
See the list of failure counters for the Handover to HRPD procedure. There are several
failure causes, each with a separate failure counter. The calculation for this alarm
implicitly uses the sum of all the failure counters for the Handover to HRPD procedure.

Fault clearance procedure


...................................................................................................................................................................................................

1 Review all the failure counters for the Handover to HRPD procedure in the PMC XML
files to determine whether one failure cause predominates. If one does, check the user
documentation for any remedies specific to that cause.
END OF STEPS

LSS_failedMobileTermLocRequestRateExceeded
Description
The LSS_failedMobileTermLocRequestRateExceeded alarm indicates that the Mobile
Termination Location Request Failure CPI has met a threshold. The CPI is calculated
every 5 minutes using this formula:
1 - ( ( VS.NbrSuccessMobileTermLocRequests
      + VS.AbortMobileTermLocRequest_HO
      + VS.AbortMobileTermLocRequest_MMEreloc
      + VS.AbortMobileTermLocRequest_Other
      + VS.AbortMobileTermLocRequest_UEdetach )
    / VS.AttMobileTermLocRequests )
Notes:
The thresholds are configurable.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%
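
As a worked illustration of the formula in the Description, the sketch below computes the CPI as a percentage from one interval's counter values. The counter names come from the formula; the function name, the dictionary input, and the handling of an interval with zero attempts are assumptions made for this example.

```python
from typing import Dict


def mobile_term_loc_failure_cpi(counters: Dict[str, int]) -> float:
    """Return the failure CPI, in percent, for one 5-minute interval."""
    attempts = counters["VS.AttMobileTermLocRequests"]
    if attempts == 0:
        return 0.0  # assumption: no attempts means nothing to report
    # Successful and aborted requests are not counted as failures.
    non_failures = (
        counters["VS.NbrSuccessMobileTermLocRequests"]
        + counters["VS.AbortMobileTermLocRequest_HO"]
        + counters["VS.AbortMobileTermLocRequest_MMEreloc"]
        + counters["VS.AbortMobileTermLocRequest_Other"]
        + counters["VS.AbortMobileTermLocRequest_UEdetach"]
    )
    # Algebraically the same as 100 * (1 - non_failures / attempts).
    return 100.0 * (attempts - non_failures) / attempts
```

For example, with 200 attempts, 170 successes, and 10 aborts in total, the CPI is 10.0 percent, which falls in the default Minor range (5% < CPI value <= 10%).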

Root Cause
Possible reasons for failure:
Problems with the eNB or SMLC involved in request
Incorrect provisioning of SMLC to TA
SMLC is not available to provide service

Fault clearance procedure
...................................................................................................................................................................................................

1
Verify that the S1-MME and SLs links are in-service/normal, using link_cli.
Refer to the Location Based Services failure counters to get a more specific failure
reason.
Contact Alcatel-Lucent Customer Support to determine the status of the SMLCs that
are involved in the LCS procedure.

END OF STEPS

LSS_failedNetwrkInducedLocRequestRateExceeded
Description
The LSS_failedNetwrkInducedLocRequestRateExceeded alarm indicates that the Network
Induced Location Request Failure CPI has met a threshold. The CPI is calculated
every 5 minutes using this formula:
1 - ( ( VS.NbrSuccessNetwrkInducedLocRequests
      + VS.AbortNetwrkInducedLocRequest_HO
      + VS.AbortNetwrkInducedLocRequest_MMEreloc
      + VS.AbortNetwrkInducedLocRequest_Other
      + VS.AbortNetwrkInducedLocRequest_UEdetach )
    / VS.AttNetwrkInducedLocRequests )
Notes:
The thresholds are configurable.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.
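
The raise-once and clear behavior in the notes above can be sketched as a small state tracker. This is an illustration under stated assumptions, not the WMM implementation: the class and method names are hypothetical, and treating a change of severity as a new raise is an assumption that the notes do not confirm.

```python
from typing import Dict, Optional, Tuple


class CpiAlarmTracker:
    """Hypothetical tracker for per-interval CPI alarm evaluation."""

    def __init__(self) -> None:
        # (cpi_name, component) -> severity currently raised
        self._active: Dict[Tuple[str, str], str] = {}

    def evaluate(self, cpi: str, component: str,
                 severity: Optional[str]) -> Optional[str]:
        """Return "raise", "clear", or None for one interval's result.

        severity is None when no threshold was met in the interval.
        """
        key = (cpi, component)
        current = self._active.get(key)
        if severity is None:
            if current is None:
                return None
            del self._active[key]
            return "clear"
        if severity == current:
            return None  # same severity already raised: do not repeat
        self._active[key] = severity  # assumption: a new severity is raised
        return "raise"
```

Keying the state on the CPI name and the component keeps alarms for different components independent of each other.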

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 15%
Major Alarm: 10% < CPI value <= 15%
Minor Alarm: 5% < CPI value <= 10%

Root Cause
Possible reasons for failure:
Problems with the eNB or SMLC involved in request
Incorrect provisioning of SMLC to TA
SMLC is not available to provide service

Fault clearance procedure
...................................................................................................................................................................................................

1
Verify that the S1-MME and SLs links are in-service/normal, using link_cli.
Refer to the Location Based Services failure counters to get a more specific failure
reason.
Contact Alcatel-Lucent Customer Support to determine the status of the SMLCs that
are involved in the LCS procedure.

END OF STEPS

LSS_failedNumHOFwdRelocRateExceeded
Description
The LSS_failedNumHOFwdRelocRateExceeded alarm indicates that the value of the
VS.cpiHOwMMErelocFailures_atTarget measurement, which monitors failed Handover
requests with MME forward relocation, exceeded a threshold in the last 15-minute
interval. The measurement is the computed failure rate at the Target MME for the
Handover procedure with MME relocation, which is compared against the provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
See the list of failure counters for the Handover procedure with MME relocation (at the
Target MME). There are several failure causes, each with a separate failure counter. The
calculation for this alarm implicitly uses the sum of all the failure counters for the
Handover procedure with MME relocation (at the Target MME).

Fault clearance procedure


...................................................................................................................................................................................................

1 Review all the failure counters for the Handover procedure with MME relocation (at the
Target MME) in the PMC XML files to determine whether one failure cause predominates.
If one does, check the user documentation for any remedies specific to that cause.
END OF STEPS
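
The check in step 1, whether one failure cause predominates, can be sketched as follows once the counter values have been read from the PMC XML files. Reading the files is left out because their layout is not described here. The function name, the placeholder counter names in the usage note, and the 50 percent share used to define "predominates" are assumptions made for this example.

```python
from typing import Dict, Optional


def predominant_cause(counters: Dict[str, int],
                      min_share: float = 0.5) -> Optional[str]:
    """Return the name of the failure counter that holds more than
    min_share of all counted failures, or None if no counter does."""
    total = sum(counters.values())
    if total == 0:
        return None  # no failures recorded
    name, count = max(counters.items(), key=lambda item: item[1])
    return name if count / total > min_share else None
```

For example, `predominant_cause({"cause_a": 80, "cause_b": 15, "cause_c": 5})` returns `"cause_a"`, while an even spread of failures across the counters returns `None`.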

LSS_failedNumHOPathSwNewSgwRateExceeded
Description
The LSS_failedNumHOPathSwNewSgwRateExceeded alarm indicates that the value of the
VS.cpiHOwSGWrelocFailures measurement, which monitors failed Handover Path Switch
requests to a different Serving Gateway, exceeded a threshold in the last 15-minute
interval. The measurement is the computed failure rate for the Handover procedure
without MME relocation and with SGW relocation, which is compared against the
provisioned thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
See the list of failure counters for the Handover procedure without MME relocation and
with SGW relocation. There are several failure causes, each with a separate failure
counter. The calculation for this alarm implicitly uses the sum of all the failure counters
for the Handover procedure without MME relocation and with SGW relocation.

Fault clearance procedure


...................................................................................................................................................................................................

1 Review all the failure counters for the Handover procedure without MME relocation and
with SGW relocation in the PMC XML files to determine whether one failure cause
predominates. If one does, check the user documentation for any remedies specific to
that cause.
END OF STEPS

LSS_failedNumHOPathSwSameSgwRateExceeded
Description
The LSS_failedNumHOPathSwSameSgwRateExceeded alarm indicates that the value of the
VS.cpiHOwNoRelocFailures measurement, which monitors failed Handover Path Switch
requests to the same Serving Gateway, exceeded a threshold in the last 15-minute
interval. The measurement is the computed failure rate for the Handover procedure
without MME relocation and without SGW relocation, which is compared against the
provisioned thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in one of the following intervals.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5
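
The default criteria above can be read as a simple threshold ladder. The following Python sketch is illustrative only and is not part of the WMM software: the function name classify_rate and the DEFAULT_THRESHOLDS table are hypothetical, and the values are the defaults listed above, which the provisioned threshold settings may override.

```python
from typing import Dict, Optional

# Default thresholds taken from the Severity Details above.
# Provisioned thresholds may differ; pass them in via `thresholds`.
DEFAULT_THRESHOLDS: Dict[str, float] = {
    "MINOR": 2.0,
    "MAJOR": 5.0,
    "CRITICAL": 10.0,
}


def classify_rate(rate: float,
                  thresholds: Dict[str, float] = DEFAULT_THRESHOLDS) -> Optional[str]:
    """Return the alarm severity for a failure rate value.

    Returns None when the rate does not exceed the Minor threshold,
    that is, when no alarm condition is met.
    """
    # Check the highest severity first; each bound is strictly "greater than".
    if rate > thresholds["CRITICAL"]:
        return "CRITICAL"
    if rate > thresholds["MAJOR"]:
        return "MAJOR"
    if rate > thresholds["MINOR"]:
        return "MINOR"
    return None
```

For example, a rate value of exactly 10 maps to MAJOR, because the Critical criterion is strictly greater than 10, and a rate value of 2 raises no alarm.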

Root Cause
SGW failed to switch the S1-U downlink path to the new ENB

Fault clearance procedure


...................................................................................................................................................................................................

1 Contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


LSS_failedNumHORequiredRateExceeded
Description
The raised alarm, LSS_failedNumHORequiredRateExceeded, indicates that the value of the
VS.cpiHOwMMErelocFailures_atSource measurement, which monitors failures of Handover
requests with MME relocation, exceeded a threshold in the last 15-minute interval. The
measurement is the failure rate at the source MME for the Handover procedure with MME
relocation; it is compared against the provisioned thresholds for Minor, Major, and
Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
See the list of failure counters for the Handover procedure with MME relocation (at the
Source MME). There are several failure causes, each with a separate failure counter. The
calculation for this alarm implicitly uses the sum of all the failure counters for this
Handover procedure.

Fault clearance procedure


...................................................................................................................................................................................................

1 Look at all the failure counters for the Handover procedure with MME relocation (at the
Source MME) in the PMC XML files to determine if one failure cause predominates. If
one is found, check User Documentation for any remedies specific to the found cause.
...................................................................................................................................................................................................
END OF STEPS
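
One way to judge whether a single failure cause predominates, once the counter values have been read from the PMC XML files, is to compare the largest counter with the total. The sketch below is a hypothetical helper, not a WMM tool. The PMC XML parsing is left out because the file layout is not described here, the counter names in the usage note are placeholders, and the 50% share used as the meaning of "predominates" is an assumption.

```python
from typing import Dict, Optional, Tuple


def predominant_cause(counters: Dict[str, int],
                      min_share: float = 0.5) -> Optional[Tuple[str, float]]:
    """Return (counter_name, share) for the largest failure counter.

    `counters` maps a failure-counter name to its value for one interval.
    Returns None if there were no failures, or if the largest counter
    accounts for less than `min_share` of all failures (assumed cutoff).
    """
    total = sum(counters.values())
    if total == 0:
        return None
    # Pick the counter with the highest value.
    name, value = max(counters.items(), key=lambda item: item[1])
    share = value / total
    if share < min_share:
        return None
    return name, share
```

With placeholder counters {"causeA": 8, "causeB": 1, "causeC": 1}, the helper returns ("causeA", 0.8), which points to the cause to look up in the User Documentation.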


LSS_failedS1MMEconnEstRateExceeded
Description
The raised alarm, LSS_failedS1MMEconnEstRateExceeded, indicates that the value of the
VS.cpiS1MMEconnFailures measurement, which monitors failed S1-MME Connect requests,
exceeded a threshold in the last 15-minute interval. The measurement is the failure rate
for eNB connections over S1-MME; it is compared against the provisioned thresholds for
Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
PLMN and tracking area data was not provisioned correctly

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify PLMN or TAI provisioning data, via the SAM. After validation of the data, if the
problem persists, contact Alcatel-Lucent Customer Support.
E...................................................................................................................................................................................................
N D O F S T E P S


LSS_failedServiceReqsRateExceeded
Description
The raised alarm, LSS_failedServiceReqsRateExceeded, indicates that the value of the
cpiServiceRequestFailures measurement, which monitors failures of Service requests,
exceeded a threshold in the last 15-minute interval. The measurement is the failure rate
for the UE Service Request procedure; it is compared against the provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Failure could be due to any of the following reasons:
• Internal error
• Procedure collision with ongoing HSS or SGW procedure
• Invalid data in Service Request message (includes protocol failures or invalid
  message content)
• UE Authentication Failure due to invalid validation of RES returned in Authentication
  Response message or Authentication Failure received by UE
• HSS failure in response to AIR
• NAS message timeout (messages include Authentication Response or Security Mode
  Complete)
• ENB returns Initial Context Setup Failure
• Timeout occurs while waiting for Initial Context Setup Response
• Double S1 connection
• Unexpected S1 release
• SGW failure on Modify Bearer Request

Fault clearance procedure
...................................................................................................................................................................................................

1 Ensure the S11 links to SGW are normal, using link_cli. If the links look normal, and
alarm persists, contact Alcatel-Lucent Customer Support.
E...................................................................................................................................................................................................
N D O F S T E P S


LSS_failedTAURateExceeded
Description
The raised alarm, LSS_failedTAURateExceeded, indicates that the value of the
VS.cpiTauFailures measurement, which monitors failures of Tracking Area Update
requests, exceeded a threshold in the last 15-minute interval. The measurement is the
failure rate for the TAU procedure; it is compared against the provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10
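
The interaction between the 15-minute intervals, the default thresholds above, and the clearing note can be illustrated with a short Python sketch. It is a simplified model and not the WMM implementation: it assumes that the rate is evaluated once per interval, that a change of severity is reported as a new raise, and that the alarm clears in the first interval in which no threshold is met. The names severity and alarm_timeline and the event strings are hypothetical.

```python
from typing import Iterable, List, Optional

# Default criteria from the Severity Details above, highest severity first.
THRESHOLDS = (("CRITICAL", 15.0), ("MAJOR", 10.0), ("MINOR", 5.0))


def severity(rate: float) -> Optional[str]:
    """Map one interval's rate value to a severity, or None if no threshold is met."""
    for name, limit in THRESHOLDS:
        if rate > limit:
            return name
    return None


def alarm_timeline(rates: Iterable[float]) -> List[str]:
    """Return the raise/clear events produced by a sequence of interval rates."""
    state: Optional[str] = None  # severity currently raised, if any
    events: List[str] = []
    for rate in rates:
        new_state = severity(rate)
        if new_state != state:
            # Either the severity changed or the alarm condition went away.
            events.append("CLEAR" if new_state is None else f"RAISE {new_state}")
            state = new_state
    return events
```

Under these assumptions, the interval rates 3, 7, 12, 16, 16, 4 produce RAISE MINOR, RAISE MAJOR, RAISE CRITICAL, and then CLEAR once the rate falls to 4.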

Root Cause
Failure could be due to any of the following reasons:
• Internal error
• Procedure collision with ongoing HSS or SGW procedure
• Invalid data in Tracking Area Update Request message (includes protocol failures or
  invalid message content)
• ENB returns Initial Context Setup Failure
• Timeout occurs while waiting for Initial Context Setup Response
• Double S1 connection
• Unexpected S1 release
• SGW failure on Modify Bearer Request

Fault clearance procedure
...................................................................................................................................................................................................

1 Contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


LSS_failedUpdBearerReqsRateExceeded
Description
The raised alarm, LSS_failedUpdBearerReqsRateExceeded, indicates that the value of the
cpiUpdateBearerFailures measurement, which monitors failures of Update Bearer requests,
exceeded a threshold in the last 15-minute interval. The measurement is the failure rate
for the Update Bearer procedure; it is compared against the provisioned
thresholds for Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears if no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Failure could be due to any of the following reasons:
• Internal error
• SGW failure on Modify Bearer Request

Fault clearance procedure


...................................................................................................................................................................................................

1 Ensure S11 links are normal, using link_cli. If the links are normal, and the alarm persists,
contact Alcatel-Lucent Customer Support.
E...................................................................................................................................................................................................
N D O F S T E P S


LSS_failedUpdDedBearerReqsRateExceeded
Description
The raised alarm, LSS_failedUpdDedBearerReqsRateExceeded, indicates that the value of the
VS.cpiUpdateDedicatedBearerFailures measurement, which monitors failures of Update
Dedicated Bearer requests, exceeded a threshold in the last 15-minute interval. The
measurement is the failure rate for the Update Dedicated Bearer procedure; it is
compared against the provisioned thresholds for Minor, Major, and Critical alarm
conditions.
Notes:
The alarm clears if no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10

Root Cause
Failure could be due to any of the following reasons:
• Internal error
• Invalid data in Update Dedicated Bearer Request message (includes protocol failures
  or invalid message content)
• The response to the SGW could not be encoded
• SGW failure on Update Dedicated Bearer Request

Fault clearance procedure


...................................................................................................................................................................................................

1
Verify that S11 links are normal, using link_cli. If links are normal, and the alarm
persists, contact Alcatel-Lucent Customer Support.

E...................................................................................................................................................................................................
N D O F S T E P S


LSS_ggsnDnsError
Description
GGSN DNS Selection unable to retrieve IP Address. This alarm must be manually
cleared.

Default severity
MINOR

Root Cause
WMM is unable to retrieve GGSN IP Address.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the GGSN IP Address is provisioned correctly on the DNS server.


...................................................................................................................................................................................................

2 Manually clear the alarm.


...................................................................................................................................................................................................
END OF STEPS


LSS_internalCommunicationFailure
Description
Communication between the active MIF member and the active MAF/SAF member failed, or
communication between the active MIF member and the active MPH member failed.

Default severity
CRITICAL, MAJOR

Root Cause
The possible causes of this alarm are:
1. MPH, MIF or MAF/SAF pool has duplex failed or is undergoing initialization.
2. Software failure prevents communication establishment between MIF and MAF/SAF
or MIF and MPH.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that MPH, MIF and/or MAF/SAF have not been forced out-of-service.
...................................................................................................................................................................................................

2 If communication is lost between the MPH and the MIF and it does not come back
automatically, and MPH pool is in Active / Hot-standby state, try switching MPH to the
standby member.
...................................................................................................................................................................................................

3 If communication is lost between the MAF/SAF and the MIF and it does not come back
automatically, and MAF/SAF pool is in Active / Hot-standby state, try switching
MAF/SAF to the standby member.
...................................................................................................................................................................................................

4 If communiaction is lost between the MIF and MPH and the MIF and MAF/SAFs and it
does not come back automatically, and MIF pool is in Active / Hot-standby state, try
switching MIF to the standby member.
E...................................................................................................................................................................................................
N D O F S T E P S


LSS_ippuBusError
Description
There is a bus error on the indicated host between the HSPP4 hardware (iPPU) in the
AMC slot and the host hardware.

Default severity
CRITICAL

Root Cause
List of root causes:
• The HSPP4 AMC itself has failed.
• The iPPU service on HSPP4 is in a transient state.
• The iPPU service on HSPP4 has failed.
• There is no HSPP4 AMC and a user is attempting to run the iPPU/PMB software for
  SGSN.

Fault clearance procedure


...................................................................................................................................................................................................

1 Determine if any related alarms are also present, such as on the ESC, chassis, or board
itself. Correct those alarms first and see if this alarm clears as a result.
...................................................................................................................................................................................................

2
1. On Alcatel-Lucent 9471 WMM:
Utilize ippu_cli to print the status of the board on the OAM host.
...................................................................................................................................................................................................

3
1. On Alcatel-Lucent 9471 WMM:
Verify the appropriate FRUID via shelf manager is present in the given ShelfId
CardId.
...................................................................................................................................................................................................

4
1. On Alcatel-Lucent 9471 WMM:
Visually verify that HSPP4 hardware is present in the AMC slot that the alarm
indicates by ShelfId and CardId.

...................................................................................................................................................................................................

5
1. On Alcatel-Lucent 9471 WMM:
On the Shelf Manager, verify the shelf and card in the alarm has an HSPP4 iPPU in
the AMC slot. If HSPP4 is not detected, attempt to powercycle the card.
...................................................................................................................................................................................................

6
1. On Alcatel-Lucent 9471 WMM:
On the Shelf Manager, verify the shelf and card in the alarm has an HSPP4 iPPU in
the AMC slot. If HSPP4 is not detected, attempt to re-seat the card in the alarm by
ShelfId and CardId.
...................................................................................................................................................................................................

7
1. On Alcatel-Lucent 9471 WMM:
On the Shelf Manager, verify the shelf and card in the alarm has an HSPP4 iPPU in
the AMC slot. If HSPP4 is not detected, replace the card used for this host using the
appropriate FRU procedure as necessary.
...................................................................................................................................................................................................

8
1. On Alcatel-Lucent 9471 WMM:
Attempt to reset the entire host (ShelfId/CardId) via appropriate CLI or MI. Before
attempting this action, verify that there is an ACTIVE or STANDBY mate present in
the system.
...................................................................................................................................................................................................

9 If the above steps do not clear the alarm, contact Alcatel-Lucent Customer Support.
...................................................................................................................................................................................................
END OF STEPS


LSS_ippuResourceReset
Description
There was a software reset on the iPPU in the HSPP4 AMC or a restart by the PMB
process in the host identified by ShelfId and CardId.

Default severity
MAJOR

Root Cause
List of root causes:
• The iPPU HSPP4 software has reset.
• The iPPU HSPP4 software has restarted.
• The PMB process on the given host has restarted.

Fault clearance procedure


...................................................................................................................................................................................................

1 Determine if any related alarms are present. Correct those alarms first and see if this
alarm clears as a result.
...................................................................................................................................................................................................

2
1. On Alcatel-Lucent 9471 WMM:
Utilize ippu_cli to print the status of the board on the OAM host.
...................................................................................................................................................................................................

3
1. On Alcatel-Lucent 9471 WMM:
Before attempting this action, verify that there is an ACTIVE or STANDBY mate
present in the system. Attempt to reset the entire card (shelf/slot) via appropriate CLI
interface or MI.
...................................................................................................................................................................................................

4 If the above steps do not clear the alarm, contact Alcatel-Lucent Customer Support.
...................................................................................................................................................................................................
END OF STEPS


LSS_liNearingCapacityLimit
Description
The number of lawful interceptions has reached 80% of MAF/SAF capacity.

Default severity
WARNING

Root Cause
The possible causes of this alarm are:
1. Use of lawful interception beyond design capacity.
2. Software failure causing unnecessary interception.

Fault clearance procedure


...................................................................................................................................................................................................

1 Use the query option of the li_target_cli command to verify that the appropriate set of
UEs are selected for lawful interception.
...................................................................................................................................................................................................
END OF STEPS


LSS_maxDurationExpiredOnHRPDhandover
Description
The raised alarm, LSS_maxDurationExpiredOnHRPDhandover, indicates that the value of the
VS.cpiMaxDurationHRPDhandover measurement, which monitors timeouts on HRPD
handover requests, exceeded a threshold in the last 15-minute interval. This value is the
maximum time taken to perform a Handover to HRPD.
Notes:
THIS ALARM IS RESERVED FOR FUTURE USE.
The alarm clears if no threshold is met in a subsequent interval.

Default severity
MAJOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Major Alarm: Timeout value > 300

Root Cause
The cause for exceeding the expected maximum cannot be determined precisely.

Fault clearance procedure


1 Check the network routers for possible network delay. When the MME is programmed to
include internal delay measurements, check these PMC values.
END OF STEPS


LSS_mmeDnsError
Description
MME DNS selection is unable to retrieve the MME IP address associated with the FQDN. This
alarm must be manually cleared.

Default severity
MINOR

Root Cause
The MME is unable to retrieve the MME IP address associated with the FQDN.

Fault clearance procedure


1 Verify that the FQDN is provisioned correctly in the DNS server.
2 Manually clear the alarm.
END OF STEPS
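
As a rough aid for step 1, the following Python sketch checks whether a name resolves through the local system resolver. It is only an illustration under stated assumptions: the function name is hypothetical, the workstation running it may not query the same DNS server as the MME, and the MME may rely on DNS record types other than the address records checked here.

```python
import socket
from typing import List

def resolve_fqdn(fqdn: str) -> List[str]:
    """Return the sorted unique IP addresses the local resolver gives for fqdn.

    An empty list means the name did not resolve from this host.
    """
    try:
        infos = socket.getaddrinfo(fqdn, None)
    except socket.gaierror:
        return []
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return sorted({info[4][0] for info in infos})

# "localhost" is used only so the example runs anywhere; substitute the FQDN
# reported in the alarm text.
print(resolve_fqdn("localhost"))
```

An empty result suggests that the FQDN is missing or misprovisioned on the DNS server that this host queries; a non-empty result does not by itself prove that the MME sees the same answer.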


LSS_noResetAckReceived
Description
No RESET ACKNOWLEDGEMENT message was received from the RNC after the
WMM has sent and resent a RESET message.

Default severity
MINOR

Root Cause
The possible causes of this alarm are:
1. Remote network entity is out-of-service or undergoing initialization.
2. Message loss due to network issues.
3. Software failure prevents communication between WMM and the RNC.

Fault clearance procedure


1 Verify that the RNC from which the WMM fails to receive the message is in service.
2 Verify that there are no errors within the IP network.
END OF STEPS


LSS_numTOS10gtpcRateExceeded
Description
This alarm is raised when the value of the VS.cpiGTPcResponseTO_S10 measurement, which
monitors missing replies to S10 (GTP-C) requests, exceeded a threshold in the last
15-minute interval. This value is the percentage of Response messages that are not
received over S10; it is compared against provisioned thresholds for Minor, Major, and
Critical alarm conditions.
Notes:
The alarm clears when no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5
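
The default criteria above can be read as a mapping from the measured rate to a severity. The following Python sketch illustrates that reading, assuming the rate is a percentage and the default thresholds listed here are in force. The function name and parameters are illustrative and are not part of the WMM software; provisioned thresholds may differ.

```python
from typing import Optional

def classify_rate(rate: float, minor: float = 2.0, major: float = 5.0,
                  critical: float = 10.0) -> Optional[str]:
    """Map a response-timeout rate (percent) to the default alarm severity.

    Returns None when the rate does not exceed the minor threshold.
    """
    if rate > critical:
        return "CRITICAL"
    if rate > major:
        return "MAJOR"
    if rate > minor:
        return "MINOR"
    return None

# Boundary values fall into the lower band because the upper bounds are inclusive.
for sample in (1.5, 2.0, 3.0, 5.0, 7.5, 10.0, 12.0):
    print(sample, classify_rate(sample))
```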

Root Cause
The cause cannot be determined from the measurements involved in this calculation.

Fault clearance procedure


1 Check the network routers for any problems. Check whether any other MME elements are
having problems.
END OF STEPS


LSS_numTOS11gtpcRateExceeded
Description
This alarm is raised when the value of the VS.cpiGTPcResponseTO_S11 measurement, which
monitors missing replies to S11 (GTP-C) requests, exceeded a threshold in the last
15-minute interval. This value is the percentage of Response messages that are not
received over S11; it is compared against provisioned thresholds for Minor, Major, and
Critical alarm conditions.
Notes:
The alarm clears when no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Failure could be due to any of the following reasons:
Internal error
Timeout waiting for SGW response on MME's Request

Fault clearance procedure


1 Verify that S11 links are normal, using link_cli. If links are normal and the alarm
persists, contact Alcatel-Lucent Customer Support.
END OF STEPS


LSS_numTOS3gtpcRateExceeded
Description
This alarm is raised when the value of the VS.numTOS3gtpcRateExceeded measurement, which
monitors missing replies to S3 (GTP-C) requests, exceeded a threshold in the last
5-minute interval. This value is the percentage of Response messages that are not
received over S3; it is compared against provisioned thresholds for Minor, Major, and
Critical alarm conditions.
Notes:
The alarm clears when no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Failure could be due to any of the following reasons:
Internal error
Timeout waiting for SGW response on MME's Request

Fault clearance procedure


1 Verify that S3 links are normal, using link_cli. If links are normal and the alarm
persists, contact Alcatel-Lucent Customer Support.
END OF STEPS


LSS_pathAvailability
Description
This alarm is raised when an SCTP path becomes unavailable. Check that the local and
remote provisioned addresses use the correct two subnetworks. If the provisioned addresses
match the two physical subnets, and all provisioned addresses are otherwise correct,
investigate the physical network that carries the subnet used in the path "unavailable"
alarm. The specifics of the path are documented in the "additionalText" field of the
alarm. These alarms may need to be cleared manually. Alarm reports are issued when path
connectivity is established, but their contents are a function of the provisioned
addresses (paths). If those addresses were wrong and were changed while the connection
was down, the report may no longer match the path that was originally alarmed.

Default severity
MINOR

Root Cause
The provisioning of the SCTP endpoints (that is, the IP addresses) on either the WMM or
the remote entity is incorrect, or the network between the endpoints is experiencing
problems.

Fault clearance procedure


1 Verify that the endpoint IP addresses on the WMM and the remote entity are provisioned
correctly.
2 Verify that the network between the WMM and the remote entity is functioning correctly.
END OF STEPS
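
The address check in step 1 can be sketched in Python with the standard ipaddress module. This is a minimal illustration only: the function name is hypothetical, and the subnets and addresses below are documentation-range placeholders, not values from any WMM configuration.

```python
import ipaddress
from typing import Iterable, List

def addresses_outside_subnets(addresses: Iterable[str],
                              subnets: Iterable[str]) -> List[str]:
    """Return the provisioned addresses that fall in none of the expected subnets."""
    networks = [ipaddress.ip_network(s) for s in subnets]
    outside = []
    for addr in addresses:
        ip = ipaddress.ip_address(addr)
        # An address of a different IP version than the network never matches.
        if not any(ip in net for net in networks):
            outside.append(addr)
    return outside

# Placeholder values: two expected subnets and three provisioned endpoint addresses.
expected_subnets = ["198.51.100.0/24", "203.0.113.0/24"]
provisioned = ["198.51.100.10", "203.0.113.10", "192.0.2.10"]
print(addresses_outside_subnets(provisioned, expected_subnets))
```

Any address returned by the function is a candidate provisioning error to compare against the path named in the "additionalText" field of the alarm.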


LSS_pgwDnsError
Description
MME DNS selection is unable to retrieve the PGW IP address associated with the FQDN. This
alarm must be manually cleared.

Default severity
MINOR

Root Cause
The MME is unable to retrieve the PGW IP address associated with the FQDN.

Fault clearance procedure


1 Verify that the FQDN is provisioned correctly in the DNS server.
2 Manually clear the alarm.
END OF STEPS


LSS_provisioningError
Description
The TAI-to-LAI mapping to the MSC in the 2G/3G operator is not provisioned for SGS-based
CSFB/SMS.

Default severity
WARNING

Root Cause
The TAI-to-LAI mapping to the MSC in the 2G/3G operator is not provisioned.

Fault clearance procedure


1 Provision the missing entries in the TAI-LAI mapping table, using the LAI in the 2G/3G
operator. Refer to the user text in the alarm.
END OF STEPS
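
Finding the missing entries amounts to comparing the TAIs in use against the keys of the mapping table. The Python sketch below shows one way to do that, under stated assumptions: the string form of the TAI and LAI identifiers, the function name, and all sample values are hypothetical and do not reflect the WMM provisioning format.

```python
from typing import Dict, Iterable, List

def missing_tai_mappings(tais_in_use: Iterable[str],
                         tai_to_lai: Dict[str, str]) -> List[str]:
    """Return the TAIs that have no entry in the TAI-LAI mapping table."""
    return sorted(tai for tai in set(tais_in_use) if tai not in tai_to_lai)

# Hypothetical identifiers written as MCC-MNC-code strings.
tais = ["310-410-0001", "310-410-0002", "310-410-0003"]
mapping = {"310-410-0001": "310-410-1000"}
print(missing_tai_mappings(tais, mapping))
```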


LSS_sgsnDnsError
Description
SGSN DNS selection is unable to retrieve the SGSN IP address associated with the FQDN.
This alarm must be manually cleared.

Default severity
MINOR

Root Cause
The WMM is unable to retrieve the SGSN IP address associated with the FQDN.

Fault clearance procedure


1 Verify that the FQDN is provisioned correctly in the DNS server.
2 Manually clear the alarm.
END OF STEPS


LSS_taiFqdnError
Description
MME DNS selection is unable to retrieve the SGW IP address associated with the FQDN. This
alarm must be manually cleared.

Default severity
MINOR

Root Cause
The MME is unable to retrieve the SGW IP address associated with the FQDN.

Fault clearance procedure


1 Verify that the FQDN is provisioned correctly in the DNS server.
2 Manually clear the alarm.
END OF STEPS

3 SGSN Alarms

Overview
Purpose
This chapter contains alarms that are specific to the SGSN.

Contents

LSS_cdrStorageSpaceThreshold
LSS_cgfNotResponding
LSS_cgfServiceNotSupported
LSS_cgfSystemFailure
LSS_cgfVersionNotSupported
LSS_cpiGTPcResponseTOGn
LSS_cpiGTPcResponseTOS3
LSS_cpiUECapacityUsage
LSS_excessiveExternalLinksDown
LSS_externalLinkDown
LSS_ggsnDnsError
LSS_internalCommunicationFailure
LSS_ippuBusError
LSS_ippuResourceReset
LSS_liNearingCapacityLimit
LSS_msThreshold
LSS_noResetAckReceived
LSS_nseBandwidthThreshold
LSS_pathAvailability
LSS_pdpThreshold
LSS_sgsnDnsError


LSS_cdrStorageSpaceThreshold
Description
The CDR storage space threshold has been reached.

Default severity
MINOR, MAJOR

Root Cause
Loss of communication with the Charging Gateway

Fault clearance procedure


1 Continue to the next action only if the system does not clear the alarm.
2 Test the accessibility to the Charging Gateway (ping command).
3 Trace the route to the Charging Gateway (traceroute command).
4 If the problem persists, contact Alcatel-Lucent Customer Support.
END OF STEPS
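
Steps 2 and 3 can be scripted from a workstation that has the standard ping and traceroute tools. The Python sketch below is one possible wrapper, not a WMM tool: the function names are hypothetical, the `-c` option assumes a Linux or macOS style ping, and the host passed in is whatever address the operator has for the Charging Gateway.

```python
import shutil
import subprocess
from typing import Dict, List, Optional

def build_commands(host: str, count: int = 3) -> List[List[str]]:
    """Build the ping and traceroute command lines for the given host."""
    return [["ping", "-c", str(count), host], ["traceroute", host]]

def run_checks(host: str, timeout_s: int = 60) -> Dict[str, Optional[int]]:
    """Run each command and return its exit code, or None if it could not complete."""
    results: Dict[str, Optional[int]] = {}
    for cmd in build_commands(host):
        if shutil.which(cmd[0]) is None:
            results[cmd[0]] = None  # tool is not installed on this host
            continue
        try:
            done = subprocess.run(cmd, capture_output=True, text=True,
                                  timeout=timeout_s)
            results[cmd[0]] = done.returncode
        except subprocess.TimeoutExpired:
            results[cmd[0]] = None  # command did not finish in time
    return results
```

Calling `run_checks` with the Charging Gateway address returns a dictionary keyed by tool name; a ping exit code of 0 generally means at least one reply was received.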


LSS_cgfNotResponding
Description
SGSN/CGF interface: CGF not responding

Default severity
WARNING

Root Cause
Loss of communication with the Charging Gateway

Fault clearance procedure


1 Continue to the next action only if the system does not clear the alarm.
2 Test the accessibility to the Charging Gateway (ping command).
3 Trace the route to the Charging Gateway (traceroute command).
4 If the problem persists, contact Alcatel-Lucent Customer Support.
END OF STEPS


LSS_cgfServiceNotSupported
Description
The Charging Gateway is not able to process the CDRs transmitted by the SGSN.

Default severity
WARNING

Root Cause
The Charging Gateway is not able to process the CDRs transmitted by the SGSN.

Fault clearance procedure


1 Continue to the next action only if the system does not clear the alarm.
2 Contact Alcatel-Lucent Customer Support.
END OF STEPS


LSS_cgfSystemFailure
Description
SGSN/CGF interface: 'system failure' response

Default severity
WARNING

Root Cause
A problem has occurred at the Charging Gateway. The GTP' cause received by the SGSN is
"System failure".

Fault clearance procedure


1 Continue to the next action only if the system does not clear the alarm.
2 If the problem persists, contact Alcatel-Lucent Customer Support.
END OF STEPS


LSS_cgfVersionNotSupported
Description
The version of GTP' supported by the SGSN is not supported by the Charging Gateway.

Default severity
WARNING

Root Cause
The version of GTP' supported by the SGSN is not supported by the Charging Gateway.

Fault clearance procedure


1 Continue to the next action only if the system does not clear the alarm.
2 Check the GTP' version at the Charging Gateway.
3 If the problem persists, contact Alcatel-Lucent Customer Support.
END OF STEPS


LSS_cpiGTPcResponseTOGn
Description
The raised alarm, LSS_cpiGTPcResponseTOGn, indicates that the value of
VS.cpiGTPcResponseTOGn has exceeded a threshold in the last 15 minute interval. This
counter monitors the percentage of GTP Requests sent over a Gn interface for which no
Response is received by the WMM. The Gn interface connects the WMM with one or
more SGSNs. The calculated percentage is compared against provisioned thresholds for
Minor, Major, and Critical alarm conditions.
Notes:
The alarm clears when no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 15
Major Alarm: 10 < Rate value <= 15
Minor Alarm: 5 < Rate value <= 10
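
To show how the per-interval percentage and the clearing rule interact, the Python sketch below evaluates a sequence of 15-minute intervals against the default Gn thresholds. It is a simplified model, not the WMM implementation: the function names and the sample counts are hypothetical, and treating an interval with no requests as 0 percent is an assumption.

```python
from typing import List, Optional, Sequence, Tuple

def timeout_percentage(requests_sent: int, responses_missing: int) -> float:
    """Percentage of requests in one interval for which no response arrived."""
    if requests_sent == 0:
        return 0.0  # assumption: an idle interval counts as 0 percent
    return 100.0 * responses_missing / requests_sent

def alarm_states(intervals: Sequence[Tuple[int, int]],
                 minor: float = 5.0, major: float = 10.0,
                 critical: float = 15.0) -> List[Optional[str]]:
    """Return the severity after each interval, or None when the alarm is clear."""
    states: List[Optional[str]] = []
    for sent, missing in intervals:
        rate = timeout_percentage(sent, missing)
        if rate > critical:
            states.append("CRITICAL")
        elif rate > major:
            states.append("MAJOR")
        elif rate > minor:
            states.append("MINOR")
        else:
            states.append(None)  # no threshold met in this interval: alarm clears
    return states

# Hypothetical (requests sent, responses missing) counts per interval.
history = [(1000, 20), (1000, 80), (1000, 120), (1000, 200), (1000, 30)]
print(alarm_states(history))
```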

Root Cause
Failure to receive GTP responses from an SGSN could be due to any of the following
reasons:
Errors or problems at the far end SGSN
Network problems between the WMM and the SGSN
Internal errors at the WMM

Fault clearance procedure


...................................................................................................................................................................................................

1 Check neighboring SGSNs for error conditions or ongoing problems. Verify network
connectivity and proper configuration between WMM and SGSNs. If SGSNs and
network connectivity are verified, examine all the GTP failure counters to determine if
one failure cause predominates, and check fs.log to determine if errors related to the Gn
interface have been reported. Contact next level of support.
END OF STEPS


LSS_cpiGTPcResponseTOS3
Description
The raised alarm, LSS_cpiGTPcResponseTOS3, indicates that the GTP response failure rate
has met a threshold in the last 5-minute interval. This failure rate is the percentage
of GTP Requests sent over an S3 interface for which no Response is received by the
MME. The S3 interface connects the MME with one or more SGSNs. The calculated
percentage is compared against provisioned thresholds for Minor, Major, and Critical
alarm conditions.
Notes:
An alarm with the same severity will be raised only once for the same CPI and
component.
The alarm will be cleared if no threshold is met in one of the following intervals.
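
The raise-once and clear behavior in the notes above can be sketched as a small state tracker. This is a hypothetical illustration, not the WMM implementation: the AlarmTracker class and its event strings are invented here, and the sketch assumes that a change of severity simply replaces the previously raised severity.

```python
from typing import Dict, List, Optional, Tuple

Key = Tuple[str, str]  # (CPI name, component)


class AlarmTracker:
    """Tracks the active severity per (CPI, component) pair (illustrative only)."""

    def __init__(self) -> None:
        self.active: Dict[Key, str] = {}

    def end_of_interval(self, cpi: str, component: str,
                        severity: Optional[str]) -> List[str]:
        """Return the events to emit for one measurement interval.

        severity is the threshold met in the interval, or None if none was met.
        """
        key = (cpi, component)
        current = self.active.get(key)
        if severity is None:
            if current is None:
                return []
            del self.active[key]
            return [f"CLEAR {current}"]
        if severity == current:
            return []  # same severity already raised once: do not repeat
        # Assumption: a new severity replaces the one raised earlier.
        self.active[key] = severity
        return [f"RAISE {severity}"]
```

The component string passed in is whatever identifies the reporting entity; any value used in an example is a placeholder.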

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: Rate value > 10
Major Alarm: 5 < Rate value <= 10
Minor Alarm: 2 < Rate value <= 5

Root Cause
Failure to receive GTP responses from an SGSN could be due to any of the following
reasons:
Errors or problems at the far end SGSN
Network problems between the MME and the SGSN
Internal errors at the MME

Fault clearance procedure


...................................................................................................................................................................................................

1 Check neighboring SGSNs for error conditions or ongoing problems. Verify network
connectivity and proper configuration between MME and SGSNs. If SGSNs and network
connectivity are verified, examine all the GTP failure counters to determine if one failure
cause predominates, and check fs.log to determine if errors related to the S3 interface
have been reported. Contact next level of support.
END OF STEPS


LSS_cpiUECapacityUsage
Description
The raised alarm, LSS_cpiUECapacityUsage, indicates that the UE capacity utilization rate,
measured on a per-board basis, has met a threshold in the last 5 minutes. The utilization
rate is calculated every 5 minutes using this formula:
( Maximum number of registered UEs on a board / UE capacity of a single board ) * 100%
Notes:
An alarm with the same severity will be raised only once for the same CPI and
component.
The alarm will be cleared if no threshold is met in one of the subsequent intervals.
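
The utilization formula in the description can be illustrated with a short calculation. This is a minimal sketch; the function name is hypothetical, and the figures used in any call are made-up values rather than actual board capacities.

```python
def ue_capacity_usage(max_registered_ues: int, board_ue_capacity: int) -> float:
    """Return the per-board UE capacity utilization rate, in percent."""
    if board_ue_capacity <= 0:
        raise ValueError("board_ue_capacity must be positive")
    # Multiply before dividing so that whole-percent results stay exact.
    return max_registered_ues * 100 / board_ue_capacity
```

For example, a board with a hypothetical capacity of 1,000,000 UEs and a peak of 950,000 registered UEs in the interval gives a utilization of 95%.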

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Critical Alarm: CPI value > 99%
Major Alarm: 95% < CPI value <= 99%
Minor Alarm: 90% < CPI value <= 95%

Root Cause
The alarm is raised when the maximum number of registered UEs on a single board
crosses the predefined threshold.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check how many boards the WMM has and consider installing more boards to increase the
WMM capacity.
END OF STEPS


LSS_excessiveExternalLinksDown
Description
An excessive number of links of a given type (e.g. s1mme, s11, etc.) are down. This is
usually due to a network connectivity problem rather than a problem with the individual
links between the WMM and the external entity. Once this alarm is triggered the WMM will stop reporting
alarms and status for links of the given type. Once the network problem is resolved and
the number of links down is no longer excessive, this alarm will clear and the status of all
links of the given type will be updated. This alarm is raised when at least 100 links of a
given type are down. This alarm clears when 95 or fewer links are down.
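
The raise and clear limits above form a hysteresis band: between 96 and 99 links down, the alarm keeps whatever state it already has. A minimal sketch of that logic follows; the constant and function names are hypothetical and chosen only for illustration.

```python
RAISE_AT = 100  # alarm is raised when at least this many links are down
CLEAR_AT = 95   # alarm clears when this many or fewer links are down


def next_alarm_state(alarm_active: bool, links_down: int) -> bool:
    """Return whether the alarm is active after observing links_down."""
    if not alarm_active:
        return links_down >= RAISE_AT
    return links_down > CLEAR_AT
```

The gap between the two limits avoids the alarm toggling repeatedly when the number of down links hovers around a single value.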

Default severity
CRITICAL

Root Cause
The possible causes of this alarm are:
1. A large number of network entities are out-of-service or undergoing initialization.
2. Packet or HeartBeat message loss due to network issues.
3. Provisioning data is incorrect on the MME for the network entities the MME communicates
with.
4. A software failure prevents communication from being established between the MME and other
network entities.

Fault clearance procedure


...................................................................................................................................................................................................

1 Determine that there are no errors within the IP network.


...................................................................................................................................................................................................

2 If the network entity data is provisioned on MME, verify the data is correct.
...................................................................................................................................................................................................

3 Verify the network entity that MME fails to communicate with is in service.
END OF STEPS


LSS_externalLinkDown
Description
Communication between the WMM and another network entity cannot be established.

Default severity
CRITICAL, MAJOR

Root Cause
The possible causes of this alarm are:
1. Remote network entity is out-of-service or undergoing initialization.
2. Packet or HeartBeat message loss due to network issues.
3. Provisioning data is incorrect on the WMM for the network entities the WMM communicates
with.
4. A software failure prevents communication from being established between the WMM and other
network entities.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify the network entity that WMM fails to communicate with is in service.
...................................................................................................................................................................................................

2 Determine that there are no errors within the IP network.


...................................................................................................................................................................................................

3 If the network entity data is provisioned on WMM, verify the data is correct.
...................................................................................................................................................................................................

4 If multiple links that terminate on the MIF (X1_1 or X2) are down, try switching MIF to
hot-standby mate.
...................................................................................................................................................................................................

5 If multiple links that terminate on the MPH (non-X1_1 and non-X2) are down, try
switching MPH to hot-standby mate.
END OF STEPS


LSS_ggsnDnsError
Description
GGSN DNS selection is unable to retrieve an IP address. This alarm must be manually
cleared.

Default severity
MINOR

Root Cause
WMM is unable to retrieve GGSN IP Address.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the GGSN IP address is provisioned correctly on the DNS server.


...................................................................................................................................................................................................

2 Manually clear the alarm.


END OF STEPS


LSS_internalCommunicationFailure
Description
Communication between the active MIF member and the active MAF/SAF member has failed, or
communication between the active MIF member and the active MPH member has failed.

Default severity
CRITICAL, MAJOR

Root Cause
The possible causes of this alarm are:
1. MPH, MIF or MAF/SAF pool has duplex failed or is undergoing initialization.
2. Software failure prevents communication establishment between MIF and MAF/SAF
or MIF and MPH.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that MPH, MIF and/or MAF/SAF have not been forced out-of-service.
...................................................................................................................................................................................................

2 If communication is lost between the MPH and the MIF and it does not come back
automatically, and MPH pool is in Active / Hot-standby state, try switching MPH to the
standby member.
...................................................................................................................................................................................................

3 If communication is lost between the MAF/SAF and the MIF and it does not come back
automatically, and MAF/SAF pool is in Active / Hot-standby state, try switching
MAF/SAF to the standby member.
...................................................................................................................................................................................................

4 If communication is lost both between the MIF and the MPH and between the MIF and the
MAFs/SAFs and it does not come back automatically, and MIF pool is in Active / Hot-standby
state, try switching MIF to the standby member.
END OF STEPS
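
Steps 2 through 4 of the procedure above can be read as a small decision rule. The sketch below is one possible interpretation, not an official algorithm: the function and parameter names are hypothetical, and it assumes that when both links to the MIF are lost, step 4 (switching the MIF) takes precedence over steps 2 and 3.

```python
from typing import Optional


def pool_to_switch(mph_mif_lost: bool, maf_mif_lost: bool,
                   mph_hot_standby: bool, maf_hot_standby: bool,
                   mif_hot_standby: bool) -> Optional[str]:
    """Suggest which pool to switch to its standby member, if any.

    The *_lost flags mean communication was lost and did not recover
    automatically. The *_hot_standby flags mean the pool is in the
    Active / Hot-standby state.
    """
    if mph_mif_lost and maf_mif_lost:
        # Assumption: both failures involve the MIF, so apply step 4.
        return "MIF" if mif_hot_standby else None
    if mph_mif_lost:
        return "MPH" if mph_hot_standby else None      # step 2
    if maf_mif_lost:
        return "MAF/SAF" if maf_hot_standby else None  # step 3
    return None
```

A result of None means that none of the switch actions applies, for example because the relevant pool has no hot-standby member.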


LSS_ippuBusError
Description
There is a bus error on the indicated host between the HSPP4 hardware (iPPU) in the
AMC slot and the host hardware.

Default severity
CRITICAL

Root Cause
List of root causes:
The HSPP4 AMC itself has failed.
The iPPU service on HSPP4 is in a transient state.
The iPPU service on HSPP4 has failed.
There is no HSPP4 AMC and a user is attempting to run the iPPU/PMB software for
SGSN.

Fault clearance procedure


...................................................................................................................................................................................................

1 Determine if any related alarms are also present, such as on the ESC, chassis, or board
itself. Correct those alarms first and see if this alarm clears as a result.
...................................................................................................................................................................................................

2
1. On Alcatel-Lucent 9471 WMM:
Utilize ippu_cli to print the status of the board on the OAM host.
...................................................................................................................................................................................................

3
1. On Alcatel-Lucent 9471 WMM:
Verify the appropriate FRUID via shelf manager is present in the given ShelfId
CardId.
...................................................................................................................................................................................................

4
1. On Alcatel-Lucent 9471 WMM:
Visually verify that HSPP4 hardware is present in the AMC slot identified by the
ShelfId and CardId in the alarm.

...................................................................................................................................................................................................

5
1. On Alcatel-Lucent 9471 WMM:
On the Shelf Manager, verify the shelf and card in the alarm has an HSPP4 iPPU in
the AMC slot. If HSPP4 is not detected, attempt to power-cycle the card.
...................................................................................................................................................................................................

6
1. On Alcatel-Lucent 9471 WMM:
On the Shelf Manager, verify the shelf and card in the alarm has an HSPP4 iPPU in
the AMC slot. If HSPP4 is not detected, attempt to re-seat the card in the alarm by
ShelfId and CardId.
...................................................................................................................................................................................................

7
1. On Alcatel-Lucent 9471 WMM:
On the Shelf Manager, verify the shelf and card in the alarm has an HSPP4 iPPU in
the AMC slot. If HSPP4 is not detected, replace the card used for this host using the
appropriate FRU procedure as necessary.
...................................................................................................................................................................................................

8
1. On Alcatel-Lucent 9471 WMM:
Attempt to reset the entire host (ShelfId/CardId) via appropriate CLI or MI. Before
attempting this action, verify that there is an ACTIVE or STANDBY mate present in
the system.
...................................................................................................................................................................................................

9 If the above steps do not clear the alarm, contact Alcatel-Lucent Customer Support.
END OF STEPS


LSS_ippuResourceReset
Description
There was a software reset on the iPPU in the HSPP4 AMC or a restart by the PMB
process in the host identified by ShelfId and CardId.

Default severity
MAJOR

Root Cause
List of root causes:
The iPPU HSPP4 software has reset.
The iPPU HSPP4 software has restarted.
The PMB process on the given host has restarted.

Fault clearance procedure


...................................................................................................................................................................................................

1 Determine if any related alarms are present. Correct those alarms first and see if this
alarm clears as a result.
...................................................................................................................................................................................................

2
1. On Alcatel-Lucent 9471 WMM:
Utilize ippu_cli to print the status of the board on the OAM host.
...................................................................................................................................................................................................

3
1. On Alcatel-Lucent 9471 WMM:
Before attempting this action, verify that there is an ACTIVE or STANDBY mate
present in the system. Attempt to reset the entire card (shelf/slot) via appropriate CLI
interface or MI.
...................................................................................................................................................................................................

4 If the above steps do not clear the alarm, contact Alcatel-Lucent Customer Support.
END OF STEPS


LSS_liNearingCapacityLimit
Description
The number of lawful interceptions has reached 80% of MAF/SAF capacity.

Default severity
WARNING

Root Cause
The possible causes of this alarm are:
1. Use of lawful interception beyond design capacity.
2. Software failure causing unnecessary interception.

Fault clearance procedure


...................................................................................................................................................................................................

1 Use the query option of the li_target_cli command to verify that the appropriate set of
UEs are selected for lawful interception.
END OF STEPS


LSS_msThreshold
Description
The threshold for the number of attached MSs or UEs has been reached.

Default severity
MINOR, MAJOR

Root Cause
The number of UEs attached to the SGSN has reached the minor or major threshold value.
This value is given as a percentage of the maximum number of UEs that the SGSN can attach.
The SGSN processing capacity may be undersized.

Fault clearance procedure


...................................................................................................................................................................................................

1 If the problem persists, contact Alcatel-Lucent Customer Support.


END OF STEPS


LSS_noResetAckReceived
Description
No RESET ACKNOWLEDGEMENT message was received from the RNC after the
WMM has sent and resent a RESET message.

Default severity
MINOR

Root Cause
The possible causes of this alarm are:
1. Remote network entity is out-of-service or undergoing initialization.
2. Message loss due to network issues.
3. Software failure prevents communication between WMM and the RNC.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the RNC from which the WMM fails to receive the message is in service.
...................................................................................................................................................................................................

2 Determine that there are no errors within the IP network.


END OF STEPS


LSS_nseBandwidthThreshold
Description
NSE bandwidth threshold reached

Default severity
MINOR, MAJOR

Root Cause
The NSE bandwidth has reached a minor / major value. This value is given as a
percentage of the MAX NSE

Fault clearance procedure


...................................................................................................................................................................................................

1 Analyze the operational context of the alarm. Determine whether this alarm is structural
(persistent) or conjunctural (temporary).
...................................................................................................................................................................................................

2 Analyze the figures reported by the observation counters to evaluate how quickly the NSE
bandwidth has increased. Depending on the result of the investigation:
...................................................................................................................................................................................................

3 If the NSE bandwidth remains over this threshold most of the time, and if an alarm with
major severity also appears, the SGSN configuration must be upgraded.
Contact Alcatel-Lucent Customer Support.
END OF STEPS


LSS_pathAvailability
Description
This alarm is raised when an SCTP path becomes unavailable. Check that the local and remote
provisioned addresses use the correct two sub-networks provided.
If the provisioned addresses match the two physical subnets, and all provisioned addresses
are otherwise correct, investigate the physical network that carries the subnet named in the
path "unavailable" alarm. The specifics of the path are
documented in the "additionalText" field of the alarm. These alarms may need to be
cleared manually. Alarms are reported when path connectivity is established, but
their contents are a function of the provisioned addresses (paths). If those addresses were
wrong and were changed while the connection was down, they may no longer match the path
that was originally alarmed.
Note: this alarm is cleared when the path's SCTP association changes operational state,
either from "Enabled" to "Disabled" or from "Disabled" to "Enabled". This change could
result from an administrative lock or unlock action, or a change in the collective
availability of the association's paths. Any path unreachable alarms will be cleared. If the
new association state is "Disabled", a single link/association alarm (LSS_mmeExternal-
LinkDown) will be raised.

Default severity
MAJOR

Root Cause
The provisioning of the SCTP endpoints (i.e. IP addresses) on either the WMM or the
remote entity is incorrect, or the network between the endpoints is experiencing
problems.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the endpoint IP addresses on the WMM and the remote entity are provisioned
correctly.
...................................................................................................................................................................................................

2 Verify that the network between the WMM and the remote entity is functioning correctly.
END OF STEPS
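The address check described for this alarm can be sketched offline. The following is a minimal illustration only, not a WMM tool: the function name is hypothetical, and the subnets and addresses are made-up documentation ranges standing in for the two provisioned sub-networks and the local and remote SCTP endpoint addresses.

```python
import ipaddress

def check_path_addresses(addresses, subnets):
    """Map each provisioned address to the subnet that contains it, or None."""
    networks = [ipaddress.ip_network(s) for s in subnets]
    result = {}
    for addr in addresses:
        ip = ipaddress.ip_address(addr)
        # First subnet containing the address, or None if it is in neither one.
        result[addr] = next((str(n) for n in networks if ip in n), None)
    return result

# Hypothetical example values (documentation ranges, not real WMM provisioning).
subnets = ["192.0.2.0/25", "192.0.2.128/25"]
local = ["192.0.2.10", "192.0.2.140"]
remote = ["192.0.2.20", "198.51.100.7"]  # second address lies outside both subnets

report = check_path_addresses(local + remote, subnets)
mismatched = [addr for addr, net in report.items() if net is None]
print(mismatched)
```

Any address reported as mismatched points to a provisioning error. If nothing is mismatched, the network carrying the affected subnet is the next thing to investigate.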


LSS_pdpThreshold
Description
The threshold for the number of activated PDP contexts has been reached.

Default severity
MINOR, MAJOR

Root Cause
The number of activated PDP contexts has reached the minor or major threshold. This
threshold is given as a percentage of the maximum number of PDP contexts that the SGSN
can support. The SGSN processing capacity may be undersized.

Fault clearance procedure


...................................................................................................................................................................................................

1 Analyze the operational context of the alarm. Determine whether this alarm is structural
(persistent) or circumstantial (temporary).
...................................................................................................................................................................................................

2 Analyze the observation counter values to evaluate how quickly the number of activated
PDP contexts has increased. Depending on the result of the investigation:
...................................................................................................................................................................................................

3 If the activated PDP context overload corresponds to a specific peak, no upgrade of the
SGSN is needed. If alarm LSS_pdpThreshold is present with major severity, the activated
PDP context overload is constant: there is a gap between the demand for PS services and
the SGSN processing capacity, and the SGSN configuration must be upgraded.
Contact Alcatel-Lucent Customer Support.
END OF STEPS
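The peak-versus-constant distinction in the steps above can be illustrated with a small calculation over counter samples. This is a sketch under stated assumptions, not an Alcatel-Lucent procedure: the function names, the sample values, the 80 percent threshold, the capacity of 1000 contexts, and the "at least half of the samples" rule for a constant overload are all hypothetical.

```python
def pdp_usage_percent(active_contexts, max_contexts):
    """Activated PDP contexts as a percentage of the supported maximum."""
    return 100.0 * active_contexts / max_contexts

def classify_overload(samples, max_contexts, threshold_percent, sustained_fraction=0.5):
    """Return 'none', 'peak', or 'sustained' for a series of counter samples."""
    over = [pdp_usage_percent(s, max_contexts) >= threshold_percent for s in samples]
    if not any(over):
        return "none"
    # Assumption: overload counts as constant when at least this share of samples exceed it.
    if sum(over) / len(over) >= sustained_fraction:
        return "sustained"
    return "peak"

# Hypothetical counter samples against an assumed capacity of 1000 contexts.
print(classify_overload([500, 600, 850, 620, 580], 1000, 80.0))
print(classify_overload([820, 860, 900, 870, 790], 1000, 80.0))
```

A "peak" result corresponds to the case where no upgrade is needed. A "sustained" result corresponds to the constant overload for which Customer Support should be contacted.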


LSS_sgsnDnsError
Description
SGSN DNS selection is unable to retrieve the SGSN IP address associated with the FQDN.
This alarm must be cleared manually.

Default severity
MINOR

Root Cause
The WMM is unable to retrieve the SGSN IP address associated with the FQDN.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the FQDN is provisioned correctly in the DNS server.


...................................................................................................................................................................................................

2 Manually clear the alarm.


END OF STEPS
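One way to check the provisioning from a management host is to resolve the FQDN with the system resolver. This is a minimal sketch, assuming that host queries the same DNS server as the WMM; the function name and the example FQDN are hypothetical, and the resolver argument exists only so the lookup can be replaced in a test.

```python
import socket

def resolve_fqdn(fqdn, resolver=socket.getaddrinfo):
    """Return the sorted IP addresses found for fqdn, or [] if resolution fails."""
    try:
        infos = resolver(fqdn, None)
    except socket.gaierror:
        return []
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the address.
    return sorted({info[4][0] for info in infos})

if __name__ == "__main__":
    # Hypothetical FQDN; substitute the name reported with the alarm.
    addresses = resolve_fqdn("sgsn.example.invalid")
    print(addresses if addresses else "no address returned for this FQDN")
```

An empty result is consistent with the alarm condition. A non-empty result suggests the record now exists, after which the alarm can be cleared manually.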

4 BASE_ATCA Alarms

Overview
Purpose
This chapter contains platform alarms that may be applicable to Alcatel-Lucent products
that use the ATCA platform.
The WMM application is built on a common platform used by many different
applications. The WMM does not use all of the capabilities of the platform; therefore,
some base ATCA alarms may not be applicable. In addition, certain functionality
referenced within some alarms may not be applicable to the WMM, such as the following:
CDR, SS7, FS5K, FS GUI, NGSS, TL1, and CPSB.

Contents

ATCA_AggregatePowerSensor 4-6
ATCA_AggregateTemperatureSensor 4-7
ATCA_BoardPower 4-8
ATCA_CPLDState 4-9
ATCA_DS75Temperature 4-11
ATCA_ExhaustTemp 4-13
ATCA_FPGATemp 4-15
ATCA_FanSpeed 4-17
ATCA_FanTrayPresence 4-18
ATCA_FanTraysFRU 4-19
ATCA_FilterPresence 4-21
ATCA_I2CLocalBus 4-22
ATCA_IPMBLink 4-23
ATCA_InletTemp 4-24

ATCA_LM75Temperature 4-26
ATCA_LM83Temperature 4-28
ATCA_LMeUC75Temperature 4-30
ATCA_LMeUC75Top-Rig 4-32
ATCA_LocalTemperature 4-34
ATCA_MMCTemp 4-35
ATCA_OcteonTemperature 4-37
ATCA_OutletTemp 4-38
ATCA_PayloadCurrent 4-40
ATCA_PayloadVoltage 4-42
ATCA_PowerOk 4-44
ATCA_ShelfFRUs 4-45
ATCA_UnexpectedDeact 4-47
ATCA_m48vSensor 4-48
LSS_cardConnectionLost 4-49
LSS_cardError 4-51
LSS_cpiAlrmCritical 4-52
LSS_cpiAlrmMajor 4-53
LSS_cpiAlrmMinor 4-54
LSS_cpiAlrmWarning 4-55
LSS_cpiAsrtEsc 4-56
LSS_cpiAsrtNonEsc 4-58
LSS_cpiAsrtNonEscCritical 4-60
LSS_cpiAsrtNonEscMajor 4-62
LSS_cpiAsrtNonEscMinor 4-64
LSS_cpiAudErrCount 4-66
LSS_cpiAudManAct 4-68
LSS_cpiAudNewEvent 4-70
LSS_cpiExceptionService 4-72
LSS_cpiFileSysUsage 4-74
LSS_cpiMemAllocFail 4-75
LSS_cpiReinitServiceSelf 4-76
LSS_cpuOverload 4-78
LSS_databaseConnectionLost 4-79
LSS_databaseReplicationLinkDown 4-80
LSS_databaseSizeExhausted 4-81
LSS_dbHighCpuUtilization 4-82
LSS_dbOffline 4-83
LSS_dbStatusUnexpected 4-84
LSS_degradedResource 4-85
LSS_degrow 4-126
LSS_diskGoingDown 4-127
LSS_diskSector 4-128
LSS_dnsThreshold 4-129
LSS_ethernetError 4-130
LSS_ethernetLinkDown 4-131
LSS_externalConnectivity 4-133
LSS_fru 4-134
LSS_grow 4-135
LSS_hostDown 4-136
LSS_memoryOverload 4-137
LSS_nodeGroupOOS 4-138
LSS_nodeOOS 4-139
LSS_numberOfTuplesInUse 4-140
LSS_osSecInfoModificationDetected 4-141
LSS_osSecInformationMissing 4-142
LSS_osSecUnexpectedInformation 4-143
LSS_patch 4-144
LSS_pktCorruptionDetectedViaRCCLANCheck 4-145
LSS_platformCommandFailure 4-146
LSS_pmDataNotCollected 4-147
LSS_processDown 4-148
LSS_processNotStarted 4-149
LSS_remoteQueryServerFailure 4-152
LSS_remotedbLinkDown 4-153
LSS_restore 4-154
LSS_serviceOnewayCommunication 4-155
LSS_sheddingOverload 4-156
LSS_shmcEthernetError 4-157
LSS_simxml 4-158
LSS_softwareAllocatedResourceOverload 4-159
LSS_softwareComponentStandbyNotReady 4-160
LSS_svcdegrow 4-161
LSS_svcgrow 4-162
LSS_swVersionMismatch 4-163
LSS_tftpDownloadCorrupt 4-164
LSS_threadsExhausted 4-166
LSS_upgrade 4-167
LSS_virtualClusterDown 4-168
RALARM_Loop 4-169
RALARM_Power 4-170
SYS_BackupFailure 4-171
SYS_CPM_USERDATA_INCONSITENCY 4-172
SYS_CPM_USERDATA_RESTORED 4-173
SYS_Configuration 4-174
SYS_EventQueueCapacity 4-176
SYS_ICMPFailure 4-177
SYS_IPsecConfig 4-178
SYS_LinkDown 4-179
SYS_NotifyDisabled 4-180
SYS_NotifyLocked 4-181
SYS_RADIUS_TO_LDAP_FAILURE 4-182
SYS_ROOT_ACCESS_DENIED 4-183
SYS_ROOT_FTP_VIOLATION 4-184
SYS_ROOT_LOGIN_VIOLATION 4-185
SYS_ROOT_SSH_LOGIN_VIOLATION 4-186
SYS_SNETrapOverload 4-187
SYS_SNMPAuthenticationFailure 4-188
SYS_SNMPFailure 4-189
SYS_SU_TO_ROOT_FAILURE 4-190
SYS_SYSTEMTrapOverload 4-191
SYS_SetupAAAFailure 4-192
SYS_TestAlarm 4-193
SYS_ThresholdCrossed 4-194
SYS_UndiscoveredObject 4-195
SYS_WriteAAAFailure 4-196


ATCA_AggregatePowerSensor
Description
The aggregate power sensor alarm provides a summary status of all power-related
conditions adversely affecting a resource. When this alarm occurs, in most cases another
power-related alarm provides more details about the exact resource power sensor that is
reporting the condition. From the MI GUI, alarms on a resource may be retrieved by
selecting the managed object for that resource and then using the right-click operation
to display related alarms.

Default severity
CRITICAL, MAJOR, MINOR

Root Cause
There is a power related problem with the resource.

Fault clearance procedure


...................................................................................................................................................................................................

1 Investigate all other temperature-related and power-related alarms on the resource and
follow those alarms' fault recovery procedures. Once all of these related alarms are
cleared, this alarm clears.
END OF STEPS
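The relationship between the aggregate alarm and the detailed alarms can be pictured as a roll-up. The sketch below is illustrative only: the source states that the aggregate alarm clears once all related alarms are cleared, but reporting the highest active severity is an assumption, and the function name and example alarm states are hypothetical.

```python
SEVERITY_RANK = {"MINOR": 1, "MAJOR": 2, "CRITICAL": 3}

def aggregate_severity(related_alarms):
    """Return the highest active severity, or None when every related alarm is cleared.

    related_alarms maps an alarm name to its severity, with None meaning cleared.
    """
    active = [sev for sev in related_alarms.values() if sev is not None]
    if not active:
        return None
    return max(active, key=SEVERITY_RANK.__getitem__)

# Hypothetical alarm states on one resource.
print(aggregate_severity({"ATCA_PayloadVoltage": "MAJOR", "ATCA_PowerOk": "MINOR"}))
print(aggregate_severity({"ATCA_PayloadVoltage": None, "ATCA_PowerOk": None}))
```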


ATCA_AggregateTemperatureSensor
Description
The aggregate temperature sensor alarm provides a summary status of all
temperature-related conditions adversely affecting a resource. When this alarm occurs, in
most cases another temperature-related alarm provides more details about the exact
resource temperature sensor that is reporting the condition. From the MI GUI, alarms on a
resource may be retrieved by selecting the managed object for that resource and then
using the right-click operation to display related alarms.

Default severity
CRITICAL, MAJOR, MINOR

Root Cause
There is a temperature related problem with the resource.

Fault clearance procedure


...................................................................................................................................................................................................

1 Investigate all other temperature-related and power-related alarms on the resource and
follow those alarms' fault recovery procedures. Once all of these related alarms are
cleared, this alarm clears.
END OF STEPS


ATCA_BoardPower
Description
A board is either in the inactive or not present state. This means that the board has been
powered down.

Default severity
MAJOR

Root Cause
Possible root causes:
Blade has been powered off.
Blade has been removed from the chassis.
There is a faulty connection between the blade and the chassis.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the blade is powered on. This can be done remotely using the CLI on the
shelf manager, or locally by observing the status of the relevant LEDs.
...................................................................................................................................................................................................

2 Verify that the blade is seated correctly in the chassis. Try to re-seat the blade in the
chassis.
...................................................................................................................................................................................................

3 Replace the blade if necessary; refer to the FRU procedure. Contact Alcatel-Lucent
Customer Support.
END OF STEPS


ATCA_CPLDState
Description
This alarm indicates a change in the redundancy status of the shelf management cards.
The specific problem of the alarm contains the specific redundancy state of the shelf
management card.
Possible states are as follows:
STATE_00: The current Shelf Manager is Active with no Backup.
STATE_01: The current Shelf Manager is Active with a Backup.
STATE_02: The current Shelf Manager is a Backup.
STATE_04: The Shelf Manager is a Backup but the remote presence bit is not set.
STATE_05: The Shelf Manager is a Backup but the remote switchover request bit is not set.
STATE_06: The Shelf Manager is a Backup but the CPLD Active bit is set.
STATE_07: The Shelf Manager is Active with a Backup but the remote presence bit is not set.
STATE_08: The Shelf Manager is Active with a Backup but the remote healthy bit is not set.
STATE_09: The Shelf Manager is Active with a Backup but the CPLD Active bit is not set.
STATE_10: The local presence bit is not set for the current Shelf Manager.
STATE_11: The Shelf Manager is Active with no Backup but the remote healthy bit is set.
STATE_12: The Shelf Manager is Active with no Backup but the remote switchover request bit is set.
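When processing exported alarms, the state list above can be held as a lookup table. This is a minimal sketch: the function name is hypothetical, and it assumes the specific problem text contains the state code literally (for example "STATE_08"), which is not confirmed by this document. STATE_03 is not defined above and is therefore reported as not listed.

```python
import re

# State descriptions taken from the list above.
CPLD_STATES = {
    "STATE_00": "Active with no Backup",
    "STATE_01": "Active with a Backup",
    "STATE_02": "Backup",
    "STATE_04": "Backup, but the remote presence bit is not set",
    "STATE_05": "Backup, but the remote switchover request bit is not set",
    "STATE_06": "Backup, but the CPLD Active bit is set",
    "STATE_07": "Active with a Backup, but the remote presence bit is not set",
    "STATE_08": "Active with a Backup, but the remote healthy bit is not set",
    "STATE_09": "Active with a Backup, but the CPLD Active bit is not set",
    "STATE_10": "Local presence bit is not set",
    "STATE_11": "Active with no Backup, but the remote healthy bit is set",
    "STATE_12": "Active with no Backup, but the remote switchover request bit is set",
}

def describe_cpld_state(specific_problem):
    """Return (code, description) for the first STATE_nn code in the text, or None."""
    match = re.search(r"STATE_\d{2}", specific_problem)
    if match is None:
        return None
    code = match.group(0)
    return code, CPLD_STATES.get(code, "not listed in this alarm description")

# Hypothetical specific problem text.
print(describe_cpld_state("CPLD state change: STATE_08"))
```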

Default severity
MINOR

Root Cause
Possible root causes:
One of the shelf management cards is not present.
One of the shelf management cards is not seated appropriately.
One of the shelf management cards has a hardware problem.

Fault clearance procedure
...................................................................................................................................................................................................

1 Verify that the shelf management card is inserted properly.


...................................................................................................................................................................................................

2 If the shelf management card is inserted, reseat the shelf management card.
...................................................................................................................................................................................................

3 If reseating the shelf management card does not correct the problem, replace the shelf
management card.
END OF STEPS


ATCA_DS75Temperature
Description
This alarm indicates that the AMC (Advanced Mezzanine Card) temperature monitoring
sensor has detected a threshold being crossed. By default, the thresholds are as follows:

Lower Minor Threshold            not supported
Lower Major Threshold            not supported
Lower Critical Threshold         not supported
Upper Minor Threshold(RW)        40.000
Upper Major Threshold(RW)        60.000
Upper Critical Threshold(RW)     70.000
Positive Threshold Hysteresis    2.000
Negative Threshold Hysteresis    2.000
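The way the default upper thresholds and the hysteresis value could interact is sketched below. This is an assumption-laden illustration, not the sensor's documented behavior: it assumes IPMI-style semantics, in which a severity is asserted when the reading reaches its threshold and is held until the reading falls below the threshold minus the positive hysteresis. The function and constant names are hypothetical; the numeric values are the defaults listed above.

```python
# Default upper thresholds from the table above, highest first.
UPPER_THRESHOLDS = [("CRITICAL", 70.0), ("MAJOR", 60.0), ("MINOR", 40.0)]
POSITIVE_HYSTERESIS = 2.0
ORDER = ["CLEARED", "MINOR", "MAJOR", "CRITICAL"]

def next_severity(reading, current="CLEARED"):
    """Return the severity after a new reading, given the current severity."""
    # Highest severity whose threshold the reading has reached.
    asserted = "CLEARED"
    for name, limit in UPPER_THRESHOLDS:
        if reading >= limit:
            asserted = name
            break
    # Highest already-active severity still held by hysteresis (assumed semantics).
    held = "CLEARED"
    for name, limit in UPPER_THRESHOLDS:
        already_active = ORDER.index(name) <= ORDER.index(current)
        if already_active and reading >= limit - POSITIVE_HYSTERESIS:
            held = name
            break
    return max(asserted, held, key=ORDER.index)

print(next_severity(41.0))           # crosses the minor threshold
print(next_severity(59.0, "MAJOR"))  # below 60.0 but not below 58.0
```

Under these assumptions, a reading that hovers just below a threshold does not cause the alarm to toggle repeatedly between two severities.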

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the AMC HDD has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 There is a condition in which this alarm, with minor severity, is being erroneously
reported by the hardware, so ignore any minor alarms pertaining to this sensor.
...................................................................................................................................................................................................

2 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.

...................................................................................................................................................................................................

3 Check that the room air conditioning system is operating properly.


...................................................................................................................................................................................................

4 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

5 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

6 If the problem persists, contact Alcatel-Lucent Customer Support.


END OF STEPS


ATCA_ExhaustTemp
Description
This alarm indicates that the ASS7BF AMC (Advanced Mezzanine Card) temperature
monitoring sensor has detected a threshold being crossed. By default, the thresholds are
as follows:

Lower Minor Threshold            not supported
Lower Major Threshold            not supported
Lower Critical Threshold         not supported
Upper Minor Threshold(RW)        40.000
Upper Major Threshold(RW)        60.000
Upper Critical Threshold(RW)     70.000
Positive Threshold Hysteresis    2.000
Negative Threshold Hysteresis    2.000

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the AMC SS7 has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 There is a condition in which this alarm, with minor severity, is being erroneously
reported by the hardware, so ignore any minor alarms pertaining to this sensor.
...................................................................................................................................................................................................

2 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

3 Check that the room air conditioning system is operating properly.


...................................................................................................................................................................................................

4 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

5 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

6 If the problem persists, contact Alcatel-Lucent Customer Support.


END OF STEPS


ATCA_FPGATemp
Description
This alarm indicates that the DCI AMC (Advanced Mezzanine Card) FPGA Temp
monitoring sensor has detected a threshold being crossed. This indicates there is a
problem with the die temperature of the DCI FPGA. The values for the FPGA Temp
sensor thresholds can be retrieved from the Shelf Management Card and have the
following format:

Lower Minor Threshold            <value1>
Lower Major Threshold            <value1>
Lower Critical Threshold         <value1>
Upper Minor Threshold            <value1>
Upper Major Threshold            <value1>
Upper Critical Threshold(RW)     <value1>
Positive Threshold Hysteresis    <value1>
Negative Threshold Hysteresis    <value1>

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the AMC has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

2 Check that the room air conditioning system is operating properly.

...................................................................................................................................................................................................

3 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

4 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

5 If the problem persists, contact Alcatel-Lucent Customer Support.


END OF STEPS
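
The order of checks in the procedure above can be sketched as a small decision function. This is only an illustration: the function name, its boolean inputs, and the wording of the returned actions are hypothetical and are not part of the 9471 WMM software. The source only says to check the air conditioning, so the "correct the problem" action for that step is an assumption.

```python
def next_clearance_action(other_alarms: bool, room_ac_ok: bool,
                          fans_ok: bool, fru_replaced: bool) -> str:
    """Return the next action for a temperature alarm (hypothetical helper)."""
    if other_alarms:
        # Step 1: other alarms, especially fan alarms, are handled first.
        return "Troubleshoot the other alarms first"
    if not room_ac_ok:
        # Step 2: assumed corrective action; the source only says to check.
        return "Correct the room air conditioning problem"
    if not fans_ok:
        # Step 3: faulty fan units are replaced.
        return "Replace the fan units"
    if not fru_replaced:
        # Step 4: fans are fine and no other alarm explains the condition.
        return "Replace the faulty FRU"
    # Step 5: everything above was tried and the problem persists.
    return "Contact Alcatel-Lucent Customer Support"


print(next_clearance_action(False, True, False, False))
```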

ATCA_FanSpeed
Description
This alarm indicates that a fan's speed has crossed a threshold. By default, the threshold
settings are as follows:

Lower Minor Threshold           not supported
Lower Major Threshold(RW)       492.000
Lower Critical Threshold        not supported
Upper Minor Threshold           not supported
Upper Major Threshold           not supported
Upper Critical Threshold        not supported
Positive Threshold Hysteresis   0.000
Negative Threshold Hysteresis   0.000

The additional text field of the alarm will indicate the fan and fan tray exhibiting the
behavior.
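
As a minimal sketch of how the single supported threshold above behaves, the following compares a fan speed reading against the default Lower Major Threshold. The function name is hypothetical, the unit of the reading is not stated in this document, and treating the crossing as "strictly below" is an assumption. Because both hysteresis values default to 0.000, no hysteresis handling is shown.

```python
from typing import Optional

# Default Lower Major Threshold from the table above (unit not stated in the source).
DEFAULT_LOWER_MAJOR = 492.0


def fan_speed_severity(reading: float,
                       lower_major: float = DEFAULT_LOWER_MAJOR) -> Optional[str]:
    """Return "MAJOR" when the reading is below the lower major threshold.

    Assumption: the threshold counts as crossed when the reading is strictly
    below it. Returns None when no threshold is crossed.
    """
    return "MAJOR" if reading < lower_major else None


print(fan_speed_severity(300.0))
print(fan_speed_severity(2500.0))
```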

Default severity
MAJOR

Root Cause
One of the chassis' fan units has failed.

Fault clearance procedure


...................................................................................................................................................................................................

1 Replace the faulty fan unit according to the appropriate replacement procedure.
END OF STEPS

ATCA_FanTrayPresence
Description
This alarm indicates that one of the fan trays is not present in the chassis. The fan tray in
question will be identified in the additionalText field of the alarm.

Default severity
MAJOR

Root Cause
Possible root causes:
The fan tray is not properly seated.
The fan tray has been removed.

Fault clearance procedure


...................................................................................................................................................................................................

1 Insert the fan tray, or reseat it if it is not properly seated.


END OF STEPS

ATCA_FanTraysFRU
Description
This alarm indicates a problem with the fan trays.
The state of the fan tray FRU information is present in the specific problem of the alarm
and is one of the following:

STATE_00   All Fan Trays are OK.
STATE_01   Fan Tray types are different, which is not an allowed configuration.
STATE_02   Cooling parameters for the Fan Trays are not compatible.
STATE_03   Cooling parameters for one or more of the Fan Trays are not valid.
STATE_04   One or more of the Fan Trays up/front are absent.
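
The state list above can be turned into a simple lookup when post-processing alarms. The sketch below is an illustration only: the function name is hypothetical, and it assumes the specific problem text contains the STATE_nn token somewhere in the string, which this document does not confirm.

```python
import re
from typing import Tuple

# Descriptions taken from the state list above.
FAN_TRAY_FRU_STATES = {
    "STATE_00": "All Fan Trays are OK.",
    "STATE_01": "Fan Tray types are different, which is not an allowed configuration.",
    "STATE_02": "Cooling parameters for the Fan Trays are not compatible.",
    "STATE_03": "Cooling parameters for one or more of the Fan Trays are not valid.",
    "STATE_04": "One or more of the Fan Trays up/front are absent.",
}


def describe_fan_tray_state(specific_problem: str) -> Tuple[bool, str]:
    """Return (is_fault, description) for the STATE_nn token in the text.

    Assumption: the token appears literally as "STATE_nn" in the field.
    """
    match = re.search(r"STATE_\d{2}", specific_problem)
    if match is None or match.group(0) not in FAN_TRAY_FRU_STATES:
        raise ValueError("no known fan tray FRU state in: %r" % specific_problem)
    state = match.group(0)
    return state != "STATE_00", FAN_TRAY_FRU_STATES[state]


print(describe_fan_tray_state("STATE_04"))
```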

Default severity
MAJOR

Root Cause
One of the following:
1. Fan Tray types are different and not allowed by the site configuration.
2. Cooling parameters for Fan Trays are not compatible.
3. Cooling parameters for Fan Trays are not valid.
4. Fan Tray is absent.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that all Fan Trays are properly seated in the chassis.
...................................................................................................................................................................................................

2 Verify that the Fan Tray types are compatible. If they are not, contact Alcatel-Lucent
Customer Support.

...................................................................................................................................................................................................

3 Verify that the cooling parameters are set correctly. Contact Alcatel-Lucent Customer
Support to adjust parameters.
END OF STEPS

ATCA_FilterPresence
Description
A filter is not present in the chassis. The additional text field of the alarm will indicate
which filter is not present.

Default severity
MINOR

Root Cause
A filter is not present in the chassis.

Fault clearance procedure


...................................................................................................................................................................................................

1 Insert the filter that is not present.


END OF STEPS

ATCA_I2CLocalBus
Description
This alarm reports an abnormal condition in the hardware state of the I2C Local Bus.
From the point of view of I2C Local Bus monitoring, the bus is divided into two parts:
internal channel: the I2C bus aboard the NBSHMC, in front of the I2C bus MUX
external channels: the I2C buses behind the I2C bus MUX. There are 4 external channels:
   channel 0: linked to ADT7462 of Fan Tray Up
   channel 1: linked to ADT7462 of Fan Tray Low
   channel 2: linked to Shelf EEPROM#1
   channel 3: linked to Shelf EEPROM#2
Possible states of the I2C Local Bus are:
STATE_00: OK
STATE_01: internal BUS NOK
STATE_02: external channel 0 NOK
STATE_03: external channel 1 NOK
STATE_04: external channel 2 NOK
STATE_05: external channel 3 NOK
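
Combining the channel list with the state list gives a direct mapping from a reported state to the bus segment and device involved. The sketch below shows that mapping; the dictionary and function names are hypothetical, and the pairing of each state with a device is inferred from the two lists above rather than stated explicitly in this document.

```python
from typing import Optional

# State -> failing segment, inferred from the channel and state lists above.
I2C_LOCAL_BUS_STATES = {
    "STATE_00": None,  # OK, nothing failing
    "STATE_01": "internal channel (I2C bus aboard the NBSHMC)",
    "STATE_02": "external channel 0 (ADT7462 of Fan Tray Up)",
    "STATE_03": "external channel 1 (ADT7462 of Fan Tray Low)",
    "STATE_04": "external channel 2 (Shelf EEPROM#1)",
    "STATE_05": "external channel 3 (Shelf EEPROM#2)",
}


def failing_i2c_segment(state: str) -> Optional[str]:
    """Return the failing bus segment for a state, or None for STATE_00."""
    if state not in I2C_LOCAL_BUS_STATES:
        raise ValueError("unknown I2C Local Bus state: %r" % state)
    return I2C_LOCAL_BUS_STATES[state]


print(failing_i2c_segment("STATE_03"))
```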

Default severity
MAJOR

Root Cause
I2C Local Bus sensor has detected a failure of the I2C bus.

Fault clearance procedure


...................................................................................................................................................................................................

1 If the condition does not clear, contact Alcatel-Lucent Customer Support.


END OF STEPS

ATCA_IPMBLink
Description
This alarm indicates a problem with the IPMB (Intelligent Platform Management Bus)
Link between the shelf manager and the board. This alarm may be reported by the shelf
manager for the portion of the link that it monitors, or by the board for the portion of the
link it monitors. The specific problem of the alarm will indicate the specific IPMB link
that is failing and the state of the link, which can be one of the following:
STATE_00: IPMB-A disabled, IPMB-B disabled
STATE_01: IPMB-A enabled, IPMB-B disabled
STATE_02: IPMB-A disabled, IPMB-B enabled
STATE_03: IPMB-A enabled, IPMB-B enabled
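
The four states listed above follow a two-bit pattern: the numeric part of the state reads as a bit field in which the low bit corresponds to IPMB-A and the next bit to IPMB-B. That reading is an observation from the list, not something this document states, and the function name below is hypothetical.

```python
from typing import Dict


def decode_ipmb_state(state: str) -> Dict[str, bool]:
    """Decode "STATE_nn" into the enabled/disabled status of each IPMB link.

    Assumption: bit 0 of nn is IPMB-A and bit 1 is IPMB-B, which matches
    the four states listed in the text.
    """
    code = int(state.split("_")[1])
    if not 0 <= code <= 3:
        raise ValueError("unknown IPMB link state: %r" % state)
    return {"IPMB-A": bool(code & 1), "IPMB-B": bool(code & 2)}


print(decode_ipmb_state("STATE_02"))
```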

Default severity
MINOR

Root Cause
Possible root causes:
Hardware failure.
The IPMB link has been manually put in a disabled state.

Fault clearance procedure


...................................................................................................................................................................................................

1 If the Link has been manually disabled, try to enable the link from the active shelf
manager card with the command, "clia setipmbstate <IPMB address> [AB] 1".
...................................................................................................................................................................................................

2 If the board is reporting a link failure, replace the board.


...................................................................................................................................................................................................

3 If the shelf is reporting a link failure, replace the shelf management card.
...................................................................................................................................................................................................

4 If replacing the board and the shelf management card does not solve the problem, replace
the shelf. Contact Alcatel-Lucent Customer Support.
END OF STEPS

ATCA_InletTemp
Description
This alarm indicates that the AMC (Advanced Mezzanine Card) Inlet Temp monitoring
sensor at the upper edge of the AMC has detected a threshold being crossed. The values
for the Inlet Temp sensor thresholds can be retrieved from the Shelf Management Card
and have the following format:

Lower Minor Threshold           <value1>
Lower Major Threshold           <value1>
Lower Critical Threshold        <value1>
Upper Minor Threshold           <value1>
Upper Major Threshold           <value1>
Upper Critical Threshold        <value1>
Positive Threshold Hysteresis   <value1>
Negative Threshold Hysteresis   <value1>

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the AMC has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

2 Check that the room air conditioning system is operating properly.


...................................................................................................................................................................................................

3 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

4 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

5 If the problem persists, contact Alcatel-Lucent Customer Support.


END OF STEPS

ATCA_LM75Temperature
Description
This alarm indicates a temperature problem with a board.
The shelf management card has one LM75 sensor that monitors the temperature of the top
of the board (LM75 Temp. Up) and another that monitors the temperature of the bottom
of the board (LM75 Temp. Down). The default thresholds are as follows:

Lower Minor Threshold(RW)       -56.000
Lower Major Threshold(RW)       -56.000
Lower Critical Threshold(RW)    -56.000
Upper Minor Threshold(RW)       50.000
Upper Major Threshold(RW)       70.000
Upper Critical Threshold(RW)    70.000
Positive Threshold Hysteresis   not supported
Negative Threshold Hysteresis   not supported

The non-shelf management cards have an LM75 temperature sensor (LM75 Local Temp)
that monitors the temperature of the rear side of the board. The default thresholds are as
follows:

Lower Minor Threshold           not supported
Lower Major Threshold           not supported
Lower Critical Threshold        not supported
Upper Minor Threshold(RW)       60.000
Upper Major Threshold(RW)       70.000
Upper Critical Threshold(RW)    90.000
Positive Threshold Hysteresis   2.000
Negative Threshold Hysteresis   2.000
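
To illustrate how upper thresholds and hysteresis interact, the sketch below tracks alarm severity for the LM75 Local Temp defaults listed above (60/70/90 with a hysteresis of 2). It is an assumption-laden illustration, not the shelf manager's implementation: it assumes a level is raised when the reading reaches its threshold and is cleared only once the reading falls below the threshold minus the hysteresis. The "CLEARED" state and the function name are hypothetical.

```python
# Severity ranks in increasing order; "CLEARED" is a hypothetical no-alarm state.
SEVERITIES = ["CLEARED", "MINOR", "MAJOR", "CRITICAL"]

# Default upper thresholds and hysteresis for LM75 Local Temp (from the table above).
UPPER_THRESHOLDS = {"MINOR": 60.0, "MAJOR": 70.0, "CRITICAL": 90.0}
HYSTERESIS = 2.0


def update_severity(previous: str, reading: float) -> str:
    """Return the new severity given the previous severity and a new reading."""
    previous_rank = SEVERITIES.index(previous)
    current = "CLEARED"
    for rank, name in enumerate(SEVERITIES[1:], start=1):
        limit = UPPER_THRESHOLDS[name]
        if rank <= previous_rank:
            # A level that is already raised stays raised until the reading
            # drops below (threshold - hysteresis). This is an assumption.
            limit -= HYSTERESIS
        if reading >= limit:
            current = name
    return current


severity = "CLEARED"
for temperature in (55.0, 61.0, 58.5, 57.0):
    severity = update_severity(severity, temperature)
    print(temperature, severity)
```

With these assumptions, a reading of 58.5 keeps an existing MINOR alarm raised because it is still above 60 minus 2, while a reading of 57.0 clears it.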

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the board has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 The hardware can erroneously report this alarm with minor severity. Ignore any minor
alarms pertaining to this sensor.
...................................................................................................................................................................................................

2 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

3 Check that the room air conditioning system is operating properly.


...................................................................................................................................................................................................

4 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

5 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

6 If the problem persists, contact Alcatel-Lucent Customer Support.


END OF STEPS

ATCA_LM83Temperature
Description
This alarm indicates a temperature problem with a board. There are 5 LM83 sensors
(LM83_1 Local, LM83_1 DBG, LM83_1 BASE, LM83_1 LSI, LM83_2 Local) that monitor
the temperature of the board. The default thresholds are as follows:

Lower Minor Threshold           not supported
Lower Major Threshold           not supported
Lower Critical Threshold        not supported
Upper Minor Threshold(RW)       60.000
Upper Major Threshold(RW)       70.000
Upper Critical Threshold(RW)    90.000
Positive Threshold Hysteresis   2.000
Negative Threshold Hysteresis   2.000

Lower Minor Threshold           not supported
Lower Major Threshold           not supported
Lower Critical Threshold        not supported
Upper Minor Threshold(RW)       90.000
Upper Major Threshold(RW)       100.000
Upper Critical Threshold(RW)    110.000
Positive Threshold Hysteresis   2.000
Negative Threshold Hysteresis   2.000

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the board has crossed an alarmable threshold.
The room or chassis air conditioning unit is defective.
The shelf manager board is defective and overheating.

Fault clearance procedure
...................................................................................................................................................................................................

1 The hardware can erroneously report this alarm with minor severity. Ignore any minor
alarms pertaining to this sensor.
...................................................................................................................................................................................................

2 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

3 Check that the room air conditioning system is operating properly.


...................................................................................................................................................................................................

4 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

5 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

6 If the problem persists, contact Alcatel-Lucent Customer Support.


END OF STEPS

ATCA_LMeUC75Temperature
Description
This alarm indicates that the ASS7NB AMC (Advanced Mezzanine Card) temperature
monitoring sensor has detected a threshold being crossed. By default, the thresholds are
as follows:

Lower Minor Threshold           not supported
Lower Major Threshold           not supported
Lower Critical Threshold        not supported
Upper Minor Threshold(RW)       40.000
Upper Major Threshold(RW)       60.000
Upper Critical Threshold(RW)    70.000
Positive Threshold Hysteresis   2.000
Negative Threshold Hysteresis   2.000

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the AMC SS7 has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 The hardware can erroneously report this alarm with minor severity. Ignore any minor
alarms pertaining to this sensor.
...................................................................................................................................................................................................

2 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

3 Check that the room air conditioning system is operating properly.


...................................................................................................................................................................................................

4 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

5 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

6 If the problem persists, contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


ATCA_LMeUC75Top-Rig
Description
This alarm indicates that the ASS7BN AMC (Advanced Mezzanine Card) temperature
monitoring sensor has detected a threshold being crossed. By default, the thresholds are
as follows:

Lower Minor Threshold             not supported
Lower Major Threshold             not supported
Lower Critical Threshold          not supported
Upper Minor Threshold(RW)         40.000
Upper Major Threshold(RW)         60.000
Upper Critical Threshold(RW)      70.000
Positive Threshold Hysteresis     2.000
Negative Threshold Hysteresis     2.000
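
As an illustration only, the following Python sketch shows how the default upper thresholds and the positive hysteresis listed above could map a sequence of sensor readings to a severity. The function name `next_severity` and the exact comparison rules are assumptions made for this sketch: it assumes a severity is raised when the reading reaches the threshold and is cleared only after the reading falls below the threshold minus the hysteresis. The actual behavior of the shelf manager may differ.

```python
from typing import Optional

# Default upper thresholds and hysteresis copied from the table above.
UPPER_THRESHOLDS = [("CRITICAL", 70.0), ("MAJOR", 60.0), ("MINOR", 40.0)]
POSITIVE_HYSTERESIS = 2.0

SEVERITY_RANK = {None: 0, "MINOR": 1, "MAJOR": 2, "CRITICAL": 3}


def next_severity(reading: float, current: Optional[str]) -> Optional[str]:
    """Return the severity after a new reading, given the current severity.

    Assumption (not stated in this document): a severity is raised when the
    reading reaches its threshold, and it is held until the reading falls
    below threshold - POSITIVE_HYSTERESIS.
    """
    for severity, threshold in UPPER_THRESHOLDS:  # highest severity first
        if reading >= threshold:
            return severity
        # The severity is already asserted and the reading is still inside
        # its hysteresis band, so keep it.
        if (SEVERITY_RANK[current] >= SEVERITY_RANK[severity]
                and reading >= threshold - POSITIVE_HYSTERESIS):
            return severity
    return None


if __name__ == "__main__":
    severity = None
    for reading in (39.0, 41.0, 38.5, 37.5, 61.0, 59.0, 57.0):
        severity = next_severity(reading, severity)
        print(reading, severity)
```

With these rules, a reading that hovers around a threshold does not cause the alarm to be raised and cleared repeatedly.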

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the AMC SS7 has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

2 Check that the room air conditioning system is operating properly.

...................................................................................................................................................................................................

3 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

4 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

5 If the problem persists, contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


ATCA_LocalTemperature
Description
This alarm indicates a temperature problem with a board.

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the board has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

2 Check that the room air conditioning system is operating properly.


...................................................................................................................................................................................................

3 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

4 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

5 If the problem persists, contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


ATCA_MMCTemp
Description
This alarm indicates that the DCI AMC (Advanced Mezzanine Card) MMC Temp
monitoring sensor has detected a threshold being crossed. This indicates there is a
problem with the die temperature of the MMC FPGA. The values for the MMC Temp
sensor thresholds can be retrieved from the Shelf Management Card and have the
following format:

Lower Minor Threshold             <value1>
Lower Major Threshold             <value1>
Lower Critical Threshold          <value1>
Upper Minor Threshold             <value1>
Upper Major Threshold             <value1>
Upper Critical Threshold(RW)      <value1>
Positive Threshold Hysteresis     <value1>
Negative Threshold Hysteresis     <value1>

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the AMC has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

2 Check that the room air conditioning system is operating properly.

...................................................................................................................................................................................................

3 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

4 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

5 If the problem persists, contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


ATCA_OcteonTemperature
Description
This alarm indicates a temperature problem with the Octeon module.

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the board has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

2 Check that the room air conditioning system is operating properly.


...................................................................................................................................................................................................

3 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

4 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

5 If the problem persists, contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


ATCA_OutletTemp
Description
This alarm indicates that the AMC (Advanced Mezzanine Card) Outlet Temp monitoring
sensor at the lower edge of the AMC has detected a threshold being crossed. The values
for the Outlet Temp sensor thresholds can be retrieved from the Shelf Management Card
and have the following format:

Lower Minor Threshold             <value1>
Lower Major Threshold             <value1>
Lower Critical Threshold          <value1>
Upper Minor Threshold             <value1>
Upper Major Threshold             <value1>
Upper Critical Threshold(RW)      <value1>
Positive Threshold Hysteresis     <value1>
Negative Threshold Hysteresis     <value1>

Default severity
MINOR, MAJOR, CRITICAL

Root Cause
Possible root causes:
Temperature of the AMC has crossed a threshold.
The room or chassis air conditioning unit is defective.
The board is defective and overheating.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check if there are other alarms that could explain the rise in temperature, especially fan
alarms. If there are, troubleshoot these alarms first.
...................................................................................................................................................................................................

2 Check that the room air conditioning system is operating properly.


...................................................................................................................................................................................................

3 Check that the fan units of the suspect chassis are operating correctly. If they are not,
replace the fan units according to the replacement procedure.
...................................................................................................................................................................................................

4 If the fans are operating properly and there is no other alarm, replace the faulty FRU
according to the appropriate replacement procedure.
...................................................................................................................................................................................................

5 If the problem persists, contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


ATCA_PayloadCurrent
Description
This alarm indicates a current problem on a board, resulting from the Payload Amps
sensor threshold being crossed.
The values for the Payload Amps sensor thresholds can be retrieved from the Shelf
Management Card, and have the following format:

Lower Minor Threshold(RW)         <value1>
Lower Major Threshold(RW)         <value2>
Lower Critical Threshold(RW)      <value3>
Upper Minor Threshold(RW)         <value4>
Upper Major Threshold(RW)         <value5>
Upper Critical Threshold(RW)      <value6>

Default severity
CRITICAL, MAJOR, MINOR

Root Cause
Possible root causes:
The card may have a current problem.
The power supply unit may have a problem.
The thresholds for the sensors may be incorrectly set.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check if other cards in the chassis have a similar alarm. If this is the case, there may be a
problem with the power supply unit(s).
...................................................................................................................................................................................................

2 Replace the faulty card according to the appropriate replacement procedure.


...................................................................................................................................................................................................

3 Replace the interface unit located behind the faulty card according to the appropriate
replacement procedure.

...................................................................................................................................................................................................

4 If the problem persists, contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


ATCA_PayloadVoltage
Description
This alarm indicates a voltage problem with a board.
There may be several voltage sensors present on each board (e.g. 5V, 3.3V, 12V), any of
which may be reporting a voltage problem. The Specific Problem field in the alarm will
indicate which sensor is reporting the problem. The threshold values may be retrieved
from the Shelf Management Card and have the following format:

Lower Minor Threshold(RW)         <value1>
Lower Major Threshold(RW)         <value2>
Lower Critical Threshold(RW)      <value3>
Upper Minor Threshold(RW)         <value4>
Upper Major Threshold(RW)         <value5>
Upper Critical Threshold(RW)      <value6>
Positive Threshold Hysteresis     <value7>
Negative Threshold Hysteresis     <value8>
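
The following Python sketch is one possible way to picture how a voltage reading relates to the six thresholds in the format above. It is illustrative only: the `Thresholds` class, the `classify` function, and the example values for a 12V sensor are hypothetical and are not taken from the Shelf Management Card. Hysteresis is ignored for simplicity.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Thresholds:
    """The six threshold values, in the same order as the table above."""
    lower_minor: float
    lower_major: float
    lower_critical: float
    upper_minor: float
    upper_major: float
    upper_critical: float


def classify(reading: float, t: Thresholds) -> Optional[str]:
    """Return the severity for a reading, or None if it is within range.

    Assumption: a threshold counts as crossed when the reading reaches it.
    """
    if reading <= t.lower_critical or reading >= t.upper_critical:
        return "CRITICAL"
    if reading <= t.lower_major or reading >= t.upper_major:
        return "MAJOR"
    if reading <= t.lower_minor or reading >= t.upper_minor:
        return "MINOR"
    return None


# Hypothetical thresholds for a 12V sensor; real values must be retrieved
# from the Shelf Management Card.
EXAMPLE_12V = Thresholds(
    lower_minor=11.4, lower_major=11.1, lower_critical=10.8,
    upper_minor=12.6, upper_major=12.9, upper_critical=13.2,
)

if __name__ == "__main__":
    for volts in (12.0, 12.7, 11.0, 13.5):
        print(volts, classify(volts, EXAMPLE_12V))
```

If a sensor reports an alarm while its reading looks normal, comparing the reading against the retrieved thresholds in this way can help show whether the thresholds are incorrectly set.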

Default severity
CRITICAL, MAJOR, MINOR

Root Cause
Possible root causes:
The card may have a voltage problem.
The power supply unit may have a problem.
The thresholds for the sensors may be incorrectly set.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check if all of the cards in the chassis have the same alarm. If this is the case, replace the
power supply unit(s) according to the appropriate replacement procedure.
...................................................................................................................................................................................................

2 Replace the faulty card according to the appropriate replacement procedure.

...................................................................................................................................................................................................

3 Replace the interface unit located behind the faulty card according to the appropriate
replacement procedure.
...................................................................................................................................................................................................

4 If the problem persists, contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


ATCA_PowerOk
Description
This alarm indicates the state of the power OK (POWEROK) signal from the ISPPAC.

Default severity
CRITICAL

Root Cause
The POWEROK signal indicates that all voltages of the SPM are OK, including
D1V8_DIMM1, D1V8_DIMM2 and D1V8_DIMM3.
Any voltage issue on the SPM card may lead to this alarm.

Fault clearance procedure


...................................................................................................................................................................................................

1 Verify that the blade is powered on.


...................................................................................................................................................................................................

2 Verify that the blade is seated correctly in the chassis. Try to re-seat the blade in the
chassis.
...................................................................................................................................................................................................

3 If necessary, replace the blade according to the FRU replacement procedure. Contact
Alcatel-Lucent Customer Support.
...................................................................................................................................................................................................
END OF STEPS


ATCA_ShelfFRUs
Description
This alarm indicates a problem with the shelf FRU information stored in the EEPROMs
(located on the NFATCAV2 back panel and accessed via the I2C local bus). The
EEPROM contents are validated when a shelf manager is initialized as the active shelf
manager, and periodically by the active shelf manager. The state of the shelf FRU
information is present in the specific problem of the alarm and is one of the following:

STATE_00: SHELF_FRUS_STATE_OK          No problems.
STATE_01: SHELF_FRUS_STATE_INIT_0      The data is valid in both EEPROMs, but the
                                       contents are not equal. Depending on the
                                       configuration of EXIT_IF_NO_SHELF_FRU
                                       (set to FALSE), the shelf manager may start
                                       running in a non-working state, but no IPMCs
                                       in the shelf can be powered up.
STATE_02: SHELF_FRUS_STATE_INIT_1      The data is invalid in one EEPROM. The
                                       contents of the valid EEPROM are used to
                                       initialize, and the invalid EEPROM is updated
                                       to match the valid EEPROM.
STATE_03: SHELF_FRUS_STATE_FRU1_INV    The data is invalid in EEPROM1. The
                                       contents of the valid EEPROM are used to
                                       initialize, and the invalid EEPROM is updated
                                       to match the valid EEPROM.
STATE_04: SHELF_FRUS_STATE_FRU2_INV    The data is invalid in EEPROM2. The
                                       contents of the valid EEPROM are used to
                                       initialize, and the invalid EEPROM is updated
                                       to match the valid EEPROM.
STATE_05: SHELF_FRUS_STATE_FRU12_INV   The data is invalid in both EEPROMs.
STATE_06: SHELF_FRUS_STATE_FRU12_DIF   The data is valid in both EEPROMs, but they
                                       are different.
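
For operators who post-process alarm logs, the state list above can be captured in a small lookup table. The Python sketch below is illustrative only. It assumes that the specific problem text contains the `STATE_NN` token exactly as shown in the list; the real layout of the specific problem field is not described here, and the function name `describe_shelf_fru_state` is hypothetical.

```python
import re
from typing import Optional, Tuple

# State number -> (state name, short summary), taken from the list above.
SHELF_FRU_STATES = {
    0: ("SHELF_FRUS_STATE_OK", "No problems."),
    1: ("SHELF_FRUS_STATE_INIT_0",
        "Data valid in both EEPROMs, but contents are not equal."),
    2: ("SHELF_FRUS_STATE_INIT_1",
        "Data invalid in one EEPROM; it is updated from the valid EEPROM."),
    3: ("SHELF_FRUS_STATE_FRU1_INV",
        "Data invalid in EEPROM1; it is updated from the valid EEPROM."),
    4: ("SHELF_FRUS_STATE_FRU2_INV",
        "Data invalid in EEPROM2; it is updated from the valid EEPROM."),
    5: ("SHELF_FRUS_STATE_FRU12_INV", "Data invalid in both EEPROMs."),
    6: ("SHELF_FRUS_STATE_FRU12_DIF",
        "Data valid in both EEPROMs, but they are different."),
}

# Assumed token format: "STATE_" followed by exactly two digits.
_STATE_TOKEN = re.compile(r"STATE_(\d{2})\b")


def describe_shelf_fru_state(specific_problem: str) -> Optional[Tuple[str, str]]:
    """Return (state name, summary) for the first STATE_NN token found.

    Returns None if no token is present or the state number is not listed.
    """
    match = _STATE_TOKEN.search(specific_problem)
    if match is None:
        return None
    return SHELF_FRU_STATES.get(int(match.group(1)))


if __name__ == "__main__":
    print(describe_shelf_fru_state("STATE_03: SHELF_FRUS_STATE_FRU1_INV"))
```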

Default severity
CRITICAL, MINOR

Root Cause
The shelf FRU EEPROM data has been corrupted.

Fault clearance procedure
...................................................................................................................................................................................................

1 A firmware upgrade may be needed. Contact Alcatel-Lucent Customer Support.


...................................................................................................................................................................................................
END OF STEPS


ATCA_UnexpectedDeact
Description
This sensor reports the origin of an unexpected deactivation (transition to the INACTIVE
state). It is asserted upon transition to the INACTIVE state and de-asserted upon transition
to any other state.

000    none (in de-assertion event only).
100    power failure.
200    temperature protection mechanism requested power off.
400    local CPU requested deactivation.
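
As a convenience when reading alarm logs, the origin values above can be kept in a lookup table. The Python sketch below is illustrative only; the function name `describe_origin` is hypothetical. The values are treated as opaque strings, because this document does not state whether they can be combined.

```python
# Origin value -> meaning, copied from the list above.
DEACTIVATION_ORIGINS = {
    "000": "none (in de-assertion event only)",
    "100": "power failure",
    "200": "temperature protection mechanism requested power off",
    "400": "local CPU requested deactivation",
}


def describe_origin(code: str) -> str:
    """Return the meaning of an origin value, or a note if it is not listed."""
    key = code.strip()
    return DEACTIVATION_ORIGINS.get(key, f"unknown origin value {key!r}")


if __name__ == "__main__":
    for code in ("100", "200", "300"):
        print(code, "->", describe_origin(code))
```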

Default severity
CRITICAL

Root Cause
Additional information should be available to explain what caused the deactivation.
Voltage and temperature problems are two possibilities.

Fault clearance procedure


...................................................................................................................................................................................................

1 Check the sensor alarms to determine why the card was deactivated, and resolve the
underlying problems.
E...................................................................................................................................................................................................
N D O F S T E P S

ATCA_m48vSensor
Description
This alarm indicates a problem with the -48V shelf power supply A and/or B feeds.

Default severity
MINOR, MAJOR

Root Cause
Possible root causes:
Circuit breakers tripped for power supply.
Power feed lost to chassis.

Fault clearance procedure

1 Contact the local power team.

2 Check the LED and circuit breakers of the top rack power distribution unit (PDU). If any
circuit breakers are tripped, reset them.

3 If the alarm severity is MINOR, the power level has dropped to between -48V and -41V,
which may indicate that the system is running on battery backup.

4 If both the local and remote A sensors report MAJOR alarms, the problem is in the A power
cable feeding the PDU.

5 If both the local and remote B sensors report MAJOR alarms, the problem is in the B power
cable feeding the PDU.

6 The alarm should clear once the problem is rectified. If it does not, contact
Alcatel-Lucent Customer Support.

END OF STEPS
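
The severity rules in the procedure above can be sketched as a small helper for use in
operator scripts. This is an illustration under stated assumptions, not product code: the
function name diagnose_m48v and the input layout (a dictionary keyed by feed and sensor
location) are hypothetical. Only the rules stated in the procedure are encoded; any other
combination of severities produces no finding.

```python
def diagnose_m48v(alarms):
    """Apply the documented -48V rules to the current sensor severities.

    alarms maps (feed, location) to a severity string or None, where feed is
    "A" or "B" and location is "local" or "remote" (hypothetical layout).
    """
    findings = []
    for feed in ("A", "B"):
        local = alarms.get((feed, "local"))
        remote = alarms.get((feed, "remote"))
        if local == "MAJOR" and remote == "MAJOR":
            # Both sensors of the same feed report MAJOR: cable feeding the PDU.
            findings.append(f"Feed {feed}: suspect the {feed} power cable feeding the PDU.")
        elif "MINOR" in (local, remote):
            # MINOR: level between -48V and -41V, possibly battery backup.
            findings.append(f"Feed {feed}: level between -48V and -41V; "
                            "the system may be running on battery backup.")
    return findings


if __name__ == "__main__":
    sample = {("A", "local"): "MAJOR", ("A", "remote"): "MAJOR", ("B", "local"): "MINOR"}
    for finding in diagnose_m48v(sample):
        print(finding)
```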

LSS_cardConnectionLost
Description
REM detected a problem with its connectivity to a service member under its control, or a
service member has missed a heartbeat to REM.

Default severity
CRITICAL, MAJOR

Root Cause
Possible causes of this alarm are:
1. The service member has undergone an initialization or reconfiguration due to an
automatic action.
2. The communication path from REM to the service member has been lost or
interrupted.
3. REM has detected a loss of a heartbeat from the service member.

Fault clearance procedure

1 Verify the status of the service on the MI GUI. It should be "Out of Service". If the
service is active, is standby hot with its mate active, or is manually out-of-service
(unlocked/disabled/idle), then the alarm condition is not valid.

2 If communication to the service does not come back within several minutes (for example,
the cardConnectionLost alarm does not clear), it may be necessary to connect to the card's
console port to get the status of the service. Consult the card-specific documentation for
the console commands that report the card service state.
If you cannot connect to the console, the cause could be either a networking problem or a
fault in the card. If the card is inaccessible via the console, it can be recovered with
the reset button or by power cycling. Continued trouble may mean that the card has a
hardware problem; contact Alcatel-Lucent Customer Support to determine the next steps.

3 Try to ping the internal fixed service IP address of the service member from the host
that is running the active CNFG service. If the ping succeeds, go to Step 4; otherwise, go
to Step 5.

4 Determine whether REM has a connection to the service member by running the netstat
command on the host that has the active CNFG service. The following command lists the IP
addresses that REM has connected to via well-known port 20000:
netstat -a | grep 20000
Look for an "Established" connection to the service's IP address in the output of the
command. If the service's IP address is not found in the output and this is the first time
you have visited this step, go to Step 6. If the service's IP address is not found in the
output and this is the second time you have visited this step, go to Step 7.

5 Check the IP connections from the host that has the active CNFG service member to the
switches and the routers. Check the connection to the card. Fix any connection problems
that are found. Also verify that the appropriate service IP addresses have been plumbed
and that the appropriate service image has been downloaded to the card.

6 Try switching the CNFG service to its current standby hot member via the MI GUI.

7 Stop and start the CNFG service with the stopCNFG and startCNFG commands,
respectively. This stops and restarts the REM process, among others within the CNFG
service. Once the CNFG service is active, the virtual cluster can be switched back.
Note that error recovery and provisioning ability are affected if the CNFG service is not
operational.

8 Restart/reload the service. This may be done via the MI GUI.

END OF STEPS

LSS_cardError
Description
This alarm indicates that a hardware diagnostic failure has been detected. Depending on
the criticality of the checks, alarms with various severities are generated.

Default severity
CRITICAL, MAJOR, MINOR

Root Cause
List of root causes:
The Field Replaceable Unit (FRU) programmed data has been corrupted or was not
programmed correctly in the factory. (Minor alarm)
The hardware diagnostic results or FRU data were not retrievable from the hardware.
(Major alarm)
The hardware of the card for this host has reported a Built-in Self Test (BIST) or
Power-on Self Test (POST) diagnostic failure. (Critical alarm)

Fault clearance procedure

1 For the Critical alarm, take the card out of service (OOS) and replace it.

2 For the Major alarm, take the card OOS and reboot it to see whether the alarm clears. If
the alarm does not clear, or if there are other reports from the card (such as Asserts)
reporting problems, leave the card OOS and contact Alcatel-Lucent Customer Support.

3 For the Minor alarm, contact Alcatel-Lucent Customer Support for the correction
procedure.

END OF STEPS

LSS_cpiAlrmCritical
Description
The LSS_cpiAlrmCritical alarm indicates that the value of the VS.alrmCritical measurement,
monitored by the Critical Alarms Count CPI, exceeded a threshold in the last 15-minute
interval.
This Performance Measurement (PM) counts the number of critical alarms issued by the
reporting resource.
Notes:
The thresholds are configurable on the FSGUI CPI window.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in a subsequent interval.
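
The raise-once and clear behavior described in the notes can be pictured with a small
state tracker. This is a sketch for illustration, not the product's implementation: the
class name CpiAlarmTracker, its method, and the component name used in the example are
hypothetical. It also assumes that a change of severity for the same CPI and component
counts as a new raise, which this section does not state.

```python
class CpiAlarmTracker:
    """Tracks which (CPI, component) pairs currently have a raised alarm."""

    def __init__(self):
        self._raised = {}  # (cpi, component) -> severity currently raised

    def evaluate(self, cpi, component, severity):
        """Process the result of one 15-minute interval.

        severity is the severity whose threshold was met in the interval, or
        None if no threshold was met. Returns "raise", "clear", or None.
        """
        key = (cpi, component)
        current = self._raised.get(key)
        if severity is None:
            if current is None:
                return None  # nothing raised, nothing to clear
            del self._raised[key]
            return "clear"
        if severity == current:
            return None  # same severity already raised: do not raise again
        self._raised[key] = severity  # assumption: severity change is a new raise
        return "raise"


if __name__ == "__main__":
    tracker = CpiAlarmTracker()
    for result in ("CRITICAL", "CRITICAL", None):
        print(tracker.evaluate("cpiAlrmCritical", "host1", result))
```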

Default severity
CRITICAL

Severity Details
The alarm severity is determined by the threshold settings. The default alarm severity
follows the criteria below:
Critical Alarm: CPI value > 10

Root Cause
This is a summary alarm to bring more attention to the other critical alarms being raised.

Fault clearance procedure

1 Using the Maintenance Interface, examine the set of critical alarms or any other alarms
recently raised and address them.

2 This alarm clears automatically if the rate of critical alarm generation drops below the
threshold.

END OF STEPS

LSS_cpiAlrmMajor
Description
The LSS_cpiAlrmMajor alarm indicates that the value of the VS.alrmMajor measurement,
monitored by the Major Alarms Count CPI, exceeded a threshold in the last 15-minute
interval.
This Performance Measurement (PM) counts the number of major alarms issued by the
reporting resource.
Notes:
The thresholds are configurable on the FSGUI CPI window.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in a subsequent interval.

Default severity
CRITICAL

Severity Details
The alarm severity is determined by the threshold settings. The default alarm severity
follows the criteria below:
Critical Alarm: CPI value > 15

Root Cause
This is a summary alarm to bring more attention to the other major alarms being raised.

Fault clearance procedure

1 Using the Maintenance Interface, examine the set of major alarms or any other alarms
recently raised and address them.

2 This alarm clears automatically if the rate of major alarm generation drops below the
threshold.

END OF STEPS

LSS_cpiAlrmMinor
Description
The LSS_cpiAlrmMinor alarm indicates that the value of the VS.alrmMinor measurement,
monitored by the Minor Alarms Count CPI, exceeded a threshold in the last 15-minute
interval.
This Performance Measurement (PM) counts the number of minor alarms issued by the
reporting resource.
Notes:
The thresholds are configurable on the FSGUI CPI window.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR

Severity Details
The alarm severity is determined by the threshold settings. The default alarm severity
follows the criteria below:
Critical Alarm: CPI value > 30
Major Alarm: 20 < CPI value <= 30

Root Cause
This is a summary alarm to bring more attention to the other minor alarms being raised.

Fault clearance procedure

1 Using the Maintenance Interface, examine the set of minor alarms or any other alarms
recently raised and address them.

2 This alarm clears automatically if the rate of minor alarm generation drops below the
threshold.

END OF STEPS

LSS_cpiAlrmWarning
Description
The LSS_cpiAlrmWarning alarm indicates that the value of the VS.alrmWarning measurement,
monitored by the Warning Alarms Count CPI, exceeded a threshold in the last 15-minute
interval.
This Performance Measurement (PM) counts the number of warning alarms issued by the
reporting resource.
Notes:
The thresholds are configurable on the FSGUI CPI window.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in a subsequent interval.

Default severity
CRITICAL, MAJOR, MINOR

Severity Details
The alarm severity is determined by the threshold settings. The default alarm severity
follows the criteria below:
Critical Alarm: CPI value > 50
Major Alarm: 25 < CPI value <= 50
Minor Alarm: 15 < CPI value <= 25
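
The default criteria above amount to a set of exclusive lower bounds checked from the most
severe to the least severe. The sketch below illustrates this with the default values
listed for this alarm; the names WARNING_CPI_THRESHOLDS and classify_cpi are hypothetical,
and a deployed system may use different thresholds because they are configurable.

```python
# Default thresholds for LSS_cpiAlrmWarning as listed in Severity Details:
# (severity, exclusive lower bound), ordered from most to least severe.
WARNING_CPI_THRESHOLDS = [("CRITICAL", 50), ("MAJOR", 25), ("MINOR", 15)]


def classify_cpi(value, thresholds=WARNING_CPI_THRESHOLDS):
    """Return the severity for a CPI value, or None if no threshold is exceeded."""
    for severity, lower_bound in thresholds:
        if value > lower_bound:
            return severity
    return None


if __name__ == "__main__":
    for count in (10, 16, 25, 26, 50, 51):
        print(count, classify_cpi(count))
```

The same function can be reused for the other CPI alarms by passing their own threshold
list, for example [("CRITICAL", 10)] for the default LSS_cpiAlrmCritical criterion.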

Root Cause
This is a summary alarm to bring more attention to the other warning alarms being raised.

Fault clearance procedure

1 Using the Maintenance Interface, examine the set of warning alarms or any other alarms
recently raised and address them.

2 This alarm clears automatically if the rate of warning alarm generation drops below the
threshold.

END OF STEPS

LSS_cpiAsrtEsc
Description
The LSS_cpiAsrtEsc alarm indicates that the value of the VS.asrtESC measurement,
monitored by the Escalating Asserts CPI, exceeded a threshold in the last 15-minute
interval.
In software, defensive checks are placed to ensure expected inputs or boundary conditions
within routines are met. If such a check fails, an assert report is logged containing
information for the code author to debug the problem. The assert messages themselves are
not of much value to the operator.
When the problem is serious enough, such as being associated with a critical resource, an
assert that can result in escalation is used. Such asserts are tied to a leaky bucket
mechanism that measures the rate of these events. If the specified thresholds are reached,
a task restart is attempted. If that level of escalation continues to fail, then a process
initialization is done which leads to a switch-over of the service. Immediate escalation is
possible.
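
As an illustration of the leaky bucket idea mentioned above, the following sketch counts
events in a bucket that drains at a fixed rate and signals when the level exceeds a
capacity. It is a generic example of the technique, not the product's implementation: the
class name, the capacity, and the leak rate are hypothetical, and the actual thresholds
and escalation handling are not specified in this section.

```python
class LeakyBucket:
    """Generic leaky bucket: each event adds 1; the level drains over time."""

    def __init__(self, capacity, leak_per_second):
        self.capacity = capacity                # hypothetical threshold
        self.leak_per_second = leak_per_second  # hypothetical drain rate
        self.level = 0.0
        self.last_time = None

    def add_event(self, now):
        """Record one assert at time now (seconds); return True on overflow."""
        if self.last_time is not None:
            elapsed = now - self.last_time
            self.level = max(0.0, self.level - elapsed * self.leak_per_second)
        self.last_time = now
        self.level += 1.0
        if self.level > self.capacity:
            self.level = 0.0  # start over after signalling the overflow
            return True
        return False


if __name__ == "__main__":
    bucket = LeakyBucket(capacity=3, leak_per_second=0.1)
    # Four asserts one second apart fill the bucket faster than it drains.
    print([bucket.add_event(t) for t in (0, 1, 2, 3)])
```

Isolated events drain away before the next one arrives, so only a sustained rate of
asserts causes an overflow, which is the point at which an escalation step such as a task
restart would be attempted.
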
Notes:
The thresholds are configurable on the FSGUI CPI window.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in a subsequent interval.

Default severity
MAJOR, MINOR, WARNING

Severity Details
The alarm severity is determined by the threshold settings. The default alarm severity
follows the criteria below:
Major Alarm: CPI value > 20
Minor Alarm: 10 < CPI value <= 20
Warning Alarm: 5 < CPI value <= 10

Root Cause
In this case, the cause of the problem is non-specific and is dependent upon the function
of the software generating the assert and the defensive check it is performing.

Fault clearance procedure

1 This alarm clears automatically if the rate of escalating assert generation drops below
the threshold. An automatic escalation would result in a switch-over, which should also
reduce the rate of assert generation.

2 Determine whether any other alarms have been recently raised on the reported resource
and address them.

3 Examine the recent Performance Measurement (PM) counts on the reported resource; they
may reveal more about this issue.

4 If a provisioning or configuration change was executed just before the alarm was raised,
consider whether the change is causing the problem.

5 If a Software Update (SU) or Patch is being soaked, this could indicate a problem with
the delivered software; contact Alcatel-Lucent Customer Support immediately.

6 If the PM counts indicate degradation of service and a switch-over has not occurred,
switch the service to its redundant mate.

7 If the situation persists after a switch-over, make sure that the prior active host of
the service was removed from service and restored completely. Then attempt another
switch-over to the original active host for the service.

8 In all cases, contact Alcatel-Lucent Customer Support regarding this alarm.

END OF STEPS

LSS_cpiAsrtNonEsc
Description
The LSS_cpiAsrtNonEsc alarm indicates that the value of the VS.asrtNonESC measurement,
monitored by the Non-Escalating Asserts CPI, exceeded a threshold in the last 15-minute
interval.
In software, defensive checks are placed to ensure expected inputs or boundary conditions
within routines are met. If such a check fails, an assert report is logged containing
information for the code author to debug the problem. The assert messages themselves are
not of much value to the operator. Most of the time asserts are isolated events; however,
they can begin to accumulate, in which case a more serious failure may be occurring, and
this alarm draws attention to it.
Defensive checks are performed to prevent more serious outages if possible.
These asserts do not invoke any automatic recovery through escalation. A different assert
type is used for that and is monitored by the CPI called cpiAsrtEsc.
Notes:
The thresholds are configurable on the FSGUI CPI window.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in a subsequent interval.

Default severity
MAJOR, MINOR, WARNING

Severity Details
The alarm severity is determined by the threshold settings. The default alarm severity
follows the criteria below:
Major Alarm: CPI value > 20
Minor Alarm: 10 < CPI value <= 20
Warning Alarm: 5 < CPI value <= 10

Root Cause
In this case, the cause of the problem is non-specific and is dependent upon the function
of the software generating the assert and the defensive check it is performing.

Fault clearance procedure

1 This alarm clears automatically if the rate of non-escalating assert generation drops
below the threshold.

2 Determine whether any other alarms have been recently raised on the reported resource
and address them.

3 Examine the recent Performance Measurement (PM) counts on the reported resource; they
may reveal more about this issue.

4 If a provisioning or configuration change was executed just before the alarm was raised,
consider whether the change is causing the problem.

5 If a Software Update (SU) or Patch is being soaked, this could indicate a problem with
the delivered software; contact Alcatel-Lucent Customer Support immediately.

6 If the PM counts indicate degradation of service, switch the service to its redundant
mate.

7 If the situation persists after a switch-over, make sure that the prior active host of
the service was removed from service and restored completely. Then attempt another
switch-over to the original active host for the service.

8 In all cases, contact Alcatel-Lucent Customer Support regarding this alarm.

END OF STEPS


LSS_cpiAsrtNonEscCritical
Description
The raised alarm LSS_cpiAsrtNonEscCritical indicates that the value of the
VS.asrtNonESCCritical measurement, monitored by the Critical Non-Escalating Asserts
CPI, exceeded a threshold in the last 15-minute interval.
In software, defensive checks are placed to ensure that expected inputs or boundary
conditions within routines are met. If such a check fails, an assert report is logged containing
information for the code author to debug the problem. The assert messages themselves are
not of much value to the operator. Most of the time, asserts are isolated events; however,
they can begin to accumulate, in which case a more serious failure may be occurring, to
which this alarm draws attention.
Defensive checks are performed to prevent more serious outages if possible.
Asserts that do not invoke any automatic recovery through escalation are tagged with
levels of severity, in this case critical, to provide guidance to Alcatel-Lucent Customer
Support on the seriousness of the problem.
Notes:
The thresholds are configurable in the FSGUI CPI window.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.

Default severity
MAJOR, MINOR, WARNING

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Major Alarm: CPI value > 10
Minor Alarm: 5 < CPI value <= 10
Warning Alarm: 3 < CPI value <= 5
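
The default criteria above can be read as a mapping from the CPI value for a 15-minute interval to an alarm severity. The following Python sketch is illustrative only: the function name classify_critical_cpi and the threshold tuple are hypothetical and are not part of the 9471 WMM software. The values shown are the defaults listed above, which can be changed in the FSGUI CPI window.

```python
from typing import Optional

# Default thresholds for LSS_cpiAsrtNonEscCritical, taken from the criteria above.
# Each entry is (severity, exclusive lower bound), ordered from highest severity down.
DEFAULT_THRESHOLDS = (("MAJOR", 10), ("MINOR", 5), ("WARNING", 3))


def classify_critical_cpi(cpi_value: int, thresholds=DEFAULT_THRESHOLDS) -> Optional[str]:
    """Return the severity for a CPI value, or None if no threshold is exceeded.

    Hypothetical helper for illustration; not an interface of the product.
    """
    for severity, lower_bound in thresholds:
        if cpi_value > lower_bound:
            return severity
    return None
```

Under the default thresholds, a CPI value of 4 maps to WARNING, 10 maps to MINOR, 11 maps to MAJOR, and 3 raises no alarm.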

Root Cause
In this case, the cause of the problem is non-specific and is dependent upon the function
of the software generating the assert and the defensive check it is performing.

Fault clearance procedure

1 This alarm clears automatically if the rate of non-escalating assert generation drops below
the threshold.

2 Determine if any other alarms have been recently raised on the resource reported and
address them.

3 Examine the recent Performance (PM) counts on the resource reported; they may provide
more information about this issue.

4 If a provisioning or configuration change was executed just before the alarm was raised,
consider whether the change is causing the problem.

5 If a Software Update (SU) or Patch is being soaked, this alarm could indicate a problem
with the delivered software; contact Alcatel-Lucent Customer Support immediately.

6 If the PM counts indicate degradation of service, switch the service to its redundant
mate.

7 If the situation persists after a switch-over, ensure that the previously active host of the
service was removed from service and completely restored. Then attempt another switch-over
to the original active host for the service.

8 In all cases, contact customer support regarding this alarm.

END OF STEPS


LSS_cpiAsrtNonEscMajor
Description
The raised alarm LSS_cpiAsrtNonEscMajor indicates that the value of the
VS.asrtNonESCMajor measurement, monitored by the Major Non-Escalating Asserts CPI,
exceeded a threshold in the last 15-minute interval.
In software, defensive checks are placed to ensure that expected inputs or boundary
conditions within routines are met. If such a check fails, an assert report is logged containing
information for the code author to debug the problem. The assert messages themselves are
not of much value to the operator. Most of the time, asserts are isolated events; however,
they can begin to accumulate, in which case a more serious failure may be occurring, to
which this alarm draws attention.
Defensive checks are performed to prevent more serious outages if possible.
Asserts that do not invoke any automatic recovery through escalation are tagged with
levels of severity, in this case major, to provide guidance to Alcatel-Lucent Customer
Support on the seriousness of the problem.
Notes:
The thresholds are configurable in the FSGUI CPI window.
An alarm with the same severity is raised only once for the same CPI and component.
The alarm clears if no threshold is met in one of the following intervals.
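
The raise-once and clear behaviour described in the notes above can be pictured as a small state tracker keyed by CPI and component. The Python sketch below is a reading of those two notes, not the product's implementation: the class CpiAlarmTracker, its method names, and the action strings are all hypothetical. It also assumes that a change of severity for the same CPI and component is reported as a new raise, which the notes imply but do not state.

```python
from typing import Dict, Optional, Tuple


class CpiAlarmTracker:
    """Hypothetical tracker for the raise-once and clear behaviour in the Notes."""

    def __init__(self) -> None:
        # Active severity per (CPI name, component); absent means no alarm is raised.
        self._active: Dict[Tuple[str, str], str] = {}

    def on_interval(self, cpi: str, component: str, severity: Optional[str]) -> Optional[str]:
        """Process one interval's result and return the action taken, if any.

        `severity` is the severity whose threshold was met in the interval,
        or None when no threshold was met.
        """
        key = (cpi, component)
        current = self._active.get(key)
        if severity is None:
            if current is None:
                return None  # nothing raised, nothing to clear
            del self._active[key]
            return "CLEAR"
        if severity == current:
            return None  # same severity already raised: do not raise it again
        # Assumption: a change of severity is reported as a new raise.
        self._active[key] = severity
        return "RAISE " + severity
```

With this sketch, two consecutive intervals at MINOR for the same CPI and component produce a single raise, and a later interval in which no threshold is met produces a clear.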

Default severity
MAJOR, MINOR, WARNING

Severity Details
THE ALARM SEVERITY IS DETERMINED BY THE THRESHOLD SETTINGS. THE
DEFAULT ALARM SEVERITY FOLLOWS THE CRITERIA BELOW:
Major Alarm: CPI value > 15
Minor Alarm: 7 < CPI value <= 15
Warning Alarm: 4 < CPI value <= 7

Root Cause
In this case, the cause of the problem is non-specific and is dependent upon the function
of the software generating the assert and the defensive check it is performing.
