Operations and Maintenance for SM

Operation and maintenance (O&M) functions of the iAX™ Subscription Manager are available via the PMI, where you can access alarms and logs to monitor the running Subscription Manager.

The PMI

The PMI runs on the iAX™ Platform and provides operator access via a standard web browser. The PMI combines a web server and a back‐end application interface.

Figure: PMI overview

Via the back‐end interface, the PMI monitors the status of the system and all active alarms for the Tango applications (iAX™ Subscription Manager) and for the underlying iAX™ Platform, including the SS7 stack. The web server makes the collected information viewable in any standard web browser.

Configuration of the application is managed via the web browser and applied to the application via the back‐end interface.


Alarm and Event Handler

The Alarm and Event Handler (AEH) is a core platform service that manages the collection, logging and reporting of alarms and events. Key features of the AEH include:

  • Time and date stamping of all alarms and events, with logging to a file on disk. This file has a configurable maximum size and a configurable rollover timer.

  • A user interface to view the currently active alarms. Each alarm is assigned one of three levels: critical, major or minor. Each level is denoted by a number and colour code on the active alarm screen.

  • A user interface to view the alarm and event history logs

  • A function to reset the alarm and event logs (separately)

  • An SNMP interface to external alarm management systems for raising alarms

  • Local or remote access to AEH services via a standard web browser
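To make the logging behaviour concrete, the following is a minimal sketch of a time‐stamped alarm record and of a rollover check that combines a maximum file size with a rollover timer. It is not the AEH implementation: the names `Severity`, `AlarmRecord` and `needs_rollover`, the log line layout and the numeric values assigned to the three levels are all assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from enum import IntEnum


class Severity(IntEnum):
    # Hypothetical numbering; the numbers the AEH actually uses are not stated here.
    CRITICAL = 1
    MAJOR = 2
    MINOR = 3


@dataclass(frozen=True)
class AlarmRecord:
    timestamp: datetime
    host: str
    name: str
    severity: Severity

    def to_log_line(self) -> str:
        # Date and time stamp first, then severity, host and alarm name.
        stamp = self.timestamp.isoformat(sep=" ", timespec="seconds")
        return f"{stamp} {self.severity.name} {self.host} {self.name}"


def needs_rollover(size_bytes: int, opened_at: datetime, now: datetime,
                   max_bytes: int, max_age: timedelta) -> bool:
    """Return True when the file reached its size limit or the rollover timer expired."""
    return size_bytes >= max_bytes or (now - opened_at) >= max_age
```

Either condition alone triggers a rollover in this sketch, so a quiet system still starts a new file when the timer expires and a busy one starts a new file when the size limit is reached.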

Active alarms can be viewed via the PMI. The active alarm list details the time, date and severity of each alarm and the host machine on which the alarm occurred. The refresh rate of this alarm screen is configurable.
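As an illustration of the fields in the active alarm list, the sketch below models one list entry and orders entries with the most severe alarms first. The PMI's actual ordering is not described in this section; `ActiveAlarm`, `SEVERITY_RANK` and `sort_for_display` are hypothetical names, and the ordering rule is an assumption.

```python
from datetime import datetime
from typing import NamedTuple

# Assumed ranking: lower value means more severe.
SEVERITY_RANK = {"CRITICAL": 0, "MAJOR": 1, "MINOR": 2}


class ActiveAlarm(NamedTuple):
    raised_at: datetime  # date and time the alarm was raised
    severity: str        # "CRITICAL", "MAJOR" or "MINOR"
    host: str            # host machine on which the alarm occurred
    name: str


def sort_for_display(alarms):
    """Order alarms by severity; within one severity, newest first."""
    newest_first = sorted(alarms, key=lambda a: a.raised_at, reverse=True)
    # sorted() is stable, so the newest-first order survives within each severity.
    return sorted(newest_first, key=lambda a: SEVERITY_RANK[a.severity])
```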


Log file export

Daily alarm and event log files are stored on the iAX™ Platform for a configurable period. The log files are available for download (in zip format) via the PMI web‐based interface.
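The retention and export behaviour can be sketched as follows, assuming one log file per day and a retention period expressed in days. This is only an illustration built on Python's standard `zipfile` module; the function names `expired_logs` and `zip_logs`, and the rule that a log expires once it is older than the retention period, are assumptions rather than the platform's documented behaviour.

```python
import zipfile
from datetime import date, timedelta
from pathlib import Path


def expired_logs(log_dates, today, retention_days):
    """Return the dates whose daily logs fall outside the retention period."""
    cutoff = today - timedelta(days=retention_days)
    return [d for d in log_dates if d < cutoff]


def zip_logs(log_files, archive_path):
    """Bundle the given log files into one compressed zip archive."""
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for log_file in log_files:
            # Store each file under its bare name, without the directory path.
            zf.write(log_file, arcname=Path(log_file).name)
    return archive_path
```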


Alarm definitions

Alarm                                Level     Description
OVERFLOW STORE                       CRITICAL  Disk space used by the long‐term store exceeds a configurable threshold.
FAILOVER STORE                       MAJOR     The active long‐term message store has failed over to the backup.
MANAGED PROCESS FAILED               CRITICAL  A process managed by the Tango Process Manager has failed on start‐up or during execution.
TIMED OUT AWAITING POLL              MAJOR     IP connectivity problem on a single interface.
MTPL3:DESTINATION INACCESSIBLE       CRITICAL  Raised by the SS7 stack when an SS7 network element is unreachable.
POWER SUPPLY HAS FAILED OR NO INPUT  CRITICAL  One of the dual power supplies has failed on the specified node.
CPU IDLE TIME BELOW THRESHOLD        MAJOR     CPU idle time has fallen below a configurable threshold. The system may need to be expanded.
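For scripts that process exported logs, the alarm definitions above can be held in a simple lookup table. The sketch below copies the alarm names and levels from the definitions in this section; the names `ALARM_LEVELS`, `alarms_at_level` and `cpu_idle_alarm` are hypothetical, and the threshold comparison is an assumed reading of "below a configurable threshold".

```python
# Alarm name -> level, taken from the alarm definitions in this section.
ALARM_LEVELS = {
    "OVERFLOW STORE": "CRITICAL",
    "FAILOVER STORE": "MAJOR",
    "MANAGED PROCESS FAILED": "CRITICAL",
    "TIMED OUT AWAITING POLL": "MAJOR",
    "MTPL3:DESTINATION INACCESSIBLE": "CRITICAL",
    "POWER SUPPLY HAS FAILED OR NO INPUT": "CRITICAL",
    "CPU IDLE TIME BELOW THRESHOLD": "MAJOR",
}


def alarms_at_level(level):
    """Return the names of all defined alarms at the given level, alphabetically."""
    return sorted(name for name, lvl in ALARM_LEVELS.items() if lvl == level)


def cpu_idle_alarm(idle_percent, threshold_percent):
    """Return True when CPU idle time is strictly below the configured threshold."""
    return idle_percent < threshold_percent
```

For example, `alarms_at_level("MAJOR")` returns the three MAJOR alarms defined above.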

Refer to your Tango Alarm Reference Sheet, which details the alarms and events common to all services that can be deployed (SMS, GPRS, Voice, etc.). These include alarms related to hosts and interfaces.