This post was written by Sathish D of the Dell OpenManage Connections team.

OVERVIEW

HP Operations Manager (HPOM) supports scheduled task policy, which you can use to invoke external applications from the HPOM console. Dell Smart Plug-in (SPI) has a scheduled task policy, which retrieves the device health status and generates corresponding status messages in the HPOM console.

This post explains the Periodic Health monitoring of Dell EqualLogic Storage Arrays, and Dell PowerVault MD Storage Arrays using Dell SPI Scheduled Status Poll Polices with HP Operation Manager for Windows.

Periodic Health Monitoring Scheduled Status Poll policies for Dell Storage Arrays in HPOM console:-

PREREQUISITES:

Before scheduling the status poll policies, complete these prerequisites:

-      Configure Simple Network Management Protocol (SNMP) and enable support for Dell EqualLogic PS-Series Storage Arrays.

-      Schedule the Dell Hardware Auto-grouping policy in the HPOM console to classify Dell Storage Array devices under the Dell groups.

Reference Documents:

  • For Equal Logic Storage Array Configuration:

http://psonlinehelp.equallogic.com/V3.3/configuring_an_array_and_creating_a_group.htm

  • For Equal Logic Storage Array SNMP Configuration

http://psonlinehelp.equallogic.com/V3.3/using_snmp_to_monitor_a_group.htm#Pr_group_4853_17826

 

MONITORING DELL EQUALLOGIC STORAGE ARRAY DEVICE IN HPOM CONSOLE:

Once the “Dell Hardware Auto-grouping Policy” is completed, the policy identifies any discovered EqualLogic devices and creates a hierarchy based on EqualLogic Group (IP address) and sets the storage pool to belong to the EqualLogic Group. Discovered EqualLogic devices are classified under the associated Storage Pool on both Node and Service Hierarchies in the HPOM console.

During classification, EqualLogic devices also create “global health Status” child nodes in the Service hierarchy. The service id represents the health status of the corresponding EqualLogic node.

Hierarchical Representation of EqualLogic device in Service Hierarchy:  

                                                              

DEPLOYING THE DELL EQUALLOGIC ARRAYS HEALTH STATUS SCHEDULED TASK POLICY:

The scheduled task policy “Dell EqualLogic Arrays Scheduled Status Poll” is used to monitor the overall system health status of Dell EqualLogic Storage Array devices in the HPOM console. The default interval of this policy is every day 2:00 A.M. The interval can be changed to custom values as required. Policies are deployed via the Management Server.

Once the policy is run, the health status of the device is queried through the SNMP protocol and a corresponding message with the associated severity is shown in the HPOM console. The health status message is also associated with the device, classified under Dell groups in both the Node and Service hierarchies.

The schedule task policy acknowledges the health message and posts the current health status message for the classified EqualLogic device in the Active Message Browser of the HPOM console. The latest health status of the EqualLogic device is always displayed in the console.

The Scheduled Status Poll policy generates health messages with three different severities in the HPOM console.

   

Message Association in Service Hierarchy:

Once the Dell EqualLogic Arrays Scheduled Status Poll policy is run, the policy retrieves the overall health status of classified EqualLogic devices, and the retrieved health status is mapped with the appropriate health message severity (Normal, Warning or Critical).The health message associated for the EqualLogic device child node (EqualLogic node: Global System Status) is seen on the Active Message Browser of the HPOM Console. The message severity for the child the node is also propagated to the parent Node in both Node and the Service Hierarchies.

Service Hierarchy: Only the Global System Status message with its corresponding health severity will be associated and updated for the node. The health severity will propagate to the device parent group.

Node Hierarchy:  In Node hierarchy, SNMP Trap Messages and Health messages are associated with the EqualLogic device node; the worst case message severity is propagated to its parent Node group.

Health Status Message Association for EqualLogic Storage Array:

 

 Critical health messages are generated for these following reasons:

  • If the EqualLogic Member components Health are in Critical state.  Some possibilities include:  Power Supply Failure, Temperature exceeds the critical threshold value, Controller Failure, Physical Disk Failure, Fan Failure, and Battery failure, and so on.
  • If the device is in unreachable condition or SNMP timeout occurred.

TROUBLESHOOTING STEPS:

When the device Global health is in warning or critical status, follow these steps to troubleshoot the issues further:

-  Review the outstanding messages in the Active Message browser of the device.  If any issues exist, resolve them as per instructions in the message browser.

 -  Launch 1:1 console  for further troubleshooting,

  • Dell EqualLogic Group Manager Console launches for all EqualLogic PS-Series Storage Array Members

 -   Launch Dell tools for further troubleshooting and taking corrective action:

  • Warranty Report:

                   The Warranty report page is used to retrieve the warranty related information via the service tag              associated with the system.  You can review the warranty details of the system and also renew            the warranty.

  • Open Manage Essentials (OME) :

                        OME Console can be used to verify the device and the components health. It also provides                 rich device inventory information. You can launch the OME console to further troubleshoot the                       device specific information.

MONITORING DELL POWERVAULT MD STORAGE ARRAYS IN HPOM CONSOLE:

Once the Dell Hardware Auto-grouping Policy is executed, the policy identifies the discovered MD Storage Arrays; the corresponding MD Arrays are classified under Dell Storage groups on both Node and Service Hierarchy in the HPOM console.

During classification, MD Storage Array device also creates the “global health Status” child node as shown in the Service hierarchy.

   

Hierarchical Representation of MD Storage Array in Service Hierarchy

DEPLOYING THE DELL MD STORAGE ARRAYS SCHEDULED STATUS POLL:

The scheduled task policy “Dell MD Storage Arrays Scheduled Status Poll Policy” monitors the overall system health status of Dell PowerVault MD Storage Arrays in the HPOM console. The default interval of this policy is every day 2:00 A.M. The interval can be changed to custom value as required. Polices are deployed via the Management Server.

Once the policy is run, the health status of the device is queried through the communication protocol and a corresponding message with the associated severity will be shown in the HPOM console.

The schedule task policy polls periodically, and acknowledges the previous health message and updates the current health status message for the classified MD Storage Array device in the Active Message Browser of the HPOM console.

This Scheduled Status Poll policy supports three different Health Message Severities in the HPOM console.

   

Message Association in Service Hierarchy:

            Once the Dell MD Storage Array Scheduled Status Poll policy is run, the policy retrieves the overall health status of the classified MD Storage Array device, and the retrieved health status is mapped with appropriate health message severity (Normal, Warning or Critical).

The health message is associated with the MD Storage Array device child node (MD Storage Array node: Global System Status) is seen on the Active Message Browser of the HPOM Console. The message severity for the child node is also propagated to the parent Node in both Node and the Service Hierarchy.

Service Hierarchy: Only the Global System Status message with its corresponding health severity will be associated and updated for the MD Storage Arrays node. The health severity will be propagated to the device parent group.

Node Hierarchy:  In Node hierarchy, all messages including Global System Status message are associated with the MD Storage Array device; the worst case message severity is propagated to its parent Node group.

Health Status Message Association for MD Storage Array device:

The Critical message gets generated when the MD Storage Array device is in unreachable condition or the communication error.

 

TROUBLESHOOTING STEPS:

When the device Global health status is “Warning” or “Critical”, do one or more of follow these steps to troubleshoot the issue further:

-   Review the outstanding messages in the Active Message browser of the device.  If any issues exist, resolve them as per instructions in the message browser.

 -  Launch 1:1 console  for further troubleshooting

  • Dell MD Storage Manager Console launch for the MD Array and verify the Recovery Guru Failure events

-   Launch Dell tools for further troubleshooting and take corrective action:

  • OME – Open Manage Essentials (OME) Console to verify the health of the device and its components, and the device inventory information

APPENDIX:

Refer to the following links: