MD3420 Error : Controller Module RAID Removed or Replaced - DELL|EMC Storage Forum - Storage - Dell Community

MD3420 Error : Controller Module RAID Removed or Replaced

Storage

Storage
Information and ideas on Dell storage solutions, including DAS, NAS, SAN and backup.

MD3420 Error : Controller Module RAID Removed or Replaced

This question is not answered

The MD3420 hosts the shared storage of a VMware-Failover-Cluster.

Frequently, hosts lose paths to storage , the eventlog of the cluster nodes shows at the same time a series of errors (storage paths losed).

Date / Time: 30/03/17 14:27:53
Sequence number: 4294
Type of event: 1712
Event Category: Internal
Priority: Informative
An event should be checked: false
Event Send Alert: False
Visibility of the event: true
Description: The RAID controller module wide port is in the optimum state
Event-specific codes: 0/0/0
Component Type: RAID Controller Module
Component slot: Housing 0, Housing 1
Logged by: RAID Controller Module in Slot 1

Données brutes :
4d 45 4c 48 03 00 00 00 c6 10 00 00 00 00 00 00
12 17 48 00 a9 dd dc 58 14 00 00 00 00 01 01 00
00 00 00 00 04 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 02 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 01 03 30 00 00 00 04 00 00 08
00 00 00 00 04 00 26 88 02 00 00 00 1c 00 61 87
50 0a 09 84 ac d1 75 bf 50 0a 09 84 ac d1 75 00
18 04 04 01 01 00 00 00 03 00 00 00


Date / Time: 30/03/17 14:27:31
Sequence number: 4292
Event Type: 400B
Event Category: Internal
Priority: Informative
An event should be checked: false
Event Send Alert: False
Visibility of the event: true
Description: Removed or Replaced RAID Controller Module
Event-specific codes: 0/0/0
Component Type: RAID Controller Module
Component slot: Housing 0, Housing 1
Logged by: RAID Controller Module in Slot 1

Données brutes :
4d 45 4c 48 03 00 00 00 c4 10 00 00 00 00 00 00
0b 40 48 00 93 dd dc 58 00 00 00 00 00 00 00 00
00 00 00 00 04 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 02 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 01 00 00 00 00 00


Date / Time: 30/03/17 14:27:29
Sequence number: 4290
Type of event: 2606
Event Category: Internal
Priority: Informative
An event should be checked: false
Event Send Alert: False
Visibility of the event: true
Description: Routine Beginning of day launched
Event-specific codes: 0/0/0
Component Type: RAID Controller Module
Component slot: Housing 0, Housing 1
Logged by: RAID Controller Module in Slot 1

Données brutes :
4d 45 4c 48 03 00 00 00 c2 10 00 00 00 00 00 00
06 26 48 10 91 dd dc 58 00 00 00 00 00 00 00 00
00 00 00 00 04 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 02 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 01 01 08 00 00 00 04 00 23 08
01 00 00 00


But the Modular Disk Storage Manager shows all disks ok.

I updated firmware to Dell PowerVault MD 34/38 Series Storage Controller Firmware and NVSRAM firmware version 08.25.09.61
What happens with the Storage?

All Replies
  • Hi,

    Are you still having the connection issues after the firmware update? How long are the cables? 

    Thanks,

    Josh Craig

    Dell EMC | Enterprise Support Services
    Get support on Twitter: @DellCaresPRO

    Download our QRL app: iOSAndroidWindows

  • Hi,

    The cables are new, they are 4 months old.
    I have updated the firmware for a month and the problems are repeated again.

  • After the firmware update, we are still faced with a problem. New SAS cables (2m) are in place.

    Frequently, hosts lose paths to storage. We do not see at log level where this problem can come from.

    Frequently, we have: Removed or Replaced RAID Controller Module (for all controllers), without intervention. 

    ESXi and MD3420 Firmwares are up to date.

  • How can you help us, we are afraid that the problem will recur again, our infrastructure is in production.

  • Hello tcrescence,

    Wide port errors are generally associated with a controller reset. It may be that the controller reset during its boot sequence after the battery replacement. This is not unusual when changes are made. The controller will see the changes, commit them to running memory, then reboot to commit them to permanent memory.

    What you need to look at the event log to see if there are corresponding wide port optimal messages. You also might look for Start of Day Routine start and completed messages. If that’s all completing then you are good to go as there is no error.  If you have a support bundle we can review it to see what the errors are.

    Please let us know if you have any other questions.

    DELL-Sam L
    Dell | Social Outreach Services - Enterprise
    Download the Dell Quick Resource Locator app today to access PowerEdge support content on your mobile device! (iOS, Android, Windows)

  • I transfer you the logs your analysis.
    Sorry, the MD3420 storage array is configured in French.
    We must start by reading from the bottom of the document.

    log_md3420.log

  • Hello tcrescence,

    What I need is a support bundle from your MD3420. You can gather a support bundle by going to the tools tab and selecting gather support logs. Once you have the logs if you can email them to me so that I can review them that would be great. I will send you an email using your address in your profile that you can reply back to with the logs.

    Please let us know if you have any other questions.

    DELL-Sam L
    Dell | Social Outreach Services - Enterprise
    Download the Dell Quick Resource Locator app today to access PowerEdge support content on your mobile device! (iOS, Android, Windows)

  • MD3420_support_data.txt

    Hello,

    In attachment, please find MD3420 support data.

    Could you send me back an email to tesis-exploitation@nextiraone.eu?

    Thanks.

  • Hello tcrescence,

    I sent you an email. If you can please reply back with the logs that would be great.

    Please let us know if you have any other questions.

    DELL-Sam L
    Dell | Social Outreach Services - Enterprise
    Download the Dell Quick Resource Locator app today to access PowerEdge support content on your mobile device! (iOS, Android, Windows)