i got strange replication issues on two customer installation.
- PS6110XS replicating to PS4110E using Smart Replicas (HIT/VE)- Both EQLs running V. 6.0.1- Both EQLs connected to PC8024F switch stack. Configured as "Dell Best Practice (Flow Control, Jumbo frames, Port Fast, STP off)- Group Admin Web Interface shows up "Partner down" suddenly for some replications. Last replication stucks in "in progress". Other replications (to same member) working fine.- After manual controller failover "partner down" changes back to "in progress" but no data is transfered.
- PS6110XV replicating to PS6110XV using Smart Replicas (HIT/VE)- Both EQLs running V. 6.0.1- Each EQL is connected to PC8024 switch stack. Both stacks connected via 2* 10GBit LAG (~300 Meter) Configured as "Dell Best Practice (Flow Control, Jumbo frames, Port Fast, STP off)- Group Admin Web Interface shows up "Partner down" suddenly for some replications. Last replication stucks in "in progress". Other replications (to same member) working fine.- After manual controller failover "partner down" changes back to "in progress" but no data is transfered.- Massiv "packet errors" (around ~100 errors / minute) on all management interfaces (even after manual controller failover). Change switch port to 100/Fdx. No Change. Switch (HP 8212 and HP 2810) shows no errors. Can't belive in four broken cable.
You need to open a case, but first thing is to make sure that DCB is disabled on the 8024 switch. That switch doesn't support the iSCSI features needed to fully support DCB on iSCSI SANs.
Case is already open. But at the moment it doesn't look like that support gets the clou.
DCB is disabled on switch and eql
We don't know if the packet errors on management interface are causing the partner down error.
I change the switch, duplex settings, cable -> Still packet errors.
Also updated the replication site to 6.0.2. Still packet errors.
Packet errors on Mgmt interface won't impact replication at all. Replication is an iSCSI connection between members. Are you using the default VLAN on the 8024's? WAN accelerator? Support should provide you with a script to ping FROM all member ports TO all member ports to make sure all ports are accessible.
No, iSCSI is running on VLAN 10. No WAN Accelerators. Replication site (~400 meter) is connected via 20Gbit LAG (two PC8024er stacks (one stack á 2 switches each site).
There are some strange things:
Some replications are working fine while others are run into "partner down". I have to delete the complete replica set and start over. Sometimes it works for a few replications until it ran into "partner down" again.
The replication destinations always shows that the replication with "partner down" was successfull.
As you can see (source site). Replication from 14:31 still "in progress" / replication status "partner down"
Destination site shows that this replication is completed:
Have you disabled DCB on the 8024's? You need to be at the most recent version to do this. In the EQL GUI, is DCB enabled there and the VLAN set to 10?
DCB is disabled on EQL.
AFAIK is there no "DCB off" switch on PC8024. Firmware is 188.8.131.52
PFC (priority flow control) is inactive
There is a way to turn it off on the switch. Worst case is disable LLDP. Is the VLAN set to 10 in the EQL GUI?
(I know it seems counter intuitive if DCB is turned off)
iSCSI VLAN 10 is untagged to all storage ports.
LLDP is active for all storage ports.