Flood of errors: section is locked by dist sharer ID 0xD2
Posted: Mon Jun 21, 2021 9:19 am
I am using StarWind Virtual SAN for VSphere in a two node setup. Both nodes are running as appliances, with RAID'd SSDs configured with a VDO on top. The servers are connected over 10gig fibre for synch and ethernet links for the heartbeat.
Last night, there was a flood of errors across the setup just after midnight beginning with
Following this, there were many instances of and various iSCSI disconnections and reconnections.
This caused the CPU usage of one of the nodes to skyrocket, and issues with the VMs as the SAN appeared to flip on and off.
The other node also had errors in the logs but these seemed to be more related to the primary issues.
I have checked the health of the
After 5 hours this behaviour appeared to resolve itself, with both nodes and VSAN performance returning to normal.
Does anyone here have an idea as to why this might occur?
I have log files available for further investigation.
Last night, there was a flood of errors across the setup just after midnight beginning with
Code: Select all
DistCs::Enter: (CSid = 0xCCC5) section is locked by dist sharer ID 0xD2.
Code: Select all
DistCs::Enter: (CSid = 0xCCC5) retry #1 after sleep 152 ms...
This caused the CPU usage of one of the nodes to skyrocket, and issues with the VMs as the SAN appeared to flip on and off.
The other node also had errors in the logs but these seemed to be more related to the primary issues.
I have checked the health of the
After 5 hours this behaviour appeared to resolve itself, with both nodes and VSAN performance returning to normal.
Does anyone here have an idea as to why this might occur?
I have log files available for further investigation.