DataOn-Starwind Issue
Posted: Sat Dec 13, 2014 3:52 pm
Hello my fellow starwinders!
We are currently running a 2-node HA cluster using two DataOn DNS-1660 60-bay JBODs. The storage is configured as a 40-drive RAID 50 with hot spares.
Losing a drive periodically is to be expected, which is why we use redundancy and hot spares. However, when I lose a drive, StarWind forces a full sync of all my HA images, and I lose iSCSI access to that node for the duration of the full sync. I opened a ticket with StarWind support and they said "This is a normal behavior, since after one drive failure storage array performance degradation is observed, and it cannot be compared to performance of healthy array on the other node, so synchronization is the only way to fix it".
OK, so I am curious whether other users out there see this same behavior when losing a drive in their DataOn JBODs. Currently we only have about 8TB of HA images, so a full sync only takes about an hour, but my fear is what the sync time will be if we have 20TB, 40TB, or 60TB. Imagine if we had 40TB and the full sync took 3-4 hours. That means 3-4 hours of running on ONE NODE, during which any further issue would bring my entire operation to a halt. This just seems totally asinine to me. I have used storage from other vendors such as Compellent, EMC, and NetApp, and there losing a drive is a minor event.
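To put rough numbers on that worry, here is a back-of-the-envelope sketch. It assumes sync time scales linearly with image size from the one data point above (about 8TB in about an hour). That is only an assumption: real throughput will depend on the degraded array, the sync link, and load, and the function name and defaults are made up for illustration.

```python
def estimate_full_sync_hours(image_tb, baseline_tb=8.0, baseline_hours=1.0):
    """Linearly extrapolate full-sync time from one observed sync.

    Assumption: sync time is proportional to total HA image size.
    baseline_tb / baseline_hours default to the ~8TB in ~1 hour seen today.
    """
    if image_tb < 0 or baseline_tb <= 0 or baseline_hours <= 0:
        raise ValueError("sizes and durations must be positive")
    hours_per_tb = baseline_hours / baseline_tb
    return image_tb * hours_per_tb


if __name__ == "__main__":
    # Sizes mentioned in the post: current 8TB, then 20TB, 40TB, 60TB.
    for size_tb in (8, 20, 40, 60):
        hours = estimate_full_sync_hours(size_tb)
        print(f"{size_tb} TB of HA images: roughly {hours:.1f} h on one node")
```

If the linear assumption holds, 40TB would be closer to 5 hours than 3-4, and 60TB around 7.5 hours of running on a single node.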
At this point I don't know if the blame lies with DataOn or StarWind; I am just curious whether other users out there share our pain. If so, have you found any good workarounds?
**I just read a forum post in which StarWind says it doesn't recommend using DataOn JBODs, as they see the most issues with them. That post came about 18 months too late for us, as we are already invested in their product. Of course, we will keep that in mind at our next hardware refresh cycle.
http://www.starwindsoftware.com/forums/ ... aon#p22192