5.2 RAM CACHING PROBLEMS
Posted: Tue Feb 02, 2010 2:36 pm
A little background:
ESX 4.0 running on Dell R900 (64GB RAM) with QLogic 4062 for iSCSI connection. ESX is installed on local RAID 1 drives.
StarWind 5.2 server on Supermicro SC216E2, Xeon E5530 quad, 8GB RAM, Adaptec 2258800 24-port SATA/SAS RAID controller on dual-multilane SAS backplane with 13 300GB 10K RPM SAS drives (mostly RAID 5). Also, 8 Intel E1000 type NICs.
The system has been running VERY WELL with iSCSI using a failover configuration, with preferred adapters set in ESX to spread load between the two iSCSI channels (a trick I posted last year). All running well with jumbo frames enabled at the StarWind NICs, the switch and (I think by default) on the QLogic adapters.
Three targets are set up in Basic Disk mode with clustering enabled (to allow the NIC failover trick to spread load). On StarWind 5.1, RAM caching was not enabled. When I installed v5.2 (last week), all continued to work well. As a test, I removed and remounted one target (which had only a Win2000 VM on it), this time with WRITE-THROUGH CACHING enabled and 1.5GB of RAM assigned. I rebooted the VM on that target and all seemed to work well, though I did not do any performance analysis. Since this configuration worked, on the weekend I decided to shut down all VMs on the two other targets and remount those targets with RAM CACHING enabled as well. I assigned 2GB of RAM to each target and again used WRITE-THROUGH mode. When I went to reboot the VMs, ALL HELL BROKE LOOSE!
First, the vSphere client would not connect to the ESX server. I was able to connect via PuTTY and start the webservices on ESX. The VMs were powering up but I could not open their consoles. Using PuTTY, I shut down the ESX server, removed RAM CACHING from the two targets I had just added it to, and rebooted ESX. Even then, I could not connect the vSphere client until I used PuTTY to start the webservices on the ESX server. The Win2000 VM, whose target still had RAM CACHING and which had worked well before, was frozen on its login screen. Only when I removed RAM CACHING from the last target did everything return to normal.
Any advice? Has anyone experienced problems with RAM CACHING? Maybe it was the multiple targets that caused the problems?
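For reference, here is a quick tally of the RAM cache sizes described above against the 8GB in the StarWind server. This is only a back-of-the-envelope sketch: the target names are made up for illustration, and nothing here shows that memory pressure was actually the cause of the problem.

```python
# Cache sizes per target in GB, taken from the post above.
# The target names are hypothetical labels, not the real target names.
cache_gb = {"target_1": 1.5, "target_2": 2.0, "target_3": 2.0}
physical_ram_gb = 8.0  # RAM installed in the StarWind server


def cache_budget(cache_sizes_gb, ram_gb):
    """Return (total cache, RAM left over, fraction of RAM used by cache)."""
    total = sum(cache_sizes_gb.values())
    return total, ram_gb - total, total / ram_gb


total, remaining, fraction = cache_budget(cache_gb, physical_ram_gb)
print(f"Total cache assigned: {total} GB")
print(f"Left for Windows and StarWind itself: {remaining} GB")
print(f"Share of physical RAM used by cache: {fraction:.0%}")
```

With all three targets cached, 5.5GB of the 8GB was assigned to cache, leaving 2.5GB for everything else on the box. With only the first target cached, the figure was 1.5GB.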