Really hope someone can point us in the right direction.
Scenario:
- 3-node WS2012 Hyper-V cluster
- iSCSI SAN
- node 1 owns all the CSVs in the cluster
When we reboot either node 2 or node 3, random CSVs occasionally fail, *even though the CSV is not owned by the node being rebooted*! Afterwards, the VMs on those CSVs report disk errors, restart at random, and so on.
The cluster passes the storage and networking validation tests. We were also able to ping the SAN IPs continuously throughout the node reboot without any drops.
Events we get:
[5120] Cluster Shared Volume 'Volume2' ('CSV2') is no longer available on this node because of 'STATUS_NO_SUCH_DEVICE(c000000e)'. All I/O will temporarily be queued until a path to the volume is reestablished.
then
[1038] Ownership of cluster disk 'CSV2' has been unexpectedly lost by this node. Run the Validate a Configuration wizard to check your storage configuration.
then
[1069] Cluster resource 'CSV2' of type 'Physical Disk' in clustered role '46aa2179-91c5-410d-83d4-efd95399bcfc' failed.
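In case it helps anyone correlate these, here is a rough Python sketch of how the three events above could be grouped per disk and checked for the 5120, 1038, 1069 order. It assumes the events are pasted as "[id] message" lines exactly as shown here (that is our own copy/paste layout, not a native event log format) and that disks are named like 'CSV2'. The function names are made up for illustration.

```python
import re

# Event text copied from this post. The "[id] message" layout is an assumption
# about how the events were pasted, not a real Windows event log export format.
EVENTS = [
    "[5120] Cluster Shared Volume 'Volume2' ('CSV2') is no longer available on this node because of 'STATUS_NO_SUCH_DEVICE(c000000e)'. All I/O will temporarily be queued until a path to the volume is reestablished.",
    "[1038] Ownership of cluster disk 'CSV2' has been unexpectedly lost by this node. Run the Validate a Configuration wizard to check your storage configuration.",
    "[1069] Cluster resource 'CSV2' of type 'Physical Disk' in clustered role '46aa2179-91c5-410d-83d4-efd95399bcfc' failed.",
]

EVENT_RE = re.compile(r"^\[(\d+)\]\s+(.*)$")
# Assumes cluster disks are named CSV<number>, as in our cluster.
DISK_RE = re.compile(r"'(CSV\d+)'")
STATUS_RE = re.compile(r"'(STATUS_\w+)\((\w+)\)'")


def parse_event(line):
    """Split one pasted line into event ID, disk name and status (if present)."""
    match = EVENT_RE.match(line.strip())
    if match is None:
        return None
    event_id, message = int(match.group(1)), match.group(2)
    disk = DISK_RE.search(message)
    status = STATUS_RE.search(message)
    return {
        "id": event_id,
        "disk": disk.group(1) if disk else None,
        "status": status.group(1) if status else None,
        "code": status.group(2) if status else None,
    }


def ids_by_disk(lines):
    """Collect event IDs per disk, preserving the order they were logged in."""
    grouped = {}
    for line in lines:
        event = parse_event(line)
        if event and event["disk"]:
            grouped.setdefault(event["disk"], []).append(event["id"])
    return grouped


def shows_failure_pattern(ids, pattern=(5120, 1038, 1069)):
    """True if the IDs in `pattern` appear in `ids` in that order (gaps allowed)."""
    remaining = iter(ids)
    return all(wanted in remaining for wanted in pattern)


if __name__ == "__main__":
    for disk, ids in ids_by_disk(EVENTS).items():
        print(disk, ids, "pattern seen:", shows_failure_pattern(ids))
```

Running it over the events from every node would show whether the same sequence hits each CSV that fails, or only some of them.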
Any ideas?