Channel: High Availability (Clustering) forum

Windows 2012 R2 Failover Cluster Hyper-V Invalid Class error when I'm trying to create VM

Hi, in my test lab environment I created a Windows 2012 R2 Failover Cluster with 4 servers to get Hyper-V HA.


I had no issues with Windows 2008 R2 or Windows 2012 before in the same setup (2 NICs, FC HBA, SAN storage), but this time I cannot create a VM using the Failover Cluster console:


Roles - Virtual Machines - New Virtual Machine - Select Host -> <ANY HOST>

I'm using default settings during VM creation (except setting the VM path manually to point to the desired disk).

In the progress window I see the disk being created, and both the VM configuration and .vhdx files appear on the target disk, but after that I see "The Operation has failed. An error occurred creating a New Virtual Machine. Invalid Class."



In fact I see the virtual machine in Hyper-V Manager and it's fully functional, but it is not added to Failover Cluster Roles. When I use Configure Role - Virtual Machine to see eligible machines, it's not there.

Cluster Validation says that everything is OK.

I wasn't able to find anything in the Event Log(s) or %SYSTEMROOT%\Cluster, as I could in Win2k8.

How should I troubleshoot this issue?


Windows 2008 R2 Cluster failover issues

Hi all,

I used to be quite proficient in Windows 2000/2003 clustering. I wanted to update my skill set, so I have set up a 2008 R2 cluster at my home. Please note that this is in my home and is not exactly an ideal production environment. I like to set up services at home for practice. (I will REALLY miss TechNet.)

For my shared storage I am using a Windows 2012 server with an iSCSI target (HP DL360G5). On all servers I have iSCSI going over the same teamed NICs as the network traffic. I know that this is not recommended/supported but it "seems" to be working OK for me.

Both nodes are domain controllers. Both are also DFS servers and they run DFS replication. They are both also DNS servers. None of these services are clustered.  (The servers are an ML330G6 and DL385G5)

On node A I am also running Exchange 2010 (not clustered)

I have both nodes connected via iscsi to the virtual drives of the storage server.

The iSCSI target has 3 shared drives for my clustered services: DHCP and Print. (I also created a quorum disk to play with Disk Only quorum.)

After many trials and tribulations I am almost steady. I can fail over back and forth between the nodes all day from the cluster administrator tool successfully. I have also tested failing over to node B and then rebooting node B. The resources successfully transfer over to node A. My issue is when I have node A as the owner of the resources and then I reboot node A. The resources don't fail over to node B. The cluster admin tool (running on node B) eventually hangs. Node B is finally able to grab the resources when node A comes back online. (It does this by itself. I don't force any failover when this happens.)
I have tried changing the Quorum mode. I tried Node and Disk Majority (using one of the shared storage disks as a witness). I tried Node and File Share majority (using the netlogon share on the storage server as the witness) and I also tried Disk Only. (created a third quorum shared disk)
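
For anyone reasoning about the quorum modes listed above, the vote arithmetic can be sketched roughly as follows. This is a simplified model in Python, not how the cluster service is actually implemented. It assumes one vote per node plus one vote for a disk or file share witness, and that the cluster stays up only while a strict majority of the configured votes is reachable. The function name `has_quorum` and its parameters are made up for illustration, and Disk Only mode is not modeled.

```python
def has_quorum(total_nodes, nodes_up, witness_configured=False, witness_reachable=False):
    """Return True if the surviving votes form a strict majority.

    Simplified model (an assumption, not the cluster service's real logic):
    each node has one vote and an optional witness adds one more.
    """
    total_votes = total_nodes + (1 if witness_configured else 0)
    votes_present = nodes_up + (1 if witness_configured and witness_reachable else 0)
    return votes_present > total_votes // 2


# Two nodes, no witness: losing one node leaves 1 of 2 votes.
print(has_quorum(2, 1))
# Two nodes plus a witness: the survivor and the witness hold 2 of 3 votes.
print(has_quorum(2, 1, witness_configured=True, witness_reachable=True))
# Same cluster, but the survivor cannot reach the witness: 1 of 3 votes.
print(has_quorum(2, 1, witness_configured=True, witness_reachable=False))
```

In this model a two-node cluster survives the loss of the resource owner only if the remaining node can still reach the witness, so it may be worth checking whether node B can reach the witness while node A is down.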

Any clues?

Thanks.

Access is denied messages in Win2012 R2 Failover Cluster validation report and CSV entering a paused state

I've been having some issues with nodes basically dropping out of the cluster's config.
The error shown was:

"Cluster Shared Volume 'Volume1' ('Data') has entered a paused state because of '(c000020c)'. All I/O will temporarily be queued until a path to the volume is reestablished."

All nodes (PowerEdge 420) are connected to a Dell MD3200 shared SAS storage array.

Nodes point to virtual 2012 R2 DCs.

Upon running validation with just two nodes, I get the same errors over and over again.

Bemused!

----------------

List Software Updates
Description: List software updates that have been applied on each node.
An error occurred while executing the test.
An error occurred while getting information about the software updates installed on the nodes.

One or more errors occurred.

Creating an instance of the COM component with CLSID {4142DD5D-3472-4370-8641-DE7856431FB0} from the IClassFactory failed due to the following error: 80070005 Access is denied. (Exception from HRESULT: 0x80070005 (E_ACCESSDENIED)).
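
As a troubleshooting aid, the HRESULT in the message above can be split into its fields. The sketch below is plain Python and relies only on the documented HRESULT bit layout (bit 31 severity, bits 16-26 facility, bits 0-15 code). The helper name `decode_hresult` is hypothetical.

```python
def decode_hresult(value):
    """Split a 32-bit HRESULT into its severity, facility and code fields."""
    value &= 0xFFFFFFFF  # normalise negative (signed) inputs to unsigned
    return {
        "failure": bool(value >> 31),        # bit 31: severity, 1 means failure
        "facility": (value >> 16) & 0x7FF,   # bits 16-26: facility
        "code": value & 0xFFFF,              # bits 0-15: status code
    }


FACILITY_WIN32 = 7  # facility used for wrapped Win32 error codes

fields = decode_hresult(0x80070005)
print(fields)
if fields["facility"] == FACILITY_WIN32:
    # Win32 error 5 is ERROR_ACCESS_DENIED
    print("Win32 error code:", fields["code"])
```

Facility 7 means the value wraps an ordinary Win32 error, so the failure is a plain access-denied result rather than something specific to cluster validation. Checking the permissions of the account running the validation on each node would be one reasonable place to start.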


and

List Disks
Description: List all disks visible to one or more nodes. If a subset of disks is specified for validation, list only disks in the subset.
An error occurred while executing the test.
Storage cannot be validated at this time. Node 'zhyperv2.KISLNET.LOCAL' could not be initialized for validation testing. Possible causes for this are that another validation test is being run from another management client, or a previous validation test was unexpectedly terminated. If a previous validation test was unexpectedly terminated, the best corrective action is to restart the node and try again.

Access is denied

-----------

The event viewer on one of the hosts shows
-------------
Cluster node 'zhyperv2' lost communication with cluster node 'zhyperv1'.  Network communication was reestablished. This could be due to communication temporarily being blocked by a firewall or connection security policy update. If the problem persists and network communication are not reestablished, the cluster service on one or more nodes will stop.  If that happens, run the Validate a Configuration wizard to check your network configuration. Additionally, check for hardware or software errors related to the network adapters on this node, and check for failures in any other network components to which the node is connected such as hubs, switches, or bridges.

The Cluster service is shutting down because quorum was lost. This could be due to the loss of network connectivity between some or all nodes in the cluster, or a failover of the witness disk.
Run the Validate a Configuration wizard to check your network configuration. If the condition persists, check for hardware or software errors related to the network adapter. Also check for failures in any other network components to which the node is connected such as hubs, switches, or bridges.

The only other warning is because the 4 NIC ports in each node server are teamed on one IP address split over two switches. I am not concerned about this and could, if required, split them into pairs. I think this is a red herring?

Cluster Shared Volume Offline on Windows server 2012 R2

Hi All,

I keep getting this error on my Windows Server 2012 R2 Cluster: "cluster shared volume is no longer accessible from this cluster node because of error '(1460)'". I also get I/O timeout errors.

In my Lab, I have 2 Windows Server 2012 R2 Hosts both running the following specs

Server Specs:

Intel Gen3 i7 Processor

64-Gig of Ram

2x Intel I350-T2 (For use with SRIOV and VMQ) --> Latest Drivers installed from Intel

My NAS device is a Synology DS1813+, which supports ODX - Latest Updates applied from Synology

The above error is constantly occurring and is taking my CSV's offline, all VMs are either paused or in a crashed state.

I know there are hotfixes for Windows Server 2012, however the problem seems to persist on Windows Server 2012 R2.

Currently I've disabled VMQ and ODX, and will be monitoring it.

I'm not backing up any VMs, i.e. not using Veeam or DPM, since this is a lab environment.

Are there any hotfixes for Windows Server 2012 R2, or any workarounds?

I will also be raising a case with Synology.

Error running unsigned driver inventory and test

I’m working a challenge where the cluster validation tool on a Server 2008 R2 SP1 based cluster throws an error running the unsigned driver inventory/validation. It doesn’t say that there are any unsigned drivers but rather says:

An error occurred while executing the test.
There was an error getting information about the unsigned drivers installed on the nodes.
There was an error retrieving information about the Unsigned Drivers from node 'someclusternode.contoso.com'.
Shutting down

If we run the tool from other nodes it will always pass itself and 4 out of the 5 other nodes, but there will be one node that will trigger this. And the node that it fails on is not consistent.

All of the other tests run just fine, and I can’t find anything in any KB that points me in a troubleshooting direction.

Failed to online the cluster generic script resource

Environment

Cluster Nodes = two

Cluster Nodes OS = Windows 2012

Application = IIS

Query

I configured the cluster and want to run IIS as a failover cluster role. For this I am following the article below. As the article directs, I replaced both values (SITE_NAME = "MyWebSite" and APP_POOL_NAME = "DefaultAppPool"). Now when I try to bring the generic script resource online, it fails.

http://support.microsoft.com/kb/970759/en-us

Please Note: When I used the same script mentioned in the above link with default web site, it works fine.


Any comment will be appreciated. Thanks. Zahid Haseeb.


WWW service is not able to start via Microsoft Failover Cluster generic service resource

Environment

Cluster Nodes = two

Cluster Nodes OS = Windows 2008R2

Application = IIS

Query

I created generic service resources for many Windows services under Microsoft Failover Cluster and they fail over successfully, but when I create a generic service resource for WWW, the WWW service cannot be brought online via Microsoft Failover Cluster. It gets stuck in Online Pending.

I have noticed two things.

1.) If the WWW service is set to manual and started on the passive node and I manually restart the active node, then the WWW service successfully switches over to the standby/passive node. But if the WWW service is set to manual and not started on the standby/passive node, then the WWW service does not fail over.

2.) If I kill the WWW service manually (as a test case) on the active node via this command (taskkill /f /pid XXXX), then the WWW service fails and does not fail over to the standby/passive node.


Any comment will be appreciated. Thanks. Zahid Haseeb.


Repairing a Cluster

Hi!

The hard disk of one of our two Windows 2012 cluster nodes crashed, and we had to replace the disk and reinstall everything. How can we repair the cluster? We also need to change the SCSI target location for the quorum disk.

Thanks.


Hyper-V Replica Broker resource not coming online

I was trying to create a cluster broker on a cluster using WMI. I am creating the required IPv4, IPv6 network resources and adding them as a dependency on the Hyper-V replica broker resource. When trying to bring the broker online, the Hyper-V Replica Broker resource is failing to come online with the below errors.

INFO  [NM] Received request from client address HRM-08-007.

ERR   [RES] Virtual Machine Replication Broker <Hyper-V Replica Broker C10022014XXXX>: 'Hyper-V Replica Broker C10022014XXXX' failed to start the network listener on destination node 'MachineName': No such host is known. (0x80072AF9). Please look at the event log on destination node for more details.

WARN  [RES] Virtual Machine Replication Broker <Hyper-V Replica Broker C10022014XXXX>: 'Hyper-V Replica Broker C10022014XXXX' failed to start the network listener for the Hyper-V Replica Broker resource: Unspecified error (0x80004005).

The node 'MachineName' does exist.

What might be the issue? How can I debug this?

CAU: One or more errors occurred while checking the status of Windows Firewall on the cluster nodes

Cluster with 2 hosts 2012 R2

Scheduled CAU fails with:

CAU run {4EFE116C-AB49-456D-8EED-F7EDC764DA49} on cluster Cluster1 failed. Error Message:One or more errors occurred while checking the status of Windows Firewall on the cluster nodes. Review the errors for more information on how to resolve the problems. Error Code:-2146233088 Stack:   at MS.Internal.ClusterAwareUpdating.Util.<CheckFirewallsAsync>d__3a.MoveNext()
--- End of stack trace from previous location where exception was thrown ---
   at System.Runtime.CompilerServices.TaskAwaiter.ThrowForNonSuccess(Task task)
   at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
   at Microsoft.ClusterAwareUpdating.Commands.InvokeCauRunCommand.<_ProcessCluster>d__78.MoveNext()
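
The error code in the message above is printed as a signed 32-bit decimal, which is awkward to search for. A minimal Python sketch for converting it to the usual hexadecimal form follows. The helper name `to_hresult_hex` is made up for illustration.

```python
def to_hresult_hex(signed_code):
    """Format a signed 32-bit error code as an unsigned hexadecimal HRESULT."""
    return "0x{:08X}".format(signed_code & 0xFFFFFFFF)


print(to_hresult_hex(-2146233088))
```

For this run the value comes out as 0x80131500, which is the generic .NET exception HRESULT (COR_E_EXCEPTION). It therefore only says that a managed exception was thrown and does not identify the underlying firewall check failure, so the per-node errors in the CAU run report are the more useful place to look.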

If I run CAU "Analyze Readiness", ALL tests come back as PASS.

If I run CAU by hand on the same hosts with NO change to the system (not even a reboot), it finishes OK.

Anybody any ideas?

Thanks

Seb

Windows Server Failover Cluster Techniques

We have two servers ready for deploying SQL 2012, and we need to make a cluster of these two nodes.

Each server has 900 GB of unallocated local disk space.

We don't have any kind of shared storage.

My questions are:

- How do we set up quorum?

- How do we use the two local disks, one on each server, so that data will be replicated?

- After making the cluster, how do we install SQL Server?

Creating a cluster question

I am trying to create a High Availability cluster on Server 2012 R2 that will run Remote Desktop Services, i.e. RemoteApps. The idea is that if one server goes down, the second one will take over. I have created a cluster with 2 nodes; the two nodes are virtual servers on two separate physical servers running Server 2012 R2 with Hyper-V. From what I can tell the cluster is set up properly (Server Manager > Local Server recognizes them as being a part of the cluster), but I'm unclear whether these two nodes are the High Availability cluster I require, or if I should have made the physical servers that host Hyper-V the nodes in the cluster. I would think that once these 2 nodes are in a cluster, if I make a change to one node (e.g. install a feature), the other node would be automatically updated to match, and should one node shut down, the other node would take over. That doesn't seem to be the case for me, which makes me think I shouldn't have used the two virtual servers as nodes.

Should I have created the cluster between the two physical servers instead?  Then created a virtual machine from that cluster?

My only other experience with clustering is with Synology, so I'm basing a lot of my assumptions on how Synology clusters its devices.

Any help is greatly appreciated.

Brendon

Failover Cluster Validation Report Error with IBM USB Remote NDIS Network device

We are setting up a Microsoft Windows Server 2008 R2 Failover Cluster on IBM X3850 X5 servers and get errors in the Failover Cluster Validation Report because the IBM USB Remote NDIS Network Device uses APIPA addresses and both servers are using the same APIPA address.
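
To illustrate the condition the validation report is complaining about, here is a small Python sketch that flags APIPA (169.254.0.0/16, link-local) addresses shared by more than one node. The input format, the function name `find_apipa_conflicts`, and the sample addresses are all made up for illustration, since the post does not give the real addresses.

```python
import ipaddress
from collections import defaultdict


def find_apipa_conflicts(adapters):
    """Return APIPA (link-local) addresses that appear on more than one node.

    `adapters` is a list of (node_name, ip_string) pairs, a hypothetical format.
    """
    by_address = defaultdict(set)
    for node, ip in adapters:
        addr = ipaddress.ip_address(ip)
        if addr.is_link_local:  # True for 169.254.0.0/16 on IPv4
            by_address[str(addr)].add(node)
    return {ip: sorted(nodes) for ip, nodes in by_address.items() if len(nodes) > 1}


# Hypothetical sample data, not taken from the servers in question.
sample = [
    ("node1", "169.254.95.120"),
    ("node2", "169.254.95.120"),
    ("node1", "10.0.0.11"),
    ("node2", "10.0.0.12"),
]
print(find_apipa_conflicts(sample))
```

Feeding it the adapter addresses collected from each node would show which addresses collide.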

How should I configure the server and OS for the failover cluster to be MS approved?

IBM doesn't recommend that I disable the network device, but is it a possible solution?

Hyper-V Cluster New Node Validation Network Test Failure

I have a single-node Hyper-V cluster to which I am trying to add a second node. Both servers are identical hardware and are both installed with Windows Server 2012 R2 Datacenter.

When running the tests to validate the cluster configuration to add a new node, I receive the following errors, which prevent me from adding the new node. This error is received in the "Validate Cluster Network Configuration" section of the tests.

An error occurred while executing the test.
There was an error initializing the network tests.

There was an error creating the server side agent (CPrepSrv).

Unable to cast COM object of type 'System.__ComObject' to interface type 'MS.Internal.ServerClusters.Validation.IClusterNetwork2'. This operation failed because the QueryInterface call on the COM component for the interface with IID '{2931C32C-F731-4C56-9FEB-3D5F1C5E72BF}' failed due to the following error: No such interface supported (Exception from HRESULT: 0x80004002 (E_NOINTERFACE)).

Any advice on what to troubleshoot would be greatly appreciated.

Win Server 2012 two node cluster, local "CLIUSR" issue

Hello,

I have a two node Windows Server 2012 STN Cluster with a few SQL instances installed inside it.  Recently in my security event log I see these errors on both nodes:

An attempt was made to reset an account's password.

Subject:
Security ID: SYSTEM
Account Name:<>$
Account Domain:<>
Logon ID: 0x3E7

Target Account:
Security ID: localmachinename\CLIUSR
Account Name: CLIUSR
Account Domain: localmachinename

==

When I look at the local account on both nodes, I see that the password is set to never expire and that it cannot be changed. I am quite confused, then, about how the above could happen. Any advice or ideas would be greatly appreciated.

Thank you


Cluster-Aware Updating Readiness "Errors"

I have a two-node Windows Server 2012 failover cluster.  The Windows firewall is disabled on both nodes.

When I log on to one of the nodes (bcs-vmhyperv2), and run the Cluster-Aware Updating tool to analyze the cluster readiness, I receive this result:

When I log on to the other node and run the tool, I received the same two errors.  The problem computer is always the local computer.

I know that PowerShell remoting and WINRM are enabled!  So, the "resolution" steps don't help.

Here's proof:

If I log on to a different Windows Server 2012 system (not one of the cluster nodes), and run the tool, I receive no errors:

In fact, I setup CAU and used it to apply the latest set of Windows Updates from the third computer.

Why can't I use it from the cluster nodes?  How do I fix it?


-Tony

Is CIFS Share in Netapp Filer supported as File Share Witness?

Hi All,

I'm currently setting up a Windows 2008 R2 cluster and I'm having a hard time making a CIFS share on a NetApp filer work as a File Share Witness. Is this a supported configuration?

Thank you!

adlpena

Failover High Availability recovery

When a node in a failover cluster fails:

How does it handle the recovery?

Are objects created by the failed node reused, if a single file system was being used, or are they recreated?

VMMS Hanging/crashing on 2008 r2 SP1 Hyper-V Cluster

I just wanted to put this out there in case someone else is having a similar issue.   

I have two Dell R620 servers as two Hyper-V Cluster nodes.

  • Connected to EqualLogic SAN
  • NIC Teaming enabled on Hyper-V Guest Network not shared with Host
  • Intel 10G 2P x520 Adapter
  • Intel Gigabit 4P x540/I350 rNDC
  • VMQ Enabled

It seems there is an issue with VMQ being enabled; this was causing the VMMS service to hang/crash.

The symptoms were that all of the VMs were accessible, but when performing an action such as rebooting or starting VM guests, they would get stuck in the starting or stopping state. The only fix was to hard reboot the server.

Disabling VMQ has resolved this.

I spoke with Microsoft; they said it's with the development team and they will be releasing a Windows update to resolve this soon.

When I get the KB number I will post it.

Can we add an SMB share to Cluster -> Storage -> Disks?

Hi,

I have a two-node cluster. I have an SSD on the storage server, and I have configured an SMB share on that server.

From the client side, is it possible to see my SMB share under Cluster -> Storage -> Disks from any of the nodes?

Thanks in advance


Thanks, Krishna
