Quantcast
Channel: High Availability (Clustering) forum
Viewing all 6672 articles
Browse latest View live

Access is denied messages in Win2012 R2 Failover Cluster validation report and CSV entering a paused state

$
0
0

Been having some issues with nodes basically dropping out of clusters config.
Error showing was

"Cluster Shared Volume 'Volume1' ('Data') has entered a paused state because of '(c000020c)'. All I/O will temporarily be queued until a path to the volume is reestablished."

All nodes (Poweredge 420) connected a Dell MD3200 shared SAS storage.

Nodes point to Virtual 2012 R2 DC's

Upon running validation with just two nodes, get the same errors over and over again.

Bemused!

----------------

List Software Updates
Description: List software updates that have been applied on each node.
An error occurred while executing the test.
An error occurred while getting information about the software updates installed on the nodes.

One or more errors occurred.

Creating an instance of the COM component with CLSID {4142DD5D-3472-4370-8641-DE7856431FB0} from the IClassFactory failed due to the following error: 80070005 Access is denied. (Exception from HRESULT: 0x80070005 (E_ACCESSDENIED)).


and

List Disks
Description: List all disks visible to one or more nodes. If a subset of disks is specified for validation, list only disks in the subset.
An error occurred while executing the test.
Storage cannot be validated at this time. Node 'zhyperv2.KISLNET.LOCAL' could not be initialized for validation testing. Possible causes for this are that another validation test is being run from another management client, or a previous validation test was unexpectedly terminated. If a previous validation test was unexpectedly terminated, the best corrective action is to restart the node and try again.

Access is denied

-----------

The event viewer on one of the hosts shows
-------------
Cluster node 'zhyperv2' lost communication with cluster node 'zhyperv1'.  Network communication was reestablished. This could be due to communication temporarily being blocked by a firewall or connection security policy update. If the problem persists and network communication are not reestablished, the cluster service on one or more nodes will stop.  If that happens, run the Validate a Configuration wizard to check your network configuration. Additionally, check for hardware or software errors related to the network adapters on this node, and check for failures in any other network components to which the node is connected such as hubs, switches, or bridges.

The Cluster service is shutting down because quorum was lost. This could be due to the loss of network connectivity between some or all nodes in the cluster, or a failover of the witness disk.
Run the Validate a Configuration wizard to check your network configuration. If the condition persists, check for hardware or software errors related to the network adapter. Also check for failures in any other network components to which the node is connected such as hubs, switches, or bridges.

Only other warning is because the 4 nic ports in each node server are teamed on one ip address split over two switches - I am not concernd about this and could if required split then pairs, I think this is a red herring????


how to remove two nodes from old cluster, and put them in new one, in another domain

$
0
0

Hi there!
I've two servers (Server1 and Server2), in cluster with settings like below:

domain name: london.local
Server1 -> OS windows server 2012
Server2 -> OS windows server 2012

and they are part of cluster with two nodes:
cluster.london.local

Now, i want to remove these two servers from cluster, remove from actual domain, and put them in a new domain name paris.local

After that, i want to create the cluster between two servers in domain paris.local

which is the best way to remove the nodes from actual cluster?

Regards!


Lasandro Lopez

Proper Procedure to run Validation on Hyper-V Cluster with CSV

$
0
0

I had an issue sometime back where I was running validations on my Server 2008 R2 (RTM) Hyper-V cluster with an EMC Fibre Channel SAN. While running validations, one of my CSV LUNs came back with a damaged MBR and all data was seemingly lost. A call into Microsoft got the MBR restored, and all was fine, but I was down for the majority of a day.

My question is simply, was this a fluke experience, or is there a proper procedure for validating a Hyper-V cluster. I am now running SP1 on all nodes, and I am preparing to join another node to my cluster. I would like to perform all validation tests before I proceed.


-Richard

"Automaitc" clustering of installed software

$
0
0

(Newbie) Two node cluster, active\passive.

My Manager is under the impression that if we install Failover Clustering (at the OS level) then anything installed on the active\passive nodes, such as Sql Server, will automatically become clustered.  I don't think that is correct but if we did that and the active node failed what would be "missing" from the passive node after it becomes active?

TIAA,

edm2


Live Migration failed - failed to delete configuration: The request is not supported. (0x80070032). Event ID 21502

$
0
0

We have a 3 node cluster attached to a SAN running.  All nodes are running Server 2012. We have 2 virtual machines that will no longer live or quick migrate.  When we try, we get the following error message.

Event ID: 21502

Live migration of 'Virtual Machine Library' failed.

Virtual machine migration operation for 'SRV-XXX' failed at migration source 'NODE1'. (Virtual machine ID 8CC600A0-5491-45B1-896E-E99BB85AA856)

'SRV-XXX failed to delete configuration: The request is not supported. (0x80070032). (Virtual machine ID 8CC600A0-5491-45B1-896E-E99BB85AA856)

We are not having this issue with any of our other 15 virtual machines.  I have searched the forums and have not found any articles with the same situation.

Bug or am I missing something? Server 2012 Cluster Aware Updating task is always one hour off from schedule in wizard

$
0
0

Using the wizard, I specify...

Then I look at the scheduled task and it's always one hour off.

This isn't a timezone issue because I'm setting it and looking at it on the same server and all our servers are in the same time zone anyway (USA Eastern).


random long delay before VM come up from other node after live migration in a failover cluster

$
0
0

Backgrounder: 3 nodes Windows 2008 R2 enterprise Edition failover cluster with CSV + hyper-v. Networks contains iscsi, host only, VM only, heartbeat + live migration.

Trouble: I have some windows and Linux based (CentOS) VMs. When I live migrate a Linux VM, it sometimes takes anywhere between 5 seconds to 60 seconds before it becomes accessible via network again. To be precise, I always continuously ping the VM from another VM while doing a live migration of a VM. That is how I noticed how long it took to become accessible again. However, after live migration, I can immediately use hyper-v manager or failover manager to access that VM. What have caused the delay? In theory, there should be no interruption. Could I misconfigure something? The cluster is otherwise healthy and has been working fine.

Also I do have NIC teaming configured at the host level for VM only traffic. This network is used by VMs only, and no cluster communication is allowed to use it. no sure if this would make a difference.

Please enlighten me!

Cheers,

Bo


want to learn C#

Question About 2 Node Cluster Setup Windows Server 2012 R2

$
0
0

We have 2 servers that have identical configuration. Each server has 64 GB RAM and we are running Windows Server 2012 R2 Datacenter Edition on each server. Each Server has Hyper-V role and several VMS. We have created DC1 at Server1 and DC2 at Server2. We have Exchange 2013 VMS (EdgeTransport1, MX1) and (EdgeTransport2, MX2) on corresponding servers. We also have SQL Server VM at one of the servers.

We want to configure these two physical servers as nodes of a new cluster. From my knowledge we don't need to have Active Directory to configure these two servers with Failover Cluster. However, the resources I have read, says we won't be able to validate cluster setup.

I want to extend hardware and infrastructure setup so that we can have highly available system.

Can I specify the domain that is hosted by VMS named DC1 and DC2 for Cluster setup?

Because nodes of cluster will be powered prior to VMS, would there be any issue?

If this is an unsupported configuration, then do I really need to buy an additonal server and configure it as Domain Controller for environment?

Also, we have partnership agreement with Microsoft, so we would like to implement System Center products as well. What would be an ideal configuration/topology to achieve our goal for backup/monitoring and centralized management.

Thanks,

Ismet



Changing Storage for Cluster Resources

$
0
0

We have a Windows Server 2008 R2 Cluster set up that hosts SQL Services. We need to move the backend storage for the SQL Data/Log Drives to another storage. Can I -

  1. Shut down SQL (Drives:F,G) and Cluster services
  2. Make new storage available with NEW Drive letters H, I
  3. Copy all files from Drives F,G to Drives H,I
  4. Rename old Drives F,G to R,S (essentially discard)
  5. Rename new Drives H,I to F,G (original drive names)
  6. Restart SQL and Cluster services

In other words, do SQL and the Cluster service operate purely on the Drive Letter where this might work? Or does it use some identifier behind the scenes which would cause the cluster to break inspite of the new Drives having the same Drive Letter?

If it will break, is there another way to do this?

Thanks in advance,
Jake.

2012 Guest Clustered SQL instance - Upgrade platform

$
0
0

Currently I have several working 2012 Guest clusters for various functions.  I'm now looking at upgrading my "SQL Guest Cluster" which has 2 nodes (VMs) at Server 2012, 2 SQL 2012 instances, and is connected to shared storage through Virtual Fiber Channel.

I know I can't have mixed OS levels in a Failover Cluster, which is what is making this difficult.

What would be the best method for upgrading the 2 nodes to Server 2012 r2 (ie no-minimum downtime)?

For a file share guest cluster I was able to evict a node, remove failover role, upgrade node, create new cluster ("name_r2_cluster"), migrate File Server Role to new cluster using cluster tool...however SQL is not supported for the copy function.

Thanks in advance!

Loopback adapters and DSR: DAG Cluster node--which is not Cluster Host--crashes when another node restarts

$
0
0

An all-hardware Exchange 2010 SP3 UR4 DAG cluster is having an issue when the Microsoft Loopback adapter is installed (from Device Manager...Add Legacy Hardware) to support DSR operations with hardware load balancer (HLB).

  • The HLB provides HA endpoint for RPC Client Access, SMTP, etc. DSR is required to preserve source IP--on which      Exchange receive connectors that filter on source IP for security depend.
  • It is server DAG, with 3 x production severs at the datacenter and 2 x DAG DR servers located in a DR site.
  • Only the 3 x production servers at the main site have the loopback adapter installed.
  • The loopback-DSR-specific settings like 'weakhostrecive, etc' are in effect.

The problem only involves the 3 servers in the DAG with loopback adapters.

The issue is that when a DAG member restarts, sometimes it will cause the online production cluster node which isnot the Cluster Host Server to fail. Consider:

  • DAGNode1, Loopback enabled, Healthy, Is Cluster Host Server
  • DAGNode2, Loopback enabled, Healthy
  • DAGNode3, Loopback enabled, is Restarted

In this scenario, the cluster service on DAGNode2 will experience a loss of network connectivity when DAGNode3 rejoins the cluster (DAGNode2 reports cluster failure on all other nodes) and shortly afterwards the Cluster Service on DAGNode2 will terminate. FailoverClustering 1572 is seen on DAGNode2:

Node 'DAGNode2' failed to join the cluster because it could not send and receive failure detection network messages with other cluster nodes. Please run the Validate a Configuration wizard to ensure network settings. Also verify the Windows Firewall 'Failover Clusters' rules.

Interestingly, if you disable the Loopback on DAGNode3, DAGNode2 will immediately rejoin the cluster! Re-enable the Loopback on DAGNode3 and DAGNode2 immediately fails again! With some more server restarts possibly, you get a stable cluster again with Loopback enabled on all production nodes. The status of the loopback (enabled or not) on the Cluster Host does not impact this issue.

As I mentioned, it is only some restarts that this occurs, usually there is no problem. Also note the Loopback network/adapters do not appear in Cluster Manager and are not listed as cluster networks with cluster.exe. Cluster Validation Wizard passes everything except noting that every node has a duplicate IP on an installed adapter.

Looking for others with experience that have combined DSR-based HLB with CAS/Hub/MBX DAG Cluster on same Exchange computers and were able to use reliably.

There is an unanswered thread from 2010 on this topic:

http://social.technet.microsoft.com/Forums/windowsserver/en-US/7616b0e5-6fb6-4be7-a859-14baa2e9b925/cluster-network-is-partitioned-due-to-loopback-adapter?forum=winserverClustering

Some questions / any answers are very welcome!

  • Can I add the Loopback adapter to the cluster configuration so that I can use Cluster.exe to ignore the loopback adapter?
  • Can I prevent other cluster nodes from seeing the loopback adapters in the other nodes? Is there an ‘ignore partner adapter’ setting?

Thank you!


John Joyner MVP-SC-CDM

P.S. I add this information 3/1/2014:

This link suggests that if you allow the cluster network to partition it will discover the loopback adapters and they will appear in cluster manager: (Did this by enabled IPV6 on Loopback, when done this made Loopback Network appear in Cluster Manager. Then used Cluster.exe to set IgnoreNetwork=$true on the Loopback network.) Result: No change, still caused cluster communication outage when Loopback enabled on third production node that is not the Cluster Group Host.

http://social.technet.microsoft.com/Forums/windowsserver/en-US/311f7763-9f72-4dfe-bb35-3fd1a1dc567c/adding-additional-network-adapters-to-a-cluster?forum=winserverClustering

Developed: A workaround!

1. Just before restarting a node, after drain stop in NS, and after running StartDAGServerMaintenance.PS1 (which pauses the node in Cluster Manager) disable the Loopback Adapter so that when the computer restarts, Loopback is disabled.

2. After node restarts and rejoins cluster in Paused status, and after running StopDAGServerMaintenance.PS1, issue this command to move the Cluster Group host to the computer that was restarted and has the Loopback disabled.

cluster <clustername> group "Cluster Group" /moveto:<nodename>

3. Then safely enable the Loopback on the computer that was restarted and is now the Cluster Group host.

4. Then take the computer out of drain stop in the NS.

This of course only applies to controlled restarts.

In the event of unexpected server crash and recoveries, there is nothing stopping this from happening when the crashed server restarts. Still need a real fix! With knowledge of how to defuse the situation when it happens (disable loopback on the production node that is not the Cluster Group host), it clears the condition immediately. You than then fix it by steps 2 and 3 in the workaround.

How do I find the MAC address of a CLIENT ACCESS POINT created from the FILE SERVICES ROLE

$
0
0

I have several Client Access Points created within the clustered File Services Role.  The only way I seem to be able to determine the MAC address of each of these, is by visiting the DHCP server.

Does anyone know if there is a way of reporting on this from the server (active node) itself?  I have tried ipconfig all, checked the properties of the CAP in the FCS console etc.

Many thanks.


Kathleen Hayhurst Senior IT Support Analyst

Upgrade from Server 2012 cluster\hyper-v to Server 2012 R2 cluster\hyper-v

$
0
0
Are there any white papers available yet for upgrading from Server 2012 cluster\hyper-v to Server 2012 R2 cluster\hyper-v?

Rob Nunley

NIC teaming and Hyper-V switch recommendations in a cluster

$
0
0

HI,

We’ve recently purchased four HP Gen 8 servers with a total of ten NICS to be used in a Hyper-V 2012 R2 Cluster

These will be connecting to ISCSI storage so I’ll use two of the NICs for the ISCSI storage connection.

I’m then deciding between to options.

 

1. Create one NIC team, one Extensible switch and create VNics for Management, Live Migration and CSV\Cluster - QOS to manage all this traffic. Then connect my VMs to the same switch.

2. Create two NIC teams, four adapters in each.  Use one team just for Management, Live Migration and CSV\Cluster VNics - QOS to manage all this traffic. Then the other team will be dedicated just for my VMs.

Is there any benefit to isolating the VMs on their own switch?

Would having two teams allow more flexibility with the teaming configurations I could use, such as using Switch Independent\Hyper-V Port mode for the VM team? (I do need to read up on the teaming modes a little more)

Thanks,

Will network connectivity loss trigger a VM failover?

$
0
0

Does loss of network connectivity for a VM (either physical NIC failure, or even just disconnected cable) trigger a failover to another host in a HyperV Cluster?


Storage connectivity lost

$
0
0
Hello everyone,
I have the following question: frequently on a cluster windows server 2012 Hyper-V the disks present in storage are disconnected. Sometimes thedisconnection is so long that the virtual machines are turned off. The message that I read in the logs is: "Cluster Shared Volume 'Volume1' is no longer available on this node Because Of 'STATUS_CONNECTION_DISCONNECTED (c000020c).' All I / O will temporarily be queued until a path to the volume is reestablished. "
The connection is made via iSCSI adapters. I checked all iscsi configurations on the servers that are part of the cluster and everything seems ok. I don't know what else to check ...

How to delete the default "cluster" folder once the Quorum drive has been re-assigned?

$
0
0

Installed a Windows Server 2008 R2 Failover Cluster.  By default, the Failover Cluster Manager assigned the X:\ drive as the Quorum drive.  Using Failover Cluster Manager, I was able to move\reassign the Quorum drive as, let's say, the N: drive.  However this action does not delete the "Cluster" folder that was initially created on the X: drive.

What is the correct procedure to be rid of the "Cluster" folder that was created on the X: drive when the cluster was initially created without any impact to the existing failover cluster?

Windows Clustering Networks question...

$
0
0

Hi all;

This is my scenario:

I have installed Windows Server 2012 on two servers. Then enabled Windows Clustering feature on it. The shared storage is based on Fibre Channel technology. Each server has 4 NICs and I have splitted them as followis:

  • One NIC for remote mangement of the servers with the range of 172.16.105.0/24.
  • One NIC dedicated for heartbeat communication.
  • Two NICs has been bundled together with NIC Teaming feature of the operating system.

But as you see in the following figure there are 4 Cluster Network links:

Is it normal?

Thanks


Please VOTE as HELPFUL if the post helps you and remember to click “Mark as Answer” on the post that helps you, and to click “Unmark as Answer” if a marked post does not actually answer your question. This can be beneficial to other community members reading the thread.

Error message when accessing Cluster Manager (Windows Server 2012-based) console from a Windows 8 system

$
0
0

Hi all;

Please look at the following figures:

Any ideas?

Thanks


Please VOTE as HELPFUL if the post helps you and remember to click “Mark as Answer” on the post that helps you, and to click “Unmark as Answer” if a marked post does not actually answer your question. This can be beneficial to other community members reading the thread.

Unable to move SQL instance to another node

$
0
0

Hi,

I m unable to move SQL instance to Node B from A. When i checked the cluster.log, this is the only error i see just before it failed. Can some one help me fix this.

- this was working earlier this is not a new cluster, no config changes are made.

000015fc.000035d4::2014/06/04-07:27:05.204 ERR   [RES] SQL Server <SQL Server (SQLSHR)>: [sqsrvres] ODBC

sqldriverconnect failed
000015fc.000035d4::2014/06/04-07:27:05.204 ERR   [RES] SQL Server <SQL Server (SQLSHR)>: [sqsrvres]

checkODBCConnectError: sqlstate = 08001; native error = ffffffff; message = [Microsoft][SQL Server Native Client

10.0]SQL Server Network Interfaces: Error Locating Server/Instance Specified [xFFFFFFFF].
000015fc.000035d4::2014/06/04-07:27:05.204 ERR   [RES] SQL Server <SQL Server (SQLSHR)>: [sqsrvres] ODBC

sqldriverconnect failed
000015fc.000035d4::2014/06/04-07:27:05.204 ERR   [RES] SQL Server <SQL Server (SQLSHR)>: [sqsrvres]

checkODBCConnectError: sqlstate = HYT00; native error = 0; message = [Microsoft][SQL Server Native Client 10.0]

Login timeout expired
000015fc.000035d4::2014/06/04-07:27:05.204 ERR   [RES] SQL Server <SQL Server (SQLSHR)>: [sqsrvres] ODBC

sqldriverconnect failed
000015fc.000035d4::2014/06/04-07:27:05.204 ERR   [RES] SQL Server <SQL Server (SQLSHR)>: [sqsrvres]

checkODBCConnectError: sqlstate = 08001; native error = ffffffff; message = [Microsoft][SQL Server Native Client

10.0]A network-related or instance-specific error has occurred while establishing a connection to SQL Server.

Server is not found or not accessible. Check if instance name is correct and if SQL Server is configured to allow

remote connections. For more information see SQL Server Books Online.

After a lot of google, i ensured that SQL browser service is running, however i m unable to failover. Please help!

Viewing all 6672 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>