Channel: High Availability (Clustering) forum
Viewing all 6672 articles

Need suggestions on clustering books

Hi Team,

Can anyone recommend good books for learning Microsoft Windows clustering?

Thanks,

S.V.Ramana.


Ramana rao



Different but compatible version of the cluster service software

We have two physical servers in an FCI and a third server, a VM in a different datacenter, for DR with SQL Always On configured. All of them run the same version of Windows. We are getting the alert below:

Node 'N1' which is physical established a communication session with node 'N3' which is VM and detected that it is running a different but compatible version of the cluster service software. It is recommended that the same version of the cluster service software be installed on all nodes in the cluster.

Can we ignore this safely?

Server 2012 Cluster nodes hang and VMs lock up, memory leaks and critical stops on both nodes.

Last night my two-node cluster went down for no apparent reason.  All VMs (4) were down even though the cluster manager said they were running.  The cluster shared volume on my SAN was not accessible through Windows Explorer but the Dell mpio software showed it was connected and the SAN itself showed a connection and did not have any problem.  It took me five hours of struggle to get the cluster running again.  I had to remotely restart each node several times from another server using the command line because the RDP session would stop responding due to Explorer locking up.  I ended up removing the antivirus software from each node but that was in desperation; I don't know if that was the problem or not.  It finally started to work again when I manually brought the cluster IP back online, manually moved all resources to node1 and then did a pause and drain of node2 and restarted node2.  This error shows up twice in the Application log of both nodes:

Possible Memory Leak. Application (C:\Windows\Cluster\rhs.exe -key SYSTEM\CurrentControlSet\Services\ClusSvc\Parameters\Rhs\0428d6b3-5c3b-4757-bc31-70379129ad89 -parentPid 3060 -initEvent 1dbde958-779b-4cd7-8daa-7c9299d0303c -replyEndpoint OLEAA17D0EF8BDFFAD1F4F33871C878) (PID: 4520) has passed a non-NULL pointer to RPC for an [out] parameter marked [allocate(all_nodes)]. [allocate(all_nodes)] parameters are always reallocated; if the original pointer contained the address of valid memory, that memory will be leaked. The call originated on the interface with UUID ({4b324fc8-1670-01d3-1278-5a47bf6ee188}), Method number (64). User Action: Contact your application vendor for an updated version of the application.

There are also two critical stops logged in the Dell OpenManage logs on each node.

The symptoms are very similar to those described in this Microsoft KB article for Server 2008 R2:

http://support.microsoft.com/kb/2798093

Both nodes are fully updated with hotfix 2870270.

Can anyone shed some light on this?  What went wrong and how do I prevent it from happening again?

Can anyone share?

Hi All,

We have a SQL 2005 cluster. Can anyone share their NetBIOS settings for the public NIC?

Thank you.

Failed fail-over when pulling network cable

Hello

Setup:

2-node cluster with physical servers. 3 network teams: 1 heartbeat, 1 LAN and 1 SAN. The teams use the Windows Server 2012 teaming software.

Problem:

All failover tests work fine, except "pulling the network cable" on the LAN network. By pulling the cable I mean disabling the NICs that make up the LAN team. That triggers the failover from that server, but the IP address in the cluster fails with the following error messages:

"The Cluster service failed to bring clustered service or application 'Cluster Group' completely online or offline. One or more resources may be in a failed state. This may impact the availability of the clustered service or application."

"Cluster resource 'Cluster IP Address' of type 'IP Address' in clustered role 'Cluster Group' failed."

I tried finding relevant information online, but nothing seems to clearly solve the issue. Cluster validation completes without any errors at all. The cluster can be brought online manually, but it does not come online by itself.

Are there any views on how the cluster is supposed to handle sudden network losses? Any suggestions?

Regards

Alex

Two nodes cluster creation failed


Hello, MS specialists. I'm having an issue creating a cluster. I have researched it for over a week but cannot resolve it, and it is really troubling me.

Environment:
Three guest machines in Hyper-V:
DC
cluster1
cluster2

The host machine runs Windows Server 2008 R2; the three guest machines also run Windows Server 2008 R2.

First of all, validation passes without any errors.
When I use the wizard to create the cluster, it stops at "Forming cluster "clusterdemo"" for a long time and then fails with this error:

An error occurred while creating the cluster.
An error occurred creating cluster "clusterdemo".
This operation returned because the timeout period expired.

Meanwhile, I found these messages in Event Viewer:

Event 1570, FailoverClustering
Node 'cluster1' failed to establish a communication session while joining the cluster. This was due to an authentication failure. Please verify that the nodes are running compatible versions of the cluster service software.

Event 1280, FailoverClustering
Sponsor tried to create security context using package='Kerberos' with context requirment='165910' and timeout='30000'

Event 1281, FailoverClustering
Joiner tried to create security context using package='Kerberos' with context requirement='83990' and timeout='30000' for the target='cluster2'

I have read http://technet.microsoft.com/en-us/library/dd301029(WS.10).aspx it's about Event ID 1570 — Node Membership in Cluster, but it's no help to me

Since creating the two-node cluster failed, I created a one-node cluster with cluster1 as the node. It was created successfully.

When I use the wizard to add the other node, cluster2, to the cluster, it fails with this error:

The server "cluster2" could not be added to the cluster. An error occurred while adding node "cluster2" to cluster "clusterdemo". The cluster node is not reachable.

Meanwhile, I found this error message in Event Viewer:
Event 10009, DistributedCOM
DCOM was unable to communicate with the computer cluster2 using any of the configured protocols

I have read this document http://technet.microsoft.com/en-us/library/dd337741(WS.10).aspx it's about Event ID 10009 — COM Remote Service Availability
I don't think it helps me either.

So I have finally hit a dead end. Could anyone help me or offer some suggestions?
Looking forward to your replies. Thanks.

How to create a local, non-clustered storage pool

Hello,

I have set up a two-node Failover Cluster, with a shared SAS DAS. So far so good.

One of the nodes also has internal disks that I wish to use for system backups.

This storage pool should not be clustered, as the disks cannot be seen from the other node. The trouble is that as soon as I create the pool it gets added to the cluster (in failed state).

In fact, the "Storage Pools" window in the server manager will only show me the "clustered storage spaces", with my internal disks in the Primordial pool.

Get-StorageSubSystem shows me both subsystems (Clustered Storage Space on ... + Storage Spaces on node-1), but creating a storage pool on the "local" subsystem fails.

How can I create a local, non-clustered storage pool on internal disks?

Cheers

alex

SQL Server Failover Cluster & SAN Mirroring

Hi,

We have set up a SQL Server failover cluster (Windows Server 2012 Standard Edition). The 2 SQL Servers have access to 2 SAN disks (2 x HP P2000 G3).

We can see the 2 SANs as 2 drives in the Windows cluster without a problem. We now need to set up SAN mirroring so that if 1 SAN unit fails, SQL Server can retrieve the needed data from the second SAN unit.

We've realized that it's possible to mirror the 2 SAN disks from Windows (by going to Computer Management and mirroring the disks). We tested this, and the mirrored volume seems to be accessible from SQL Server A and SQL Server B, as well as the cluster disks. Is setting up mirrored SAN disks from Computer Management recommended for this purpose? We are open to any other suggestions.

Thanks


Failover Cluster & DFS on NAS SRV2012 R2

Dear all,

I've been searching for a NAS solution on which to configure Failover Clustering and DFS. There are too many scenarios and definitions, and I need a best practice.

In short:

I have 2 HP servers, each with the following.

HP 8G

  1. Domain Controller (Physically)
  2. File Server (Hyper-v)
  3. Exchange Server (Hyper-v)

---------------

HP G6

  1. Additional Domain Controller (Physically)
  2. Additional File Server (Hyper-v)
  3. Additional Exchange Server (Hyper-v)

I believe that I can configure the failover clusters on the servers themselves?

But I need the file server's DFS to be placed on the NAS, and I'm not sure about NAS compatibility with 2012 R2 (e.g. WD DX4000, QNAP, Dell), or whether this is the right design. Can anyone help me?

Asking for your expertise

Windows 2008 R2 and SQL 2005 clustering.
When I run cluster validation, I get the following warning: "The RegisterAllProviderIP property for network name 'Name: winclustername' is set to 1 For the current cluster configuration this value should be set to 0." The server has two NICs: one public and one private (heartbeat).
Should this be a concern?

Also, if I run ipconfig /all, I see a Microsoft Failover Cluster Virtual Adapter in addition to the public and private NICs. It has a 169.254.X.X private address. Is this by design?

Thank you.

10GB connections

Hi,

Can a 10GB SFP be connected directly between 2 servers rather than via a switch?

Thanks James.

Huge single storage pool for lots of VMs, or separate storage pools for each VM

Background

Two identical 2008 R2 servers for running VMs with Failover Cluster (Hyper-V)
One IOmega NAS (old) with 15 VMs sharing one large (2 TB) clustered storage pool
One VNXe NAS (new), nothing set up yet

I am getting ready to start exporting all of my VMs from our old SAN to our new SAN.

Is it better to create one huge clustered storage pool that will be shared between all the VMs, or would it be better to create separate smaller clustered storage spaces for each VM?

Thanks,
Brian

Windows Server 2012 two-node cluster, local "CLIUSR" issue

Hello,

I have a two node Windows Server 2012 STN Cluster with a few SQL instances installed inside it.  Recently in my security event log I see these errors on both nodes:

An attempt was made to reset an account's password.

Subject:
Security ID: SYSTEM
Account Name:<>$
Account Domain:<>
Logon ID: 0x3E7

Target Account:
Security ID: localmachinename\CLIUSR
Account Name: CLIUSR
Account Domain: localmachinename

==

When I look at the local account on both nodes, I see that the password is set to never expire and that it cannot be reset. So I am quite confused about how the above could happen. Any advice or ideas would be greatly appreciated.

Thank you

Cluster network name resource failed registration (Event ID 1196)

I recently implemented Cluster-Aware Updating on my Windows 2012 cluster.  During the setup of the role I selected the option to "Add the CAU clustered role with self-updating mode enabled, to this cluster."  The wizard created a computer object "CAUHyper4gf" in AD for this role.  Since that time, roughly every 15 minutes Event ID 1196 is logged with the following error:

"Cluster network name resource 'CAUHyper4gf' failed registration of one or more associated DNS name(s) for the following reason:
This operation returned because the timeout period expired.
.

Ensure that the network adapters associated with dependent IP address resources are configured with at least one accessible DNS server."

The Computer Object is listed in DNS and responds to pings.  I haven't been able to find any articles addressing this specific issue.  I'm not sure how to correct this issue.  What can be done to eliminate these errors?


Getting SMB Witness Client Errors in Eventlog on Witness Disk Clusters

Hi there.

I saw this on my third Windows Server 2012 Hyper-V Failover Cluster.

There are a lot of SMB-Witness Client Error Messages in the eventlogs of all nodes.

The problem is that these are Failover Clusters without SMB Witness Shares.

They have the "classic" Witness Disk attached. Quorum Configuration is "Node and Disk Majority (Witness Disk)"

I never use Witness Shares!

The OS on the nodes is Windows Server 2012. All current Windows updates are installed, and I've also applied the recommended cluster and Hyper-V hotfixes from the TechNet articles to get the nodes as up to date as possible.

Thanks in advance

Olaf 


Regards
Olaf


FailoverCount is not getting reset for the quorum resource in Windows Server 2012 R2 failover clusters

Hi,

I have a two-node failover cluster on Windows Server 2012 R2 with a third-party resource as quorum, with type Node and Disk Majority. When the quorum resource faults, FOC does not fail over "Cluster Group" to the other cluster node. The following lines are seen in the cluster log.

Here is the cluster log from the failed node.

000008bc.000014d8::2013/12/06-11:45:49.591 INFO  [RCM] rcm::RcmGroup::Failover: (Cluster Group)

000008bc.000014d8::2013/12/06-11:45:49.592 WARN  [RCM] Not failing over group Cluster Group, failoverCount 2, failoverThresholdSetting 4294967295, lastFailover 2013/12/06-03:39:54.190

000008bc.000014d8::2013/12/06-11:45:49.592 INFO  [RCM] Will retry online from long delay restart of quoDG in 3600000 milliseconds.

Quorum resource failover policy’s Maximum failover count is set to one.

000008bc.000014d8::2013/12/06-11:45:49.591 INFO  [RCM] resource quoDG: failure count:1, restartAction:2 persistentState:1.

Is there a way to reset this FailoverCount? When does FOC increment and reset the failover count for a resource?
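For anyone comparing several such entries, here is a minimal Python sketch that pulls the counters out of the WARN entry above. The sample line is the one from this post with its spacing normalized; the function name `parse_failover_line` and the regular expression are my own assumptions, not part of any cluster tooling. The only fact it asserts about the threshold is arithmetic: 4294967295 equals 0xFFFFFFFF, the largest unsigned 32-bit value. Whether the cluster service treats that value as a "use the default" sentinel is not confirmed by the log itself.

```python
import re
from datetime import datetime

# Sample WARN entry from the post, joined onto one line (spacing normalized).
LINE = (
    "000008bc.000014d8::2013/12/06-11:45:49.592 WARN  [RCM] Not failing over group "
    "Cluster Group, failoverCount 2, failoverThresholdSetting 4294967295, "
    "lastFailover 2013/12/06-03:39:54.190"
)

# Hypothetical pattern; \s* tolerates entries whose spaces were lost in copy/paste.
PATTERN = re.compile(
    r"::(?P<logged>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})"
    r".*?failoverCount\s*(?P<count>\d+),"
    r"\s*failoverThresholdSetting\s*(?P<threshold>\d+),"
    r"\s*lastFailover\s*(?P<last>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})"
)

STAMP = "%Y/%m/%d-%H:%M:%S.%f"  # timestamp layout used in the log excerpt


def parse_failover_line(line):
    """Return the failover counters from one log line, or None if it does not match."""
    match = PATTERN.search(line)
    if match is None:
        return None
    logged = datetime.strptime(match["logged"], STAMP)
    last = datetime.strptime(match["last"], STAMP)
    threshold = int(match["threshold"])
    return {
        "failover_count": int(match["count"]),
        "threshold": threshold,
        # True when the setting is the maximum unsigned 32-bit value.
        "threshold_is_uint32_max": threshold == 0xFFFFFFFF,
        # Time elapsed between the last recorded failover and this entry.
        "since_last_failover": logged - last,
    }


info = parse_failover_line(LINE)
print(info)
```

Running this over every "Not failing over group" entry in a cluster log would show whether failoverCount ever drops back, and how long after lastFailover that happens.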

Thanks in advance

Rakesh


Rakesh Agrawal

2012 Cluster Network Question

I have a 3 node cluster and I am experiencing some strange network issues. 

I have 4 10G adapters in each node: 2 teamed for normal network traffic using LACP and 2 for iSCSI using MPIO.

The cluster throws a ton of errors if I select "do not allow cluster network communication on this network" on the iSCSI network.  I enabled cluster communication on it a while back, because the guests were failing over every 5 minutes.

I would think that 2 10G NICs would provide enough throughput for the heartbeat, yet I suspect the heartbeat is what causes this.

Any ideas?

Thank you

Quorum Disk on multiple hosts in Cluster

Good Day Everyone,

I have inherited a Hyper-V cluster and it has some major errors on "Validate Configuration". I am trying to take care of the quorum disk specifically at this point though and I need some clarification. My cluster has 6 hosts with 3 iSCSI volumes mapped to them. 1 volume is Quorum, the other 2 are Cluster Shared Volumes.

I have questions regarding the setup of the volumes that are mapped to the individual Hyper-V hosts.

1. I only see 1 host with the Quorum volume showing as an attached disk. The rest of the servers have the volume mapped but they are not initialized so they do not show up in the OS as a drive/storage. I was under the assumption that all servers in the cluster had to have the Quorum drive at least mapped even if it is configured as "Node Majority". The one server that is the witness does have this drive mapped/initialized.

2. I ran "Configure Cluster Quorum Wizard" but I do not find "Disk witness in Quorum" under storage. I do see the Quorum disk volume but it shows up as Available Storage.

Any help or info would be very helpful.

Antony

Microsoft Cluster error

Hi,

I am getting the following error in the event logs of a Windows 2008 R2 cluster with SQL 2008. Can anybody help me resolve it?

Log Name:      System
Source:        Microsoft-Windows-FailoverClustering
Date:          9/26/2013 7:25:16 AM
Event ID:      1207
Task Category: Network Name Resource
Level:         Error
Keywords:      
User:          SYSTEM
Computer:      NODEDB1.NODE.COM
Description:
Cluster network name resource 'SQL Network Name (SQLNODE)' cannot be brought online. The computer object associated with the resource could not be updated in domain 'NODE.COM' for the following reason:
Unable to update password for computer account.

The text for the associated error code is: Access is denied.


The cluster identity 'NODECLUSTER$' may lack permissions required to update the object. Please work with your domain administrator to ensure that the cluster identity can update computer objects in the domain.
Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="Microsoft-Windows-FailoverClustering" Guid="{BAF908EA-3421-4CA9-9B84-6689B8C6F85F}" />
    <EventID>1207</EventID>
    <Version>0</Version>
    <Level>2</Level>
    <Task>19</Task>
    <Opcode>0</Opcode>
    <Keywords>0x8000000000000000</Keywords>
    <TimeCreated SystemTime="2013-09-26T01:55:16.483346600Z" />
    <EventRecordID>18934</EventRecordID>
    <Correlation />
    <Execution ProcessID="4068" ThreadID="6400" />
    <Channel>System</Channel>
    <Computer>NODEDB1.NODE.COM</Computer>
    <Security UserID="S-1-5-18" />
  </System>
  <EventData>
    <Data Name="ResourceName">SQL Network Name (SQLNODE)</Data>
    <Data Name="DomainName">NODE.COM</Data>
    <Data Name="FailureString">Unable to update password for computer account</Data>
    <Data Name="Status">Access is denied.
</Data>
    <Data Name="ClusterIdentity">NODECLUSTER$</Data>
    <Data Name="BinaryParameterLength">4</Data>
    <Data Name="BinaryData">05000000</Data>
  </EventData>
</Event>
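For readers who want to pull the relevant fields out of an exported event like the one above, here is a minimal Python sketch using only the standard library. It embeds a trimmed copy of the Event XML from this post; the function name `summarize_event` and the dictionary layout are my own assumptions. It also compares the UTC `SystemTime` with the local time shown in the event header (9/26/2013 7:25:16 AM) to work out the server's UTC offset, which can help when lining this event up with domain controller logs.

```python
import xml.etree.ElementTree as ET
from datetime import datetime

# Namespace declared on the <Event> element in the post.
NS = {"e": "http://schemas.microsoft.com/win/2004/08/events/event"}

# Trimmed copy of the Event XML quoted above.
EVENT_XML = """<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <EventID>1207</EventID>
    <TimeCreated SystemTime="2013-09-26T01:55:16.483346600Z" />
    <Computer>NODEDB1.NODE.COM</Computer>
  </System>
  <EventData>
    <Data Name="ResourceName">SQL Network Name (SQLNODE)</Data>
    <Data Name="DomainName">NODE.COM</Data>
    <Data Name="FailureString">Unable to update password for computer account</Data>
    <Data Name="Status">Access is denied.
</Data>
    <Data Name="ClusterIdentity">NODECLUSTER$</Data>
  </EventData>
</Event>"""


def summarize_event(xml_text):
    """Extract the event ID, UTC creation time, and named data fields."""
    root = ET.fromstring(xml_text)
    system = root.find("e:System", NS)
    stamp = system.find("e:TimeCreated", NS).get("SystemTime")
    # SystemTime has more fractional digits than strptime accepts; keep whole seconds.
    created_utc = datetime.strptime(stamp.rstrip("Z").split(".")[0], "%Y-%m-%dT%H:%M:%S")
    data = {
        item.get("Name"): (item.text or "").strip()
        for item in root.iterfind("e:EventData/e:Data", NS)
    }
    return {
        "event_id": int(system.find("e:EventID", NS).text),
        "created_utc": created_utc,
        "data": data,
    }


summary = summarize_event(EVENT_XML)
# Local time as displayed in the event header of the post.
shown_local = datetime(2013, 9, 26, 7, 25, 16)
utc_offset = shown_local - summary["created_utc"]
print(summary["event_id"], summary["data"]["ClusterIdentity"], utc_offset)
```

The `ClusterIdentity` field names the account that was denied access, which is the account whose permissions on the computer object would need to be checked.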

 

Cluster Aware Updating - Failed to enter maintenance mode.

Hello everyone,

I am trying to run Cluster-Aware Updating and I thought everything went well :-) But that was just on the first of 2 nodes. When the second node of the cluster tries to enter maintenance mode I get an error: "Node XYZ failed to enter maintenance mode. No retries left." I went through the logs but found only: "InvokeCauRunOperation:Node drain failed on node XYZ. Additional: System.Management.Automation.RemoteException: Node drain failed on node XYZ."

All VMs are clustered.

Analyze cluster updating readiness - successful

I would appreciate any advice!

Thank you very much!

Roman
