Quantcast
Channel: High Availability (Clustering) forum
Viewing all 6672 articles
Browse latest View live

"Access denied/incorrect function" on CSV from hyper-v clustered Windows Server 2012 nodes

$
0
0

Hello,

I am running into a problem with CSV access on a 2 nodes Hyper-V cluster (WS2012). We are in the process of upraging the 2008R2 2 nodes cluster already in production and fully functionnal for the last 3 years.

Here are the new specs : 2 HP DL380p G8 Windows Server 2012 Datacenter (HP ROK installation) connected via iSCSI (MPIO) to the same Netapp FAS2020 (Ontap 7.3.3). 2 LUNs (980 GB each) : one for CSV1 and the other for CSV2. A LUN for the quorum. A basic design. Windows LBFO NIC teaming, dedicated NICs : LAN, CSV/heartbeat, LiveMigration, VM LAN... A design fully functionnal for several years at dozen of customers.

On the new WS2012, everything is validated OK in the cluster validation process. I can create the cluster as expected. The disks are added correctly. I can dynamically change the owner and coordinator node without problem. Well, everything works fine unless I add the disks to the Cluster shared volume. I can correctly switch the CSV owner though. I got a c:\cluster\volume1, as expected...

But, if I tried to access inside c:\cluster\volume1 (logged as domain admin, of course) from the node owner, I get "access denied". If I tried to acces the same CSV from the non owner node, I get "incorrect function" error.

I destroyed the cluster several times and recreated it but no luck !

The cluster is functioning without the CSV feature activated !?! But it is not an option.

I really need a helping hand on that.

BRGDS


Scale Out File Server Clustering - Storage Failure Issue

$
0
0

Hi,

I am trying to build a SOFS cluster which is backed by two iSCSI NAS devices, but I seem to be having issues if one of the NAS devices fails.

here is what I have (all systems are based on SERVER 2012 R2 Preview):

two NAS devices NAS1 and NAS2, there are two iSCSI disks on each because the SOFS cluster storage pool seems to want to have a minimum of 3 disks, so I’ve got 4 between the 2 NAS.

FS1 and FS2, both file servers have the iSCSI initiator connect to all targets on the NAS - so in each FS I see four attached iSCSI disks in disk management. I create a cluster between FS1 and FS2, create a storage pool in failover cluster manager and include all 4 disks, then I create a virtual disk within this storage pool, finally I add the file server role under the role services section, create a share and assign relevant permissions.

Everything is online, everything looks good. I can transfer the scale out file service role between FS1 and FS2 as I expect, the cluster name for this role is SOFS1.

So to test that the backend disk system can fail, I turn off NAS1 but the storage pool in my cluster shows errors and warnings that disks are not available and I cant bring them online even though both FS1 and FS2 are online. The file services cluster role is showing that its online.

My question here is two part:

Have I completely missed the point of a scale out file server and this is not how its supposed to work (i.e. is it just meant to work if one of the FS goes down rather than the backend NAS)?

Or… is this is unexpected behaviour, is it not just meant to use NAS2?

I assumed that the storage pool I created which included disks from both NAS1 and NAS2 will mirror the data across those disks so removing either NAS shouldn’t fail the storage?

I would appreciate any advice on this matter.

Thanks

Steve

 

Multiple Event ID's 4738 and 4724 on new Windows 2012 Hyper V cluster

$
0
0
We just setup a two node Windows 2012 Hyper V cluster.  Everything is working correctly, but in the Security log on both nodes, we're getting Event ID's 4738 and 4724 every 3 minutes for the built in CLIUSR (Failover Cluster Local Identity) that gets created with the cluster (there's a local CLIUSR account on each node).  Because we have an application that parses the security logs and emails us about user account changes, we're getting spammed because of this.  Does anyone know if this is normal behavior for that CLIUSR account?  Any way to suppress this?  On the account properties, "User cannot change password" and "Password never expires" are both checked.  I appreciate any insights people might have.  Thanks

Full internet connection but speed is very minimal if it will load at all

$
0
0

I have tried everything you could possibly imagine with troubleshooting. reset the modem, the computer. deleted all cookies etc. I've searched and ridded my computer of any viruses, but I can't seem to get a stable wireless connection to the internet. My connection to the modem is full (excellent) and says that it is running at 60mbps. I've been trying to download an anti spyware program but can only manage to get up to 700 BYTES per second. All in all my connection is stuffed, I'm not sure if it is either something to do with the modem or my computer. If you have any suggestions about this, please let me know ASAP.

I feel as though this is going to make me destroy my laptop

Cheers

Jacob

Hyper-V Failover Cluster - Inconsistent Network Availability

$
0
0

We've got a Small cluster with, 7 hosts and a dozen or two VM's.  For some reason i'm getting inconsistent availability with the Cluster networks.  The host seem to function fine on there own but theres all types of issues using Migration which i'm assuming is because certain hosts think other hosts are unavailable. For Example:

Cluster Network 1 - From Host 8

Cluster Network 1 - From Host 10

As far as I can tell all of the networks are UP. I can ping all hosts on all interfaces.  What criteria goes into determining host availability?





CAU: Analyze Clutser Passes all nodes except local node

$
0
0
  1. No matter which node I run Test-CAUSetup on, it passes all nodes except the node running the test.  Specifically, it fails test remote management via WMIv2 & PowerShell remoting.  WSManFault Code 2150859027 thrown.
  2. This is a Hyper-V cluster managed by a SC 2012 SP1 VMM server.  For security purposes, VMM does not use the default remote management HTTP port 5985.
  3. On each node, I've set the PowerShell execution policy to unrestricted, enabled the two remote management firewall rules, and opened HTTP port 5985 with the following command: WinRm Create winrm/config/listener?AddressIP:hyperV_mgt_Ip+Transport=HTTP 'at_symbol{Port="5985"}'.
  4. Can I safely ignore this issue and press forward with CAU or does it need to be addressed?  If the latter, what?  Create HTTPS:5986 listener?
Thank You


Can't access Failover Cluster Manager in Windows 2012 R2 Cluster - The Kerberos client received a KRB_AP_ERR_MODIFIED error from the server

$
0
0

Hello!

   My client has a two-node Windows Server 2012 R2 Hyper-V Cluster which has been running for month or two.  The other day I found that from the Hyper-V1 host I couldn't run the Failover Cluster Manager - I got an error "The RPC Server is Unavailable - Exception from HRESULT: 0x800706BA".  Then I tried to log into the other Hyper-V Host (Hyper-V2) and found that I couldn't log in using domain credentials - I had to log in locally.  On this Hyper-V2 server I saw errors in the System event log:

The Kerberos client received a KRB_AP_ERR_MODIFIED error from the server hyper-v2$. The target name used was HYPER-V2$. This indicates that the target server failed to decrypt the ticket provided by the client. This can occur when the target server principal name (SPN) is registered on an account other than the account the target service is using. Ensure that the target SPN is only registered on the account used by the server. This error can also happen if the target service account password is different than what is configured on the Kerberos Key Distribution Center for that target service. Ensure that the service on the server and the KDC are both configured to use the same password. If the server name is not fully qualified, and the target domain (XXXX.XXXX.com) is different from the client domain (XXXX.XXXX.COM), check if there are identically named server accounts in these two domains, or use the fully-qualified name to identify the server.

   I've done quite a bit of searching but I didn't find this exact scenario.  And because this is a very busy season for my client I'd like to try to resolve this issue with minimal impact to the running VM's.  As of right now all of the VM's are working - I just can't manage them from the Failover Cluster Manager (which means that I can't even run live-migrations).  I feel pretty much stuck.

   Can anyone give me any (safe) guidance on how to proceed?  I'd certainly appreciate the help!!!

dave

Getting started with Cluster - Design advice

$
0
0

We have a new Dell VRTX server with 2 blades and shared storage.

Its my first Cluster so I'm trying to get to grips with the design and terminology.

What I want to achieve (Goals):-

  • Hyper-V Cluster to ensure resilience and no single point of failure
  • Virtualise a number of older physical servers, such as Print Servers, consoles for managing systems, door access system etc
  • Virtualised new file server with DFS for use access to data

Any advice on how best to configure this would be appreciated, not necessarily step by step but whether to use high availability file server?, do we use SMB?,  how to best partition the storage?, how best to provide storage for file server? etc etc

I will also be wanting to make snapshots of the Hyper-V servers onto another physical server off-site just in case of a disaster.


Windows 2008 Cluster question on using a new cluster drive source from shrinking existing disk

$
0
0

I have a two node Windows 2008 R2 enterprise SP1 cluster. It has a basic cluster setup of one (Q:)quorum disk and data disk (E:) which is 2.7tb is size. This cluster is connected to a shared Dell Disk array.

My question is can I safely shrink the 2.7tb drive down and carve out a disk size of 500gb from the same disk and use for a new cluster disk resource. We want to install Globalscape SFTP software on this new disk for use as a cluster resource.

Will this work without crashing the cluster.

Thanks,

Gonzolean


Server 2012 Hyper-V Cluster Network Configuration

$
0
0

I have not been able to find any documentation that explicitly says if my network configuration is supported or not. It passes cluster validation but after a crash last weekend I am questioning it.  Hopefully one of the experts here can chime in.

Here is our setup:

Server 2012 Hyper-V cluster, two nodes with a disk witness.  Dell R520 Servers and a Dell Equalogic iSCSI SAN.  CSV and witness disk are running on the SAN (on different volumes).  Currently running four VMs, will expand to eight, possibly nine in the near future.

Each host has (all gigabit):

a 4-port NIC dedicated to the SAN using Dell mpio on its own subnet

a 4-port NIC team dedicated to the VM network

a 2-port NIC team for management, heartbeat and live migration

All are connected to a Dell two-switch stack with load balancing and failover at the switch level.  All teams are using link aggregation and the SAN connections are using aggregation and jumbo frames.

My concern is having the live migration network on the same network team as management.  I know this is not recommended but I have not seen this configuration listed as "not supported" either.  Redundancy shouldn't be an issue with failover at the team and the switches.  I am concerned with possible bottle-necking though.

If I were to add QoS to the 2-port team to cap the management and heartbeat bandwidth would that be enough?  How much of a cap should I set?

Or should I break the team and create another network exclusively for live migration?

Or is the current configuration okay?

SIMPLE QUESTION: HOW TO MIGRATE FROM WINDOWS 2008 R2 + SQL 2012 FAILOVER CLUSTER to WINDOWS SERVER 2012 CLUSTER WITH ALWAYS ON AVAILABILITY GROUP

$
0
0

Hello,

We have 2-node Windows 2008 R2 Enterprise Edition failover cluster with Fibre shared storage (SAN) running SQL Server 2012 SP1. Below is current configuration - very simple and classic, I would say everything by the book:

This is what I think we want to achieve:

Objectives:

1. Upgrade Windows Operating System from Windows Server 2008 R2 to Windows Server 2012

2. Migrate to SQL Server 2012 Always On Availability Group (AAG) for High Availability and Disaster Recovery

My question is how to achieve both goals?

If possible I would like to upgrade OS first. Ideally I would like to upgrade on the same hardware (because it should be minimal impact - no need to migrate data). If this is not possible, we have new hardware I can use also. But I guess it will be more impact and actual data migration will be required.

For AAG what I'm honestly missing is what would be the name of the second SQL server? Lets say my servers called DB1 and DB2, and SQL server called DB. If I create AAG, and fail-over to replica server, would SQL server name be DB as well?

I know there is lots of documentation on AAG and I went through it but I cannot find any specific information about names.

Another question I have - would 3rd server (DB3) be part of the same MSCS cluster? Or it will be separate server? How fail-over exactly works - do I use Fail-over cluster Manager to initiate failover?

Sorry for lots of questions, but any information would be appreciated very much.

Thanks!



One VM Cluster Resources Regularly Failing

$
0
0

Hi All,

We run hundreds of Windows and Linux VMs in clustered and non-clustered environments. However, we're having issues with one particular VM that regularly restarts itself. The environment the problem VM is running in is a Windows 2012 R2 cluster.

The event log within the VM provides no BSOD information, the only entry of any note:

The system has rebooted without cleanly shutting down first. This error could be caused if the system stopped responding, crashed, or lost power unexpectedly.

Therefore, I don't believe that the actual OS within the VM (Windows 2008 R2) is crashing.

The cluster log shows only a single entry:

Cluster resource 'Virtual Machine vps.xxxxxx.com' of type 'Virtual Machine' in clustered role 'vps.xxxxxx.com' failed.

Based on the failure policies for the resource and role, the cluster service may try to bring the resource online on this node or move the group to another node of the cluster and then restart it.  Check the resource and group state using Failover Cluster Manager or the Get-ClusterResource Windows PowerShell cmdlet.

How can I debug this? I've added extra RAM to it, and moved it to other nodes and other storage but the problem continues to occur. 

Thanks

Will

event id -5120

$
0
0

Hi All,

Can  any  one  help  me to  resolve  this issue.  iam  not  geting  100% validation report for my cluster configaration...

DCluster Shared Volume 'Volume1' ('Cluster Disk 1') is no longer available on this node because of 'STATUS_MEDIA_WRITE_PROTECTED(c00000a2)'. All I/O will temporarily be queued until a path to the volume is reestablished.

Cluster Disk i/o Timeout

$
0
0

Hi ,

We are stuck in problem with our private cloud protection , when ever DPM trying to backup virtual machines the cluster shared volume i.o timeout and the disk disappeared for a moment and that cause my virtual machine rebooted unexpectedly and move to different nodes of cluster . 

Follow are the configuration of my Infrastructure .

1. Windows server 2012 Cluster X 6 nodes

2. DPM 2012 SP1

Event Generated when backup initiate :

Cluster Shared Volume 'Volume2' ('CSV 7TB Cluster Disk Production') is no longer available on this node because of 'STATUS_CLUSTER_CSV_AUTO_PAUSE_ERROR(c0130021)'. All I/O will temporarily be queued until a path to the volume is reestablished.

I have applied hot fix as Microsoft recommended 

http://support.microsoft.com/kb/2813630/en-us

Disabled ODX as well , because my storage doesn't support this feature .

please help me out  to resolve this matter .

Best Regards,

Muzammil


Muzammil Ubaray

Cluster Shared Volume is no longer accessible from cluster node

$
0
0

Hello,

We have a 3 nodes Hyper-v Cluster running Windows Server 2012. Recently we start having error below intermittently on a node, and the VMs running on this host and LUN will power off.

Alert: Cluster Shared Volume is no longer accessible from cluster node
Source: Cluster Service
Path: HV01.itl.local
Last modified by: System
Last modified time: 12/1/2013 12:27:18 AM
Alert description: Cluster Shared Volume 'Volume1' ('Cluster_Vol1_R6') is no longer accessible from this cluster node because of error 'ERROR_TIMEOUT(1460)'. Please troubleshoot this node's connectivity to the storage device and network connectivity.

The only changes made recently is we installed VEEAM on test basis for DR replication. We switched off the Veeam server and stop the Veeam Services on the Hyper-V Hosts but we are still having same issue.

We are using an EMC SAN connected via FC as Shared storage and Powerpath as Multi-Pathing. No errors were found on the SAN.

I don't think the issue is related to the number of IO as we also experienced the issue at midnight during the week-end where no one was working.

Any help would be very much appreciated.

Thanks.

Irfan


Irfan Goolab SALES ENGINEER (Microsoft UC) MCP, MCSA, MCTS, MCITP, MCT


Server 2012R2 Cluster will not start after failure

$
0
0

Hello,

we have a two node cluster running server 2012R2, at the weekend we had a power failure and one of the nodes (I presume it was the active node) has a hardware failure and won't come back up. I thought this will be fine because we have another node and that would take on the work - however, in failover cluster manager the it won't connect to the cluster name, or the local node which is running. if I run the command Start-ClusterNode -FixQuorum it shows that the node is in the state of "joining" but never seems to get any further than that. I can manually start the cluster service but after a while it seems to stop.

I believe this is all because the active node is not contactable now I think what I am asking is how to make this remaining node be the authoritative active node....but I don't know what to do to make that happen.....

I would appreciate any help,

thank you

Steve

Windows 2012 R2 Failover Cluster Hyper-V Invalid Class error when I'm trying to create VM

$
0
0

Hi, in my test lab environment I created Windows 2012 R2 Failover Cluster with 4 Servers to get Hyper-V HA.


I had no issues with Windows 2008 R2 or Windows 2012 before in same setup (2 NIC, FC HBA, SAN storage), but this time I cannot creat VM using Failover Cluster console:


Roles - Vurtual Machines - New Virtual Machine - Select Host -> <ANY HOST>

I'm using default settings during VM creation (except setting path to VM manually to pont to desired disk).

IN progress I see disk creation and both VM configuration and .vhdx files on target disk, but after that I see "The Operation has failed. An error occured creating a New Virtual Machine. Invalid Class.



In fact I see virtual machine in Hyper-V Manager and it's fully functional, but not added to Failover Cluster Roles. When I use Configure Role - Virtual Machine to see eligible machines - it's not there.

Cluster Validation says that everything is OK.

I wasn't able to find anything in Eventllog(s) or %SYSTEMROOT%\Cluster as it was in Win2k8.

How should I troubleshoot this issue ?

Network Card Reset Triggers a Failover?

$
0
0

Hello,

We have a two-node active/active SQL Server 2008 Std Cluster on Windows 2008 on Production.

After a network maintenance, the network card (not heart-beat) of one node has no gateway ip. We want to fix it without causing a failover.

I assume when we set gateway ip, the network card will be reset, and the ip will be temporarily unavailable, then fail-over happens. Am I right?

Is there a way to play with dependencies not to initiate failover during fixing network card?

Thanks,

Upgrading from Server 2008 R2 Core to 2012 R2

$
0
0

I am having a few problems with my 2008 R2 Core installation and am considering upgrading instead of just reinstalling which would be easier. My question is this; I have a three node failover cluster with Cluster shared volumes where the VHD and VM configs are stored on a NetApp SAN. I want to do a staged upgrade where I take two nodes out of the cluster and install fresh 2012 R2 full installs, configure the servers and set up the new cluster, once I have done that I can have a couple of hours downtime while I move the SAN over to the new cluster and then import all the machines into the new cluster.

The scenario works in my head but I was wondering if I am missing something. Also I am no iscsi guru so can someone give me a step by step guide to setup the MPIO and iscsi Section of the cluster.

We have a limited window to do this work as this will have to take place over the Christmas break which this year consists of 7 working days.

  • Can I use the same cluster name or is it best to use a new one?
  • Lastly should I remove the nodes properly leaving one node or just shutdown and let the cluster windge for a couple of days?
  •  Would I have to do any other config changes to the NetApp?

server 2012 cluster repair option

$
0
0

Hello

Can anyone tell me what a repair actually does to an offline cluster? I cannot find it in technet or msdn.  

Thanks

Viewing all 6672 articles
Browse latest View live