Quantcast
Channel: High Availability (Clustering) forum
Viewing all 6672 articles
Browse latest View live

Implement of File and Print server clustering

$
0
0

Hi All,

Server is installed with File Services and Print Services role, i would like to implement cluster for this server for fault tolerant. This server is direct attached to HP storage with SAS connection and running Windows Server 2008 R2 Std

Question:

1. What are the concern and consideration i need to take note when implement clustering for this server? Pro and Cons for this?

2. Can i configure File and Print clustering in same server? Any different for configure File Services and Print Server on different server?

3. Any impact when i configure File Services and Print Services clustering in same server?

4. As i know Windows Server 2008 R2 Std not support clustering so i need to upgrade to Windows Server 2008 R2 Enterprise or Windows Server 2012 R2.

5. If i upgrade to Windows Server 2012 R2 Std, i need to split out Server Role as 2012 does not has clustering for Print Services. Right?

Regards,

mekmek



Windows 2012 R2 Cluster on DMZ network

$
0
0

Dear , We are planning to setup windows 2012 R2 Cluster on DMZ network in order to have high availability of Hyper-V virtual Machine.

My concern are :

Could we setup windows 2012 R2 cluster on DMZ without our Active Directory? if yes let describe it how.

Note: We have windows 2008 R2 Active Directory in Internal network

Regards,

Hussain

Cluster Node paused

$
0
0

Hi there

My Setup:

2 Cluster Nodes (HP DL380 G7 & HP DL380 Gen8)
HP P2000 G3 FC MSA (MPIO)

The Gen8 Cluster Node pauses after a few minutes, but stays online if the G7 is paused (no drain) My troubleshooting has led me to believe that there is a problem with the Cluster Shared Volume:

00001508.000010b4::2015/02/19-14:51:14.189 INFO  [RES] Network Name: Agent: Sending request Netname/RecheckConfig to NN:cf2dec1d-ee88-4fb6-a86d-0c2d1aa888b4:Netbios
00000d1c.0000299c::2015/02/19-14:51:14.615 INFO  [API] s_ApiGetQuorumResource final status 0.
00000d1c.0000299c::2015/02/19-14:51:14.616 INFO  [RCM [RES] Virtual Machine VirtualMachine1 embedded failure notification, code=0 _isEmbeddedFailure=false _embeddedFailureAction=2
00001508.000010b4::2015/02/19-14:51:15.010 INFO  [RES] Network Name <Cluster Name>: Getting Read only private properties
00000d1c.00002294::2015/02/19-14:51:15.096 INFO  [API] s_ApiGetQuorumResource final status 0.
00000d1c.00002294::2015/02/19-14:51:15.121 INFO  [API] s_ApiGetQuorumResource final status 0.
000014a8.000024f4::2015/02/19-14:51:15.269 INFO  [RES] Physical Disk <Quorum>: VolumeIsNtfs: Volume\\?\GLOBALROOT\Device\Harddisk1\ClusterPartition2\ has FS type NTFS
00000d1c.00002294::2015/02/19-14:51:15.343 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQ's DLL is not present on this node.  Attempting to find a good node...
00000d1c.00002294::2015/02/19-14:51:15.352 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQTriggers's DLL is not present on this node.  Attempting to find a good node...
000014a8.000024f4::2015/02/19-14:51:15.386 INFO  [RES] Physical Disk: HardDiskpQueryDiskFromStm: ClusterStmFindDisk returned device='\\?\mpio#disk&ven_hp&prod_p2000_g3_fc&rev_t250#1&7f6ac24&0&36304346463030314145374646423434393243353331303030#{53f56307-b6bf-11d0-94f2-00a0c91efb8b}'
000014a8.000024f4::2015/02/19-14:51:15.386 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: GetVolumeInformation failed for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
000014a8.000024f4::2015/02/19-14:51:15.386 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: failed to get partition size for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
00000d1c.00001420::2015/02/19-14:51:15.847 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQ's DLL is not present on this node.  Attempting to find a good node...
00000d1c.00001420::2015/02/19-14:51:15.855 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQTriggers's DLL is not present on this node.  Attempting to find a good node...
000014a8.000024f4::2015/02/19-14:51:15.887 INFO  [RES] Physical Disk: HardDiskpQueryDiskFromStm: ClusterStmFindDisk returned device='\\?\mpio#disk&ven_hp&prod_p2000_g3_fc&rev_t250#1&7f6ac24&0&36304346463030314145374646423434393243353331303030#{53f56307-b6bf-11d0-94f2-00a0c91efb8b}'
000014a8.000024f4::2015/02/19-14:51:15.888 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: GetVolumeInformation failed for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
000014a8.000024f4::2015/02/19-14:51:15.888 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: failed to get partition size for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
00000d1c.00001420::2015/02/19-14:51:15.928 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQ's DLL is not present on this node.  Attempting to find a good node...
00000d1c.00001420::2015/02/19-14:51:15.939 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQTriggers's DLL is not present on this node.  Attempting to find a good node...
000014a8.000024f4::2015/02/19-14:51:15.968 INFO  [RES] Physical Disk: HardDiskpQueryDiskFromStm: ClusterStmFindDisk returned device='\\?\mpio#disk&ven_hp&prod_p2000_g3_fc&rev_t250#1&7f6ac24&0&36304346463030314145374646423434393243353331303030#{53f56307-b6bf-11d0-94f2-00a0c91efb8b}'
000014a8.000024f4::2015/02/19-14:51:15.969 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: GetVolumeInformation failed for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
000014a8.000024f4::2015/02/19-14:51:15.969 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: failed to get partition size for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
00000d1c.00001420::2015/02/19-14:51:16.005 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQ's DLL is not present on this node.  Attempting to find a good node...
00000d1c.00001420::2015/02/19-14:51:16.015 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQTriggers's DLL is not present on this node.  Attempting to find a good node...
000014a8.000024f4::2015/02/19-14:51:16.059 INFO  [RES] Physical Disk: HardDiskpQueryDiskFromStm: ClusterStmFindDisk returned device='\\?\mpio#disk&ven_hp&prod_p2000_g3_fc&rev_t250#1&7f6ac24&0&36304346463030314145374646423434393243353331303030#{53f56307-b6bf-11d0-94f2-00a0c91efb8b}'
000014a8.000024f4::2015/02/19-14:51:16.059 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: GetVolumeInformation failed for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
000014a8.000024f4::2015/02/19-14:51:16.059 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: failed to get partition size for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
00000d1c.00002568::2015/02/19-14:51:17.110 INFO  [GEM] Node 1: Deleting [2:395 , 2:396] (both included) as it has been ack'd by every node
00000d1c.0000299c::2015/02/19-14:51:17.444 INFO  [RCM [RES] Virtual Machine VirtualMachine2 embedded failure notification, code=0 _isEmbeddedFailure=false _embeddedFailureAction=2
00000d1c.0000299c::2015/02/19-14:51:18.103 INFO  [RCM] rcm::DrainMgr::PauseNodeNoDrain: [DrainMgr] PauseNodeNoDrain
00000d1c.0000299c::2015/02/19-14:51:18.103 INFO  [GUM] Node 1: Processing RequestLock 1:164
00000d1c.00002568::2015/02/19-14:51:18.104 INFO  [GUM] Node 1: Processing GrantLock to 1 (sent by 2 gumid: 1470)
00000d1c.0000299c::2015/02/19-14:51:18.104 INFO  [GUM] Node 1: executing request locally, gumId:1471, my action: /nsm/stateChange, # of updates: 1
00000d1c.00001420::2015/02/19-14:51:18.104 INFO  [DM] Starting replica transaction, paxos: 99:99:50133, smartPtr: HDL( c9b16cf1e0 ), internalPtr: HDL( c9b21

This issue has been bugging me for some time now. The Cluster is fully functional and works great until the node gets paused again. I've read somewhere that the MSMQ errors can be ignored, but can't find anything about theHardDiskpGetDiskInfo: GetVolumeInformation failed messages. No errors in the san or the Server Event logs. Driver and Firmware are up to date. Any help would be greatly appreciated.

Best regards

Hyper-V Network Cluster Connections Unavailable

$
0
0

We have 4 servers in a cluster. They appear to be functioning fine, however, Live Migrations have failed with error "Failed to get the network address for this operation. (0x000013AB)."

On investigation, there is a clear fault marked with two servers networks - show up as Unavailable in FailoverCluster Manager -> Networks -> Cluster Network1 -> Click on Network Connections tab. 

Bindings order appears to be identical for server which is OK as well as those which are not.

Any thoughts?

ISCSI disk not available for storage

$
0
0

I am trying to create a lab to demonstrate a simple clustering environment. Best practices is not an issue here. 

I have a lone Domain Controller that is also running Hyper-V. I am hosting 2 VMs, I call Cluster1 and Cluster2. The VMs share the NIC with the DC, and are members of the domain. 

After turning on the iSCSI initiator service on both VMs, I created an iSCSI target on the DC. When I created the target, the DC saw both of the iSCSI initiators on the VMs, and created the target without incident. The target now points to a .vhdx file on the DC. The disk initialized without incident on both VMs.

I was able to add the disk and create a CSV without incident in Failover Cluster Manager. It shows up as online and healthy. When I look at the iSCSI target on the DC, it shows up at health and connected. I ran validation against the cluster, and the storage comes up perfect, no warnings. 

When I try and add the file sever role (File Server for general use) to the clustered VMs, no storage is available to add the the role. 

The only indication that anything is wrong is that on Cluster2, the disk is shown to be offline. If I try and bring it online, I get the following error:

“The specified disk or volume is managed by the Microsoft Failover Clustering component. The disk must be in cluster maintenance mode and the cluster resource status must be online to perform this operation”

I have tried everything mentioned in the error message, to no avail. In cluster manager, it shows that Cluster1 is the owner of the resource.

Why isn’t this working? I know a .vhdx file has to be marked at shared if two machines are going to access it, but I don’t see how I can do that, or perhaps its already shared?

More to the point, where could I place a .vhdx file on the DC one so that I could share it between the two VMs? 

I have 2 more servers at my disposal, and I could hypothetically go out and buy cheap a NAS that supports iSCSI, but thats missing the point. I want to run it all off one box so its portable and easy to demonstrate. 

Thank you in advance.



validation failed on remote server - Ensure that the remote registry service is running, and have remote administration enabled

$
0
0

I am trying to setup my 2012 cluster and when i try to add my remote server it gives me an error

Failed to access remote registry on server.

Ensure that the remote registry service is running, and have remote administration enabled

I checked the server but remote registry is started already

any idea?

also checked under server manager and remote management is enabled

Oracle10g Ent. Real application cluster setup in 32bit Windows 2008 Ent

$
0
0
We want to setup Oracle 10g Enterprise Real application cluster(RAC) in a two node 32bit Windows 2008 Enterprise server
for database  high availability.

DB servers have 16GB RAM and are of same make and model. I want to know if there is any limitation in RAM usage if Oracle 10g Enterprise RAC is setup in a two node 32bit Windows 2008 Enterprise server.  I read in some form that Oracle 10g RAC would not use more than 2 GB RAM in 32 bit Windows 2008 Enterprise server environment.

Can somebody confirm and let me know the pro's and con's of such a setup?

Would editing boot.ini help Oracle recognize 16 GB RAM in the server? Would it create any issues in terms of performance
and stability of OS?

The application that would use the cluster DB are ESRI GIS and MS Dynamics AX 2009.

Failed to put node in node maintenance mode. Details: Microsoft.ClusterAwareUpdating.ClusterUpdateException: Could not suspend cluster node

$
0
0

CAU was working for some time (I was really surprised that it did actually work without any hitch, being it MS product...)

But it is back to its old tricks:

Failed to put node VHOST01 in node maintenance mode.  Details: Microsoft.ClusterAwareUpdating.ClusterUpdateException: Could not suspend cluster node "VHOST01".
at MS.Internal.ClusterAwareUpdating.Util.CheckPshError(PowerShell shell, MulticulturalString exceptionMessage)
at MS.Internal.ClusterAwareUpdating.FailoverClusterImpl.PutIntoMaintenanceMode(String nodeName, ICauPluginCallbackBase callback, CancellationToken cancelToken, Boolean force)

After which it seems that it just rebooted the host with all VMs crashing & restarting on the other host in the cluster and the update finishing on both hosts with Success (I would call it otherwise)

Nice, really nice...

Anybody has any idea what that "warning" means


2012 R2 Hyper-V cluster nodes hang

$
0
0

Hi,

We have a two node Hyper-V cluster. ~ once a week either one of the cluster nodes hangs during a backup (Backup Exec 2014 Vray edition) causing all VM's to restart to the other node. When a node hangs the console is just black and mouse moving. Ctrl-alt-del does nothing, only option is to reboot the server.
And almost always if I just let the node to boot up it boots up to the same state, black screen only mouse visible.
I have to boot it first to safe mode and then reboot it again to get it up.

Hardware:

2 x IBM x3550 (2 x CPU, 320 GB RAM, addtional cards: 4-port intel net card + 2 port SAS-card ) as Hyper-V nodes
1 x IBM V3700 as SAN-storage, connected to both nodes with redundant SAS-cables.

Software used:
Windows Server 2012 R2 datacenter OS with Hyper-V roles in cluster nodes
Windows failover clustering
4 x 2TB Shared CSV-disks for Virtual-machines
Backup Exec 2014 V-ray edition
SDDDSM driver for V3700

Configuration:
1 network team of two interfaces for VM-traffic only
1 network interface for VM-traffic only for DMZ traffic for selected VM's
1 network team of two interfaces for Cluster traffic only
1 management interface

ODX (Offloaded Data Transfers) is disabled from both nodes as V3700 does not support it.
~30 virtual machines, mostly windows server versions from 2003 to 2012 R2, couple of Ubuntu VMs and four Windows 7 VMs.

We have all the latest Windows updates and HW firmwares installed in our Cluster nodes.
The problem is that the nodes won't generate any kind of dumps when they hang, so we can't pinpoint where the problem is.

Also system logs don't reveal anything that would tell the actual cause of the hang.

For example according to System log one of the nodes hung at 21:17:44:
The previous system shutdown at 9:17:44 PM on ‎12/‎10/‎2014 was unexpected.

From the even viewer I have found following errors, but these are not near the crash time.

17:06:08 
ERROR VSS
Volume Shadow Copy Service error: Unexpected error calling routine IVssAsrWriterBackup::GetAsrMetadata.  hr = 0x80070037, The specified network resource or device is no longer available.


Operation:
   PrepareForBackup event

Context:
   Execution Context: ASR Writer
   Execution Context: Writer
   Writer Class Id: {be000cbe-11fe-4426-9c58-531aa6355fc4}
   Writer Name: ASR Writer
   Writer Instance ID: {d2d37e37-99d1-446d-a840-5390af00616e}

Error-specific details:
   ASR Writer: The specified network resource or device is no longer available. (0x80070037)



17:06:08 
Warning VSS
Volume Shadow Copy Service warning: ASR writer Error 0x80070037.  hr = 0x00000000, The operation completed successfully.


Operation:
   PrepareForBackup event

Context:
   Execution Context: ASR Writer
   Execution Context: Writer
   Writer Class Id: {be000cbe-11fe-4426-9c58-531aa6355fc4}
   Writer Name: ASR Writer
   Writer Instance ID: {d2d37e37-99d1-446d-a840-5390af00616e}

Error-specific details:
   ASR Writer: The specified network resource or device is no longer available. (0x80070037)


We also have these errors showing up in the Event viewer multiple times during backups, but according to info released
by Microsoft these seem to be related to VM's with IDE root-disks:

ERROR: VDS Basic Provider
Unexpected failure. Error code: 48F@01000003

We will need help to find out what is causing these hangs. Anyone have any hints or should I just open a case to Symantec or Microsoft?

Br,
Antti Kiiski


Failover Cluster Manager 2012 Showing Wrong Disk Resource - Fix by Powershell

$
0
0

On Server 2012 Failover Cluster Manager, we have one Hyper-V virtual machine that is showing the wrong storage resource.  That is, it is showing a CSV that is in no way associated with the VM.  The VM has only one .vhd, which exists on Volume 16.  The snapshot file location and smart paging file are also on Volume 16.  This much is confirmed by using the Failover Cluster Manager to look at the VM settings.  If you start into the "Move Virtual Machine Storage" dialog, you can see the .vhd, snapshots, second level paging, and current configuration all exist on Volume 16.  Sounds good.

However, if you look at the resources tab for the virtual machine, Volume 16 is not listed under storage.  Instead, it says Volume 17, which is a disk associated with a different virtual machine.  That virtual machine also (correctly) shows Volume 17 as a resource.

So, if everything is on Volume 16, why does the Failover Cluster Manager show Volume 17, and not 16, as the Storage Resource?  Perhaps this was caused by an earlier move with the wrong tool (Hyper-V manager), but I don't remember doing this.

In Server 2003, there was a "refresh virtual machine configuration" option to fix this, but it doesn't appear in Failover Cluster Manager in Server 2012.

Instead, the only way I've found to fix the problem is in PowerShell.

  Update-ClusterVirtualMachineConfiguration "put configuration name here in quotes"

You would think that this would be an important enough operation to include GUI support for it, possibly in the "More Actions" right-click action on the configuration file.

Windows 2012 R2 hanging on "Forming cluster"

$
0
0

Man, do I need some help here.

I am getting the following error when trying to create a cluster on Windows 2012 R2 for 2 nodes. It is also failing when using just one node. I have the following configuration:

Win2012A - Domain Controller

Win2012B - Domain Controller member

All the validation tests run fine. Here are the errors:

Waiting for notification that Cluster service on node WIN2012A.cluster.local has started.

Forming cluster 'tsmcluster'

Unable to successfully cleanup.

An error occurred while creating the cluster and the nodes will be cleaned up. Please wait...

There was an error cleaning up the cluster nodes. Use Clear-ClusterNode to manually clean up the nodes.

An error occurred while creating the cluster.

This operation returned because the timeout period expired.

To troubleshoot cluster creation problems, run teh Validate a Configuration wizard on the servers you want to cluster.

Thanks

ssghj

File server on cluster

$
0
0

Hi

I have failover cluster on win server 2008 R2 that host file service fro 2 nodes (File1 , File2)

I make failover with no issues but once i failover to file server1 some users cannot access the shared volumes

some users when they make logoff and login they also cannot access the shared volumes

what was this issue ?

need help and how can i monitoring ?


MCP MCSA MCSE MCT MCTS CCNA

Cannot start a WSFC node with -ForceQuorum in a cluster that's lost quorum

$
0
0

Hi,

I've got a really simple setup: a two node healthy cluster constisting of SRV1 and SRV2. Current Vote is 1 for SRV1 and 0 for SRV2. To simulate a lost node (and in this case cluster losing quorum) I remove SRV1 from the network. Failover Cluster Manager (FCM) on SRV1 pretty instantly reports the status of the nodes as:

SRV1 - UP
SRV2 - DOWN

Fine. On SRV2 however, nothing happens in FCM for some time. After about a minute, FCM loses contact with the cluster. When I try to reconnect FCM to the local node (SRV2), I get the following error:

Node 'SRV2' is in the process of being started. The remote server has been paused or is in the process of being started.

Waiting does not help - the problem persists. I then resort to PowerShell and "Start-ClusterNode -ForceQuorum". It responds with State=Joining. But the node is never started. Cannot connect to it in FCM. And any other PowerShell command (e.g. Get-ClusterNode) returns "The remote server has been paused or is in the process of being started".

What am I doing wrong? How can I manually force a node to start in a cluster that's lost quorum?

Kindly,
Fredrik


NFS shares on Scale Out File Server Clusters 2012R2 - is this supported?

$
0
0

Hi,

I would like to know if NFS shares are supported on Failover Clusters 2012R2. On an existing Scale Out File Server cluster 2012R2, I would like to install 'Server for NFS' role and than would like to provision NFS shares to store backup data (not virtual machines).
 
I would appreciate if someone could verify if this is supported by Microsoft.

Thanks in advance.

Regards

Prabhash

Windows 2012 - File Cluster Performance

$
0
0

Hi,

We have a file server cluster in windows 2012 that is used as backend access to our app servers. This is a large environment and lots of access 24/24h a day by our app servers.

Each node has 64GB of RAM, MS NIC Teaming with 10GBbps per nic, etc...

During business hours the server performance is acceptable, but we have a serious performance hit at night when the backups start, during the first 1 or 2 hours the access to the files in the file cluster become very slow...

I already check several times this problem with no effective solution, hopefully you guys can throw other different ideas for us to test.

So here is the scenario:

- The problematic share is one folder with aprx. 4.000.000 files that range between 4k and 1Mb.

- During working hours the average access time to the files is between 3ms and 1sec MAX. At night when the BKs starts, during the first hour or two of the Backup process, the access time can reach up to 60 seconds and this causes the app servers and web servers to timeout which causes problems for our clients.

- Note the backup (full backup) takes aprox 12 hours. The daily backup takes 4 Hours but we still have the issue (the drives are backup one of the time to not cause large impact)...

-I already check the storage and everything is apperantly ok, the network appears to not be a problem as well (the app servers and the file cluster is in the same vLan segment), no CPU increase, and so on...

- I also ran perfmon, windows performance toolkit (WPT) and others... From my perspective the problem is related with the file access in the disks/luns... But I don`t see any credible evidences of that, howerver in WPT reports the first 2 hours of large operations may be related with snapshots being taken to all the files and this may be the problem...

So, for windows 2012 what recommendations or ideas do you suggest?

Thank you.



can not RDP to remote 2012R2 server

$
0
0

Hi experts,

I meet a problem when i remote to 2012r2 server, it occur after i insatll all the latest update by Windows update,  i can enter my credential, but after found it can not logon, the origional message is "Access Denied“ and nothing else, the same there have a event

Log Name:      Application
Source:        Windows Error Reporting
Date:          2015/3/15 14:30:48
Event ID:      1001
Task Category: None
Level:         Information
Keywords:      Classic
User:          N/A
Computer:      2012r2-02
Description:
Fault bucket , type 0
Event Name: IMECustomerEvent
Response: Not available
Cab Id: 0

Problem signature:
P1: IPX Assertion
P2: 0CHS
P3: ChsIME.exe
P4: 6.3.9600.17031
P5: ChsIME.exe
P6: 6.3.9600.17031
P7: Windows\feime\Modern\IMEexe\common\CImeKeyboardInputProvider.h
P8: 489
P9:
P10:

Attached files:

These files may be available here:
C:\ProgramData\Microsoft\Windows\WER\ReportQueue\NonCritical_IPX Assertion_d4c76cfcf567ba4d2612a3eb3b6bb0345d4d3494_00000000_cab_13dfbf5a

Analysis symbol:
Rechecking for solution: 0
Report Id: d0d335ef-cadc-11e4-813e-68b599ef54b5
Report Status: 4
Hashed bucket:
Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="Windows Error Reporting" />
    <EventID Qualifiers="0">1001</EventID>
    <Level>4</Level>
    <Task>0</Task>
    <Keywords>0x80000000000000</Keywords>
    <TimeCreated SystemTime="2015-03-15T06:30:48.000000000Z" />
    <EventRecordID>11487</EventRecordID>
    <Channel>Application</Channel>
    <Computer>2012r2-02</Computer>
    <Security />
  </System>
  <EventData>
    <Data>
    </Data>
    <Data>0</Data>
    <Data>IMECustomerEvent</Data>
    <Data>Not available</Data>
    <Data>0</Data>
    <Data>IPX Assertion</Data>
    <Data>0CHS</Data>
    <Data>ChsIME.exe</Data>
    <Data>6.3.9600.17031</Data>
    <Data>ChsIME.exe</Data>
    <Data>6.3.9600.17031</Data>
    <Data>Windows\feime\Modern\IMEexe\common\CImeKeyboardInputProvider.h</Data>
    <Data>489</Data>
    <Data>
    </Data>
    <Data>
    </Data>
    <Data>
    </Data>
    <Data>C:\ProgramData\Microsoft\Windows\WER\ReportQueue\NonCritical_IPX Assertion_d4c76cfcf567ba4d2612a3eb3b6bb0345d4d3494_00000000_cab_13dfbf5a</Data>
    <Data>
    </Data>
    <Data>0</Data>
    <Data>d0d335ef-cadc-11e4-813e-68b599ef54b5</Data>
    <Data>4</Data>
    <Data>
    </Data>
  </EventData>
</Event>

Please any help me, thanks !


How to use CAU in 2012R2 to install the specific kind of update

$
0
0

Hi all,

Can any body heklp me how to use CAU to install the specifc kind of update, such only insatll hotfix but not the function update, Thanks.

DFS can only see in one direction at a time with clustered server

$
0
0

Hi,

This one has had me confused for some time.

We have a clustered pair of file servers, with an additional server at our DR site. Certain shares are replicated to the DR site using DFS.

This replication fell over several months ago. We worked around it with a scheduled xcopy, but we want to figure out the root cause now we have time to work on it.

When I run a diagnostic report, the result is always that one server will report that it cannot see its DFS partner:

DFS Replication cannot replicate with partner <CAP name> due to a communication error. The DFS Replication service used partner DNS name <CAP FQDN>, IP address <CAP IP>, and WINS address <CAP name> but failed with error ID: 1727 (The remote procedure call failed and did not execute.). Event ID: 5002

Weird thing is, it's not consistently the same server, rather it's the one with the highest uptime. If I restart a cluster node and put the Cluster Access Point there, then the DR server can't resolve the CAP. If I then bounce the DR server, it will see the CAP, but the CAP no longer sees it. Next time I get the opportunity, I'll reboot the DR server, then a cluster node, then move the role back and forth between nodes to see if the behaviour is consistent.

Computer management also fails in the same direction. All other servers can see each other, and the DR server can see all the other CAPs. Additionally, computer management works via IP address, only name resolution fails, and only for that one name.

Could there be something wrong with the CAP? Can anyone suggest where to look next?

Kind Regards,

Em.

CAU - Cluster Aware Updating Computer Object

$
0
0

Hello,

recently I installed 2 Windows Server 2012 R2 Failover Clusters.
I prestaged the CAU Computer Objects.

After configuring the Role on the first Cluster, I went to the second and specified the wrong computer object for the CAU Role.

Basically I gave the second Cluster the same computer object than the first.
This resulted in a failure, so I removed the CAU Role and installed it again with the correct Computer Object.

However, for some reason it is still referencing to the wrong computer object, therefore CAU cannot run.

Do you know how to clean the information in the cluster re. CAU?

Thanks,
Jens


jensit.wordpress.com

Cluster Node paused

$
0
0

Hi there

My Setup:

2 Cluster Nodes (HP DL380 G7 & HP DL380 Gen8)
HP P2000 G3 FC MSA (MPIO)

The Gen8 Cluster Node pauses after a few minutes, but stays online if the G7 is paused (no drain) My troubleshooting has led me to believe that there is a problem with the Cluster Shared Volume:

00001508.000010b4::2015/02/19-14:51:14.189 INFO  [RES] Network Name: Agent: Sending request Netname/RecheckConfig to NN:cf2dec1d-ee88-4fb6-a86d-0c2d1aa888b4:Netbios
00000d1c.0000299c::2015/02/19-14:51:14.615 INFO  [API] s_ApiGetQuorumResource final status 0.
00000d1c.0000299c::2015/02/19-14:51:14.616 INFO  [RCM [RES] Virtual Machine VirtualMachine1 embedded failure notification, code=0 _isEmbeddedFailure=false _embeddedFailureAction=2
00001508.000010b4::2015/02/19-14:51:15.010 INFO  [RES] Network Name <Cluster Name>: Getting Read only private properties
00000d1c.00002294::2015/02/19-14:51:15.096 INFO  [API] s_ApiGetQuorumResource final status 0.
00000d1c.00002294::2015/02/19-14:51:15.121 INFO  [API] s_ApiGetQuorumResource final status 0.
000014a8.000024f4::2015/02/19-14:51:15.269 INFO  [RES] Physical Disk <Quorum>: VolumeIsNtfs: Volume\\?\GLOBALROOT\Device\Harddisk1\ClusterPartition2\ has FS type NTFS
00000d1c.00002294::2015/02/19-14:51:15.343 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQ's DLL is not present on this node.  Attempting to find a good node...
00000d1c.00002294::2015/02/19-14:51:15.352 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQTriggers's DLL is not present on this node.  Attempting to find a good node...
000014a8.000024f4::2015/02/19-14:51:15.386 INFO  [RES] Physical Disk: HardDiskpQueryDiskFromStm: ClusterStmFindDisk returned device='\\?\mpio#disk&ven_hp&prod_p2000_g3_fc&rev_t250#1&7f6ac24&0&36304346463030314145374646423434393243353331303030#{53f56307-b6bf-11d0-94f2-00a0c91efb8b}'
000014a8.000024f4::2015/02/19-14:51:15.386 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: GetVolumeInformation failed for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
000014a8.000024f4::2015/02/19-14:51:15.386 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: failed to get partition size for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
00000d1c.00001420::2015/02/19-14:51:15.847 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQ's DLL is not present on this node.  Attempting to find a good node...
00000d1c.00001420::2015/02/19-14:51:15.855 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQTriggers's DLL is not present on this node.  Attempting to find a good node...
000014a8.000024f4::2015/02/19-14:51:15.887 INFO  [RES] Physical Disk: HardDiskpQueryDiskFromStm: ClusterStmFindDisk returned device='\\?\mpio#disk&ven_hp&prod_p2000_g3_fc&rev_t250#1&7f6ac24&0&36304346463030314145374646423434393243353331303030#{53f56307-b6bf-11d0-94f2-00a0c91efb8b}'
000014a8.000024f4::2015/02/19-14:51:15.888 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: GetVolumeInformation failed for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
000014a8.000024f4::2015/02/19-14:51:15.888 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: failed to get partition size for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
00000d1c.00001420::2015/02/19-14:51:15.928 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQ's DLL is not present on this node.  Attempting to find a good node...
00000d1c.00001420::2015/02/19-14:51:15.939 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQTriggers's DLL is not present on this node.  Attempting to find a good node...
000014a8.000024f4::2015/02/19-14:51:15.968 INFO  [RES] Physical Disk: HardDiskpQueryDiskFromStm: ClusterStmFindDisk returned device='\\?\mpio#disk&ven_hp&prod_p2000_g3_fc&rev_t250#1&7f6ac24&0&36304346463030314145374646423434393243353331303030#{53f56307-b6bf-11d0-94f2-00a0c91efb8b}'
000014a8.000024f4::2015/02/19-14:51:15.969 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: GetVolumeInformation failed for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
000014a8.000024f4::2015/02/19-14:51:15.969 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: failed to get partition size for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
00000d1c.00001420::2015/02/19-14:51:16.005 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQ's DLL is not present on this node.  Attempting to find a good node...
00000d1c.00001420::2015/02/19-14:51:16.015 WARN  [RCM] ResourceTypeChaseTheOwnerLoop::DoCall: ResType MSMQTriggers's DLL is not present on this node.  Attempting to find a good node...
000014a8.000024f4::2015/02/19-14:51:16.059 INFO  [RES] Physical Disk: HardDiskpQueryDiskFromStm: ClusterStmFindDisk returned device='\\?\mpio#disk&ven_hp&prod_p2000_g3_fc&rev_t250#1&7f6ac24&0&36304346463030314145374646423434393243353331303030#{53f56307-b6bf-11d0-94f2-00a0c91efb8b}'
000014a8.000024f4::2015/02/19-14:51:16.059 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: GetVolumeInformation failed for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
000014a8.000024f4::2015/02/19-14:51:16.059 ERR   [RES] Physical Disk: HardDiskpGetDiskInfo: failed to get partition size for\\?\GLOBALROOT\Device\Harddisk3\ClusterPartition2\, status 3
00000d1c.00002568::2015/02/19-14:51:17.110 INFO  [GEM] Node 1: Deleting [2:395 , 2:396] (both included) as it has been ack'd by every node
00000d1c.0000299c::2015/02/19-14:51:17.444 INFO  [RCM [RES] Virtual Machine VirtualMachine2 embedded failure notification, code=0 _isEmbeddedFailure=false _embeddedFailureAction=2
00000d1c.0000299c::2015/02/19-14:51:18.103 INFO  [RCM] rcm::DrainMgr::PauseNodeNoDrain: [DrainMgr] PauseNodeNoDrain
00000d1c.0000299c::2015/02/19-14:51:18.103 INFO  [GUM] Node 1: Processing RequestLock 1:164
00000d1c.00002568::2015/02/19-14:51:18.104 INFO  [GUM] Node 1: Processing GrantLock to 1 (sent by 2 gumid: 1470)
00000d1c.0000299c::2015/02/19-14:51:18.104 INFO  [GUM] Node 1: executing request locally, gumId:1471, my action: /nsm/stateChange, # of updates: 1
00000d1c.00001420::2015/02/19-14:51:18.104 INFO  [DM] Starting replica transaction, paxos: 99:99:50133, smartPtr: HDL( c9b16cf1e0 ), internalPtr: HDL( c9b21

This issue has been bugging me for some time now. The Cluster is fully functional and works great until the node gets paused again. I've read somewhere that the MSMQ errors can be ignored, but can't find anything about theHardDiskpGetDiskInfo: GetVolumeInformation failed messages. No errors in the san or the Server Event logs. Driver and Firmware are up to date. Any help would be greatly appreciated.

Best regards

Viewing all 6672 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>