Quantcast
Channel: High Availability (Clustering) forum
Viewing all 5648 articles
Browse latest View live

Microsoft DSM versions

$
0
0

I have a 2 node 2012 r2 cluster and am trying to add another 2012 r2 server

I fully pathced the new server and when adding a node it runs validation which fails on the dsm qfe versions

the 2 original units are qfe 16384, the new fully patched server is qfe 17088

so I can't add it to the cluster without validating

I can skip the validation and add it that way, but

I need to know if adding this node with the diffrent mpio dsm version is ok for a while so I can move the vm over and update and restart the other 2 older servers?

the issue is I don't have enough resources to put all the vm's on 1 server

we are a healthcare facility and I don't wan thte cluster to go down

any help would be appreciated


Cluster Windows 2012 in different subnet

$
0
0

Hi

I prepare to configure 4 windows 2012 in a cluster. All windows 2012 have 2 network card (1 = DATA and 1 = Heartbeat)

It's possible to configure:

- All 4 servers network card in same VLAN for "DATA" (Ex.: 10.10.10.0 /24)

- 3 servers network card in VLAN "Heartbeat" (Ex.: 192.168.0.0 /24) and 1 server network card in VLAN  (Ex.: 172.168.0.0/23)

Thanks

Cluster Aware Updating Scheduling

$
0
0

I am having an issue with Cluster Aware Updating (CAU) on Server 2012 and Server 2012 R2. If I schedule self-updating and specify a time other than 03:00 (e.g. 07:00) in the wizard, when I get to the end of the wizard the confirmation page shows the schedule to be 03:00. if I click apply then the schedule does appear to be set to 03:00.The same behaviour occurs whether I am setting up CAU for the first time or editing an existing configuration. Thus it is not possible to schedule cluster aware updating for any time other than 03:00.

I am assuming that this is a bug, although I am open to suggestions if anyone else can think of a possible cause. I haven't found this mentioned anywhere online and I have been to MS Connect and Server 2012 is not listed as open for bugs. Has anybody else been able to reproduce this? Any idea how to report a bug if connect is closed?

I have two clusters, one on Server 2012 and one on 2012 R2 and I can reproduce on both:

OS Name    Microsoft Windows Server 2012 Datacenter
Version    6.2.9200 Build 9200

OS Name    Microsoft Windows Server 2012 R2 Datacenter
Version    6.3.9600 Build 9600

I am happy to supply further details if anyone is willing to help.

Thanks


Event id 1196 third part DNS

$
0
0

Hello - I have created windows 2012 R2 two node cluster. I see this event always in cluster events. We dont use windows DNS and use third party DNS Bind.

"

Cluster network name resource 'Cluster Name' failed registration of one or more associated DNS name(s) for the following reason:
DNS bad key.
.

How I can disable dynamic register of DNS in windows 2012 R2 ?

Thank you,

Adnan


ad

Hyper-v Live Migration not completing when using VM with large RAM

$
0
0

hi,

i have a two node server 2012 R2 cluster hyper-v which uses 100GB CSV, and 128GB RAM across 2 physical CPU's (approx 7.1GB used when the VM is not booted), and 1 VM running windows 7 which has 64GB RAM assigned, the VHD size is around 21GB and the BIN file is 64GB (by the way do we have to have that, can we get rid of the BIN file?). 

NUMA is enabled on both servers, when I attempt to live migrate i get event 1155 in the cluster events, the LM starts and gets into 60 something % but then fails. the event details are "The pending move for the role 'New Virtual Machine' did not complete."

however, when i lower the amount of RAM assigned to the VM to around 56GB (56+7 = 63GB) the LM seems to work, any amount of RAM below this allows LM to succeed, but it seems if the total used RAM from the physical server (including that used for the VMs) is 64GB or above, the LM fails.... coincidence since the server has 64GB per CPU.....

why would this be?

many thanks

Steve

Windows 2012 FSW

$
0
0

Hi

I prepare to configure 5 windows 2012 in cluster with a File Share Withness.

My question: It's possible to configure a share (for a file share withness) on one of my servers Windows 2012?

If Yes, do you a link to explain that?

Thanks

Node in Cluster drops NIC connection. Troubleshooting steps?

$
0
0

Hey all,

Having a Node that occasionally has its NIC disconnect on us (about once a month).  We get nothing but the Event ID 1127 stating that the network interface fails.  Its a NIC team and I've checked the driver/firmware versions of the NIC and the NIC team software, as well as the configuration on each and they are all identical on this Node and the working Node.

I've read someplace that the NIC Power Saving setting needs to be unchecked so the NIC doesn't go to sleep but in Windows 2008 (non-R2) I'm not finding that option anywhere on these NICs to check it. 

I'm wondering if anyone could check out the Cluster log and see if they notice anything obvious and have any ideas that I can pursue for further troubleshooting:

00001770.00000b70::2014/08/05-16:56:06.358 WARN  [RES] Physical Disk <52_F_Log01>: VerifyFS: Ignoring failure to open file \\?\GLOBALROOT\Device\Harddisk53\Partition1\BellDesk_Log2_BKUP.ldf Error: 5.
00001728.000008b8::2014/08/05-16:56:12.724 WARN  [RES] Physical Disk <52_F_Data01>: VerifyFS: Ignoring failure to open file \\?\GLOBALROOT\Device\Harddisk11\Partition1\BellDeskEXC_1_BKUP.mdf Error: 5.
00001728.000008b8::2014/08/05-16:56:12.724 WARN  [RES] Physical Disk <52_F_Data01>: VerifyFS: Ignoring failure to open file \\?\GLOBALROOT\Device\Harddisk11\Partition1\BellDeskEXC_BKUP.mdf Error: 5.
00001728.000008b8::2014/08/05-16:56:12.724 WARN  [RES] Physical Disk <52_F_Data01>: VerifyFS: Ignoring failure to open file \\?\GLOBALROOT\Device\Harddisk11\Partition1\BellDeskEXC_log_BKUP.ldf Error: 5.
00001728.000008b8::2014/08/05-16:56:12.725 WARN  [RES] Physical Disk <52_F_Data01>: VerifyFS: Ignoring failure to open file \\?\GLOBALROOT\Device\Harddisk11\Partition1\ePO4_EPO_1_old.ndf Error: 5.
00000e8c.00007390::2014/08/05-16:56:13.692 INFO  [GUM] Node 4: Processing RequestLock 4:45497
00000e8c.000009b4::2014/08/05-16:56:13.694 INFO  [GUM] Node 4: Processing GrantLock to 4 (sent by 3 gumid: 478989)
00000e8c.000009b4::2014/08/05-16:56:17.327 INFO  [GUM] Node 4: Processing RequestLock 3:47054
00000e8c.000009b4::2014/08/05-16:56:17.327 INFO  [GUM] Node 4: Processing GrantLock to 3 (sent by 4 gumid: 478990)
00000e8c.0000566c::2014/08/05-16:56:20.148 INFO  [GUM] Node 4: Processing RequestLock 4:45498
00000e8c.000009b4::2014/08/05-16:56:20.150 INFO  [GUM] Node 4: Processing GrantLock to 4 (sent by 3 gumid: 478991)
00000e8c.000009b4::2014/08/05-16:56:27.272 INFO  [GUM] Node 4: Processing RequestLock 3:47055
00000e8c.000009b4::2014/08/05-16:56:27.272 INFO  [GUM] Node 4: Processing GrantLock to 3 (sent by 4 gumid: 478993)
00000e8c.000009b0::2014/08/05-16:56:48.736 DBG   [NETFTAPI] Signaled NetftRemoteUnreachable  event, local address 10.199.17.48:003853 remote address 10.199.17.45:003853
00000e8c.000009b0::2014/08/05-16:56:48.736 DBG   [NETFTAPI] Signaled NetftRemoteUnreachable  event, local address 10.199.17.48:003853 remote address 10.199.17.45:003853
00000e8c.000009b0::2014/08/05-16:56:48.736 DBG   [NETFTAPI] Signaled NetftRemoteUnreachable  event, local address 10.199.17.48:003853 remote address 10.199.17.45:003853
00000e8c.000009b0::2014/08/05-16:56:48.736 DBG   [NETFTAPI] Signaled NetftRemoteUnreachable  event, local address 10.199.17.48:003853 remote address 10.199.17.47:003853
00000e8c.000009b0::2014/08/05-16:56:48.736 DBG   [NETFTAPI] Signaled NetftRemoteUnreachable  event, local address 10.199.17.48:003853 remote address 10.199.17.47:003853
00000e8c.000009b0::2014/08/05-16:56:48.736 DBG   [NETFTAPI] Signaled NetftRemoteUnreachable  event, local address 10.199.17.48:003853 remote address 10.199.17.47:003853
00000e8c.000009c4::2014/08/05-16:56:48.736 INFO  [IM] got event: Remote endpoint 10.199.17.45:~3343~ unreachable from 10.199.17.48:~3343~
00000e8c.000009c4::2014/08/05-16:56:48.736 INFO  [IM] Marking Route from 10.199.17.48:~3343~ to 10.199.17.45:~3343~ as down
00000e8c.000009c4::2014/08/05-16:56:48.736 INFO  [NDP] Checking to see if all routes for route (virtual) local fe80::68fd:e989:2b47:f203:~0~ to remote fe80::a4fa:4845:1f98:c058:~0~ are down
00000e8c.000009c4::2014/08/05-16:56:48.736 INFO  [NDP] Route local 192.168.17.48:~0~ to remote 192.168.17.45:~0~ is up
00000e8c.000009c4::2014/08/05-16:56:48.736 INFO  [IM] Adding information for route Route from local 10.199.17.48:~3343~ to remote 10.199.17.47:~3343~, status: true, attributes: 0
00000e8c.000009c4::2014/08/05-16:56:48.736 INFO  [IM] Adding information for route Route from local 10.199.17.48:~3343~ to remote 10.199.17.45:~3343~, status: false, attributes: 0
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  [IM] Sending connectivity report to leader (node 1): <class mscs::InterfaceReport>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO    <fromInterface>fe516577-6c30-4b96-a6b0-38adb0ccee3e</fromInterface>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO    <upInterfaces><vector len='2'>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO      <item>fe516577-6c30-4b96-a6b0-38adb0ccee3e</item>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO      <item>073b1ff1-b43a-4a7b-9875-e0d6b8ac0b83</item>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </vector>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </upInterfaces>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO    <downInterfaces><vector len='1'>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO      <item>8a90dc45-ed7e-4f26-90c0-f36becfa3e0a</item>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </vector>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </downInterfaces>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO    <viewId>1704</viewId>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </class mscs::InterfaceReport>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  [IM] got event: Remote endpoint 10.199.17.47:~3343~ unreachable from 10.199.17.48:~3343~
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  [IM] Marking Route from 10.199.17.48:~3343~ to 10.199.17.47:~3343~ as down
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  [NDP] Checking to see if all routes for route (virtual) local fe80::68fd:e989:2b47:f203:~0~ to remote fe80::28da:1ec6:f11e:58da:~0~ are down
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  [NDP] Route local 192.168.17.48:~0~ to remote 192.168.17.47:~0~ is up
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  [IM] Adding information for route Route from local 10.199.17.48:~3343~ to remote 10.199.17.47:~3343~, status: false, attributes: 0
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  [IM] Adding information for route Route from local 10.199.17.48:~3343~ to remote 10.199.17.45:~3343~, status: false, attributes: 0
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  [IM] Sending connectivity report to leader (node 1): <class mscs::InterfaceReport>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO    <fromInterface>fe516577-6c30-4b96-a6b0-38adb0ccee3e</fromInterface>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO    <upInterfaces><vector len='1'>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO      <item>fe516577-6c30-4b96-a6b0-38adb0ccee3e</item>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </vector>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </upInterfaces>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO    <downInterfaces><vector len='2'>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO      <item>073b1ff1-b43a-4a7b-9875-e0d6b8ac0b83</item>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO      <item>8a90dc45-ed7e-4f26-90c0-f36becfa3e0a</item>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </vector>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </downInterfaces>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO    <viewId>1704</viewId>
00000e8c.000009c4::2014/08/05-16:56:48.737 INFO  </class mscs::InterfaceReport>
00000e8c.000011f0::2014/08/05-16:56:48.751 INFO  [GUM] Node 4: Processing RequestLock 1:67
00000e8c.000009b4::2014/08/05-16:56:48.752 INFO  [GUM] Node 4: Processing GrantLock to 1 (sent by 3 gumid: 478994)
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO  [IM] Changing the state of adapters according to result: <class mscs::InterfaceResult>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO    <up><vector len='2'>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO      <item>8a90dc45-ed7e-4f26-90c0-f36becfa3e0a</item>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO      <item>073b1ff1-b43a-4a7b-9875-e0d6b8ac0b83</item>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO  </vector>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO  </up>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO    <down><vector len='1'>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO      <item>fe516577-6c30-4b96-a6b0-38adb0ccee3e</item>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO  </vector>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO  </down>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO    <unreachable><vector len='0'>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO  </vector>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO  </unreachable>
00000e8c.00000fd4::2014/08/05-16:56:48.754 INFO  </class mscs::InterfaceResult>
0000143c.00000e1c::2014/08/05-16:56:48.755 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: WorkerThread: NetInterface fe516577-6c30-4b96-a6b0-38adb0ccee3e has failed. Failing resource.
0000143c.00000e1c::2014/08/05-16:56:48.755 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL54)>: WorkerThread: NetInterface fe516577-6c30-4b96-a6b0-38adb0ccee3e has failed. Failing resource.
0000143c.00000e1c::2014/08/05-16:56:48.755 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: WorkerThread: NetInterface fe516577-6c30-4b96-a6b0-38adb0ccee3e has failed. Failing resource.
0000143c.00000e1c::2014/08/05-16:56:48.756 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: WorkerThread: NetInterface fe516577-6c30-4b96-a6b0-38adb0ccee3e has failed. Failing resource.
0000143c.000068a0::2014/08/05-16:56:48.940 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: IP Interface 3811C70A (address 10.199.17.56) failed LooksAlive check, status 1117.
0000143c.000068a0::2014/08/05-16:56:48.940 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: IP Interface 3811C70A (address 10.199.17.56) failed IsAlive check, status 1117.
0000143c.000068a0::2014/08/05-16:56:48.940 WARN  [RHS] Resource SQL IP Address 1 (ENTSQL56) IsAlive has indicated failure.
00000e8c.000076c4::2014/08/05-16:56:48.940 INFO  [RCM] HandleMonitorReply: FAILURENOTIFICATION for 'SQL IP Address 1 (ENTSQL56)', gen(0) result 1.
00000e8c.000076c4::2014/08/05-16:56:48.940 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL56)) Online-->ProcessingFailure.
00000e8c.000076c4::2014/08/05-16:56:48.941 ERR   [RCM] rcm::RcmResource::HandleFailure: (SQL IP Address 1 (ENTSQL56))
00000e8c.000076c4::2014/08/05-16:56:48.941 INFO  [RCM] resource SQL IP Address 1 (ENTSQL56): failure count: 1, restartAction: 2.
00000e8c.000076c4::2014/08/05-16:56:48.941 INFO  [RCM] Will restart resource in 500 milliseconds.
00000e8c.000076c4::2014/08/05-16:56:48.941 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL56)) ProcessingFailure-->[Terminating to DelayRestartingResource].
00000e8c.000076c4::2014/08/05-16:56:48.941 INFO  [RCM] rcm::RcmGroup::ProcessStateChange: (ENTSQL56, Online --> Pending)
00000e8c.000076c4::2014/08/05-16:56:48.941 INFO  [RCM] TransitionToState(SQL Network Name (ENTSQL56)) Online-->[Terminating to OnlineCallIssued].
00000e8c.000076c4::2014/08/05-16:56:48.941 INFO  [RCM] TransitionToState(SQL Server (ENT56)) Online-->[Terminating to OnlineCallIssued].
00000e8c.000076c4::2014/08/05-16:56:48.941 INFO  [RCM] TransitionToState(SQL Server Agent (ENT56)) Online-->[Terminating to OnlineCallIssued].
0000143c.000068a0::2014/08/05-16:56:48.942 INFO  [RES] Network Name <SQL Network Name (ENTSQL56)>: Terminating resource...
0000143c.000068a0::2014/08/05-16:56:48.942 INFO  [RES] Network Name <SQL Network Name (ENTSQL56)>: Offline of resource continuing...
0000143c.00005bb8::2014/08/05-16:56:48.942 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Terminating resource...
0000143c.00005bb8::2014/08/05-16:56:48.943 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Deleting IP interface 3811C70A.
00000e8c.00005d78::2014/08/05-16:56:48.944 INFO  [GUM] Node 4: Processing RequestLock 4:45500
00000e8c.000076c4::2014/08/05-16:56:48.945 DBG   [NETFTAPI] received NsiDeleteInstance  for 10.199.17.56
00000e8c.000011f0::2014/08/05-16:56:48.945 INFO  [GUM] Node 4: Processing GrantLock to 4 (sent by 1 gumid: 478995)
00000e8c.000076c4::2014/08/05-16:56:48.946 WARN  [NETFTAPI] Failed to query parameters for 10.199.17.56 (status 80070490)
00000e8c.000076c4::2014/08/05-16:56:48.946 DBG   [NETFTAPI] Signaled NetftLocalRemove  event for 10.199.17.56
00000e8c.000076c4::2014/08/05-16:56:48.946 DBG   [NETFTAPI] Signaled NetftLocalRemove  event for 10.199.17.56
00000e8c.000076c4::2014/08/05-16:56:48.946 DBG   [NETFTAPI] Signaled NetftLocalRemove  event for 10.199.17.56
0000143c.00005bb8::2014/08/05-16:56:49.085 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Address 10.199.17.56 on adapter 10.199.17 Corp Team offline.
00000e8c.000056ac::2014/08/05-16:56:49.085 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL56)) [Terminating to DelayRestartingResource]-->DelayRestartingResource.
0000143c.00007414::2014/08/05-16:56:49.095 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x544006f
0000143c.00007414::2014/08/05-16:56:49.107 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type GPT, guid {88f87390-fc69-43d2-991a-4b128bdfeaf6}
0000143c.00007414::2014/08/05-16:56:49.118 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type GPT, guid {8293a54e-8426-4a7f-be9f-343327bebf28}
0000143c.00007414::2014/08/05-16:56:49.129 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x544001c
0000143c.00007414::2014/08/05-16:56:49.282 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: IP Interface 3A11C70A (address 10.199.17.58) failed LooksAlive check, status 1117.
0000143c.00007414::2014/08/05-16:56:49.282 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: IP Interface 3A11C70A (address 10.199.17.58) failed IsAlive check, status 1117.
0000143c.00007414::2014/08/05-16:56:49.282 WARN  [RHS] Resource SQL IP Address 1 (ENTSQL58) IsAlive has indicated failure.
00000e8c.000076c4::2014/08/05-16:56:49.282 INFO  [RCM] HandleMonitorReply: FAILURENOTIFICATION for 'SQL IP Address 1 (ENTSQL58)', gen(0) result 1.
00000e8c.000076c4::2014/08/05-16:56:49.282 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL58)) Online-->ProcessingFailure.
00000e8c.0000566c::2014/08/05-16:56:49.282 ERR   [RCM] rcm::RcmResource::HandleFailure: (SQL IP Address 1 (ENTSQL58))
00000e8c.0000566c::2014/08/05-16:56:49.283 INFO  [RCM] resource SQL IP Address 1 (ENTSQL58): failure count: 1, restartAction: 2.
00000e8c.0000566c::2014/08/05-16:56:49.283 INFO  [RCM] Will restart resource in 500 milliseconds.
00000e8c.0000566c::2014/08/05-16:56:49.283 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL58)) ProcessingFailure-->[Terminating to DelayRestartingResource].
00000e8c.0000566c::2014/08/05-16:56:49.283 INFO  [RCM] rcm::RcmGroup::ProcessStateChange: (ENTSQL58, Online --> Pending)
00000e8c.0000566c::2014/08/05-16:56:49.283 INFO  [RCM] TransitionToState(SQL Network Name (ENTSQL58)) Online-->[Terminating to OnlineCallIssued].
00000e8c.0000566c::2014/08/05-16:56:49.283 INFO  [RCM] TransitionToState(SQL Server (ENT58)) Online-->[Terminating to OnlineCallIssued].
00000e8c.0000566c::2014/08/05-16:56:49.283 INFO  [RCM] TransitionToState(SQL Server Agent (ENT58)) Online-->[Terminating to OnlineCallIssued].
0000143c.00007414::2014/08/05-16:56:49.284 INFO  [RES] Network Name <SQL Network Name (ENTSQL58)>: Terminating resource...
0000143c.00007414::2014/08/05-16:56:49.284 INFO  [RES] Network Name <SQL Network Name (ENTSQL58)>: Offline of resource continuing...
0000143c.00005bb8::2014/08/05-16:56:49.284 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Terminating resource...
0000143c.00005bb8::2014/08/05-16:56:49.284 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Deleting IP interface 3A11C70A.
00000e8c.0000566c::2014/08/05-16:56:49.285 DBG   [NETFTAPI] received NsiDeleteInstance  for 10.199.17.58
00000e8c.0000566c::2014/08/05-16:56:49.286 WARN  [NETFTAPI] Failed to query parameters for 10.199.17.58 (status 80070490)
00000e8c.0000566c::2014/08/05-16:56:49.286 DBG   [NETFTAPI] Signaled NetftLocalRemove  event for 10.199.17.58
00000e8c.0000566c::2014/08/05-16:56:49.286 DBG   [NETFTAPI] Signaled NetftLocalRemove  event for 10.199.17.58
00000e8c.0000566c::2014/08/05-16:56:49.286 DBG   [NETFTAPI] Signaled NetftLocalRemove  event for 10.199.17.58
0000143c.00005bb8::2014/08/05-16:56:49.291 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Address 10.199.17.58 on adapter 10.199.17 Corp Team offline.
00000e8c.0000566c::2014/08/05-16:56:49.292 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL58)) [Terminating to DelayRestartingResource]-->DelayRestartingResource.
0000143c.00006378::2014/08/05-16:56:49.304 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type GPT, guid {20586068-4927-4492-9403-e826e0d60ad9}
0000143c.00006378::2014/08/05-16:56:49.320 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x544006c
00001518.000025e4::2014/08/05-16:56:49.325 INFO  [RES] SQL Server <SQL Server (ENT56)>: [sqsrvres] OnlineThread: asked to terminate while waiting for QP.
0000143c.00006378::2014/08/05-16:56:49.339 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x544006d
0000143c.00006378::2014/08/05-16:56:49.350 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x54400b1
0000143c.00006378::2014/08/05-16:56:49.361 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x544001f
0000143c.00006378::2014/08/05-16:56:49.454 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: IP Interface 3411C70A (address 10.199.17.52) failed LooksAlive check, status 1117.
0000143c.00006378::2014/08/05-16:56:49.454 WARN  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: IP Interface 3411C70A (address 10.199.17.52) failed IsAlive check, status 1117.
0000143c.00006378::2014/08/05-16:56:49.454 WARN  [RHS] Resource SQL IP Address 1 (ENTSQL52) IsAlive has indicated failure.
00000e8c.000056ac::2014/08/05-16:56:49.454 INFO  [RCM] HandleMonitorReply: FAILURENOTIFICATION for 'SQL IP Address 1 (ENTSQL52)', gen(0) result 1.
00000e8c.000056ac::2014/08/05-16:56:49.454 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL52)) Online-->ProcessingFailure.
00000e8c.000076c4::2014/08/05-16:56:49.454 ERR   [RCM] rcm::RcmResource::HandleFailure: (SQL IP Address 1 (ENTSQL52))
00000e8c.000076c4::2014/08/05-16:56:49.455 INFO  [RCM] resource SQL IP Address 1 (ENTSQL52): failure count: 1, restartAction: 2.
00000e8c.000076c4::2014/08/05-16:56:49.455 INFO  [RCM] Will restart resource in 500 milliseconds.
00000e8c.000076c4::2014/08/05-16:56:49.455 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL52)) ProcessingFailure-->[Terminating to DelayRestartingResource].
00000e8c.000076c4::2014/08/05-16:56:49.455 INFO  [RCM] rcm::RcmGroup::ProcessStateChange: (ENTSQL52, Online --> Pending)
00000e8c.000076c4::2014/08/05-16:56:49.455 INFO  [RCM] TransitionToState(SQL Network Name (ENTSQL52)) Online-->[Terminating to OnlineCallIssued].
00000e8c.000076c4::2014/08/05-16:56:49.455 INFO  [RCM] TransitionToState(SQL Server (ENT52)) Online-->[Terminating to OnlineCallIssued].
00000e8c.000076c4::2014/08/05-16:56:49.455 INFO  [RCM] TransitionToState(SQL Server Agent (ENT52)) Online-->[Terminating to OnlineCallIssued].
0000143c.00006378::2014/08/05-16:56:49.456 INFO  [RES] Network Name <SQL Network Name (ENTSQL52)>: Terminating resource...
0000143c.00006378::2014/08/05-16:56:49.456 INFO  [RES] Network Name <SQL Network Name (ENTSQL52)>: Offline of resource continuing...
0000143c.00005bb8::2014/08/05-16:56:49.456 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Terminating resource...
0000143c.00005bb8::2014/08/05-16:56:49.456 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Deleting IP interface 3411C70A.
00000e8c.000076c4::2014/08/05-16:56:49.458 DBG   [NETFTAPI] received NsiDeleteInstance  for 10.199.17.52
00000e8c.000076c4::2014/08/05-16:56:49.458 WARN  [NETFTAPI] Failed to query parameters for 10.199.17.52 (status 80070490)
00000e8c.000076c4::2014/08/05-16:56:49.458 DBG   [NETFTAPI] Signaled NetftLocalRemove  event for 10.199.17.52
00000e8c.000076c4::2014/08/05-16:56:49.458 DBG   [NETFTAPI] Signaled NetftLocalRemove  event for 10.199.17.52
00000e8c.000076c4::2014/08/05-16:56:49.458 DBG   [NETFTAPI] Signaled NetftLocalRemove  event for 10.199.17.52
0000143c.00005bb8::2014/08/05-16:56:49.461 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Address 10.199.17.52 on adapter 10.199.17 Corp Team offline.
00000e8c.000076c4::2014/08/05-16:56:49.461 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL52)) [Terminating to DelayRestartingResource]-->DelayRestartingResource.
0000143c.00007108::2014/08/05-16:56:49.473 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x5440082
0000143c.00007108::2014/08/05-16:56:49.488 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x544007c
00001728.000008b8::2014/08/05-16:56:49.502 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x5440086
0000143c.00007108::2014/08/05-16:56:49.514 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x54400ba
0000143c.00007108::2014/08/05-16:56:49.526 INFO  [RES] Physical Disk: HardDiskpGetDiskInfo: Disk is of type MBR, signature 0x54400bb
00000e8c.0000566c::2014/08/05-16:56:49.585 INFO  [RCM] Delay-restarting SQL IP Address 1 (ENTSQL56) and any waiting dependents.
00000e8c.0000566c::2014/08/05-16:56:49.585 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL56)) DelayRestartingResource-->OnlineCallIssued.
0000143c.00007108::2014/08/05-16:56:49.585 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Bringing resource online...
00000e8c.000076c4::2014/08/05-16:56:49.586 INFO  [RCM] HandleMonitorReply: ONLINERESOURCE for 'SQL IP Address 1 (ENTSQL56)', gen(1) result 997.
00000e8c.000076c4::2014/08/05-16:56:49.586 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL56)) OnlineCallIssued-->OnlinePending.
0000143c.00003878::2014/08/05-16:56:49.586 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Online thread running.
0000143c.00003878::2014/08/05-16:56:49.589 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Checking for network match: network masks 00FFFFFF=00FFFFFF and addresses 3811C70A^0011A8C0, role 1.
0000143c.00003878::2014/08/05-16:56:49.590 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Checking for network match: network masks 00FFFFFF=00FFFFFF and addresses 3811C70A^00BDD60A, role 0.
0000143c.00003878::2014/08/05-16:56:49.590 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Checking for network match: network masks 00FFFFFF=00FFFFFF and addresses 3811C70A^0011C70A, role 3.
0000143c.00003878::2014/08/05-16:56:49.596 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Online: Opened object handle for netinterface fe516577-6c30-4b96-a6b0-38adb0ccee3e.
0000143c.00003878::2014/08/05-16:56:49.597 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Online: Registered notification for netinterface fe516577-6c30-4b96-a6b0-38adb0ccee3e.
0000143c.00003878::2014/08/05-16:56:49.597 ERR   [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: NetInterface fe516577-6c30-4b96-a6b0-38adb0ccee3e has failed.
0000143c.00003878::2014/08/05-16:56:49.597 ERR   [RHS] Online for resource SQL IP Address 1 (ENTSQL56) failed.
00000e8c.0000566c::2014/08/05-16:56:49.597 INFO  [RCM] HandleMonitorReply: ONLINERESOURCE for 'SQL IP Address 1 (ENTSQL56)', gen(1) result 5018.
00000e8c.0000566c::2014/08/05-16:56:49.597 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL56)) OnlinePending-->ProcessingFailure.
00000e8c.000056ac::2014/08/05-16:56:49.597 ERR   [RCM] rcm::RcmResource::HandleFailure: (SQL IP Address 1 (ENTSQL56))
00000e8c.000056ac::2014/08/05-16:56:49.597 INFO  [RCM] resource SQL IP Address 1 (ENTSQL56): failure count: 2, restartAction: 2.
00000e8c.000056ac::2014/08/05-16:56:49.598 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL56)) ProcessingFailure-->[Terminating to Failed].
00000e8c.000056ac::2014/08/05-16:56:49.598 INFO  [RCM] Resource SQL IP Address 1 (ENTSQL56) is causing group ENTSQL56 to failover.  Posting worker thread.
0000143c.00007108::2014/08/05-16:56:49.598 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Terminating resource...
0000143c.00007108::2014/08/05-16:56:49.598 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL56)>: Resource is already offline.
00000e8c.000056ac::2014/08/05-16:56:49.598 INFO  [RCM] rcm::RcmGroup::Failover: (ENTSQL56)
00000e8c.00007480::2014/08/05-16:56:49.598 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL56)) [Terminating to Failed]-->Failed.
00001608.00002578::2014/08/05-16:56:49.614 INFO  [RES] SQL Server <SQL Server (ENT58)>: [sqsrvres] OnlineThread: asked to terminate while waiting for QP.
00000e8c.000076c4::2014/08/05-16:56:49.792 INFO  [RCM] Delay-restarting SQL IP Address 1 (ENTSQL58) and any waiting dependents.
00000e8c.000076c4::2014/08/05-16:56:49.792 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL58)) DelayRestartingResource-->OnlineCallIssued.
0000143c.00007108::2014/08/05-16:56:49.792 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Bringing resource online...
00000e8c.00002b98::2014/08/05-16:56:49.793 INFO  [RCM] HandleMonitorReply: ONLINERESOURCE for 'SQL IP Address 1 (ENTSQL58)', gen(1) result 997.
00000e8c.00002b98::2014/08/05-16:56:49.793 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL58)) OnlineCallIssued-->OnlinePending.
0000143c.00006f80::2014/08/05-16:56:49.793 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Online thread running.
0000143c.00006f80::2014/08/05-16:56:49.796 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Checking for network match: network masks 00FFFFFF=00FFFFFF and addresses 3A11C70A^0011A8C0, role 1.
0000143c.00006f80::2014/08/05-16:56:49.797 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Checking for network match: network masks 00FFFFFF=00FFFFFF and addresses 3A11C70A^00BDD60A, role 0.
0000143c.00006f80::2014/08/05-16:56:49.798 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Checking for network match: network masks 00FFFFFF=00FFFFFF and addresses 3A11C70A^0011C70A, role 3.
0000143c.00006f80::2014/08/05-16:56:49.804 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Online: Opened object handle for netinterface fe516577-6c30-4b96-a6b0-38adb0ccee3e.
0000143c.00006f80::2014/08/05-16:56:49.804 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Online: Registered notification for netinterface fe516577-6c30-4b96-a6b0-38adb0ccee3e.
0000143c.00006f80::2014/08/05-16:56:49.804 ERR   [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: NetInterface fe516577-6c30-4b96-a6b0-38adb0ccee3e has failed.
0000143c.00006f80::2014/08/05-16:56:49.804 ERR   [RHS] Online for resource SQL IP Address 1 (ENTSQL58) failed.
00000e8c.000076c4::2014/08/05-16:56:49.804 INFO  [RCM] HandleMonitorReply: ONLINERESOURCE for 'SQL IP Address 1 (ENTSQL58)', gen(1) result 5018.
00000e8c.000076c4::2014/08/05-16:56:49.804 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL58)) OnlinePending-->ProcessingFailure.
00000e8c.0000660c::2014/08/05-16:56:49.805 ERR   [RCM] rcm::RcmResource::HandleFailure: (SQL IP Address 1 (ENTSQL58))
00000e8c.0000660c::2014/08/05-16:56:49.805 INFO  [RCM] resource SQL IP Address 1 (ENTSQL58): failure count: 2, restartAction: 2.
00000e8c.0000660c::2014/08/05-16:56:49.805 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL58)) ProcessingFailure-->[Terminating to Failed].
00000e8c.0000660c::2014/08/05-16:56:49.806 INFO  [RCM] Resource SQL IP Address 1 (ENTSQL58) is causing group ENTSQL58 to failover.  Posting worker thread.
0000143c.00007108::2014/08/05-16:56:49.806 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Terminating resource...
0000143c.00007108::2014/08/05-16:56:49.806 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL58)>: Resource is already offline.
00000e8c.0000660c::2014/08/05-16:56:49.806 INFO  [RCM] rcm::RcmGroup::Failover: (ENTSQL58)
00000e8c.00007480::2014/08/05-16:56:49.806 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL58)) [Terminating to Failed]-->Failed.
00000e8c.00007480::2014/08/05-16:56:49.961 INFO  [RCM] Delay-restarting SQL IP Address 1 (ENTSQL52) and any waiting dependents.
00000e8c.00007480::2014/08/05-16:56:49.961 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL52)) DelayRestartingResource-->OnlineCallIssued.
0000143c.00007108::2014/08/05-16:56:49.961 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Bringing resource online...
00000e8c.00003170::2014/08/05-16:56:49.962 INFO  [RCM] HandleMonitorReply: ONLINERESOURCE for 'SQL IP Address 1 (ENTSQL52)', gen(1) result 997.
00000e8c.00003170::2014/08/05-16:56:49.962 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL52)) OnlineCallIssued-->OnlinePending.
0000143c.00003720::2014/08/05-16:56:49.962 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Online thread running.
0000143c.00003720::2014/08/05-16:56:49.965 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Checking for network match: network masks 00FFFFFF=00FFFFFF and addresses 3411C70A^0011A8C0, role 1.
0000143c.00003720::2014/08/05-16:56:49.966 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Checking for network match: network masks 00FFFFFF=00FFFFFF and addresses 3411C70A^00BDD60A, role 0.
0000143c.00003720::2014/08/05-16:56:49.967 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Checking for network match: network masks 00FFFFFF=00FFFFFF and addresses 3411C70A^0011C70A, role 3.
0000143c.00003720::2014/08/05-16:56:49.973 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Online: Opened object handle for netinterface fe516577-6c30-4b96-a6b0-38adb0ccee3e.
0000143c.00003720::2014/08/05-16:56:49.973 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Online: Registered notification for netinterface fe516577-6c30-4b96-a6b0-38adb0ccee3e.
0000143c.00003720::2014/08/05-16:56:49.973 ERR   [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: NetInterface fe516577-6c30-4b96-a6b0-38adb0ccee3e has failed.
0000143c.00003720::2014/08/05-16:56:49.973 ERR   [RHS] Online for resource SQL IP Address 1 (ENTSQL52) failed.
00000e8c.00007480::2014/08/05-16:56:49.973 INFO  [RCM] HandleMonitorReply: ONLINERESOURCE for 'SQL IP Address 1 (ENTSQL52)', gen(1) result 5018.
00000e8c.00007480::2014/08/05-16:56:49.973 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL52)) OnlinePending-->ProcessingFailure.
00000e8c.00003098::2014/08/05-16:56:49.973 ERR   [RCM] rcm::RcmResource::HandleFailure: (SQL IP Address 1 (ENTSQL52))
00000e8c.00003098::2014/08/05-16:56:49.974 INFO  [RCM] resource SQL IP Address 1 (ENTSQL52): failure count: 2, restartAction: 2.
00000e8c.00003098::2014/08/05-16:56:49.974 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL52)) ProcessingFailure-->[Terminating to Failed].
00000e8c.00003098::2014/08/05-16:56:49.974 INFO  [RCM] Resource SQL IP Address 1 (ENTSQL52) is causing group ENTSQL52 to failover.  Posting worker thread.
0000143c.00007108::2014/08/05-16:56:49.974 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Terminating resource...
0000143c.00007108::2014/08/05-16:56:49.974 INFO  [RES] IP Address <SQL IP Address 1 (ENTSQL52)>: Resource is already offline.
00000e8c.0000566c::2014/08/05-16:56:49.974 INFO  [RCM] rcm::RcmGroup::Failover: (ENTSQL52)
00000e8c.00003170::2014/08/05-16:56:49.975 INFO  [RCM] TransitionToState(SQL IP Address 1 (ENTSQL52)) [Terminating to Failed]-->Failed.
000017a0.00000c50::2014/08/05-16:56:50.352 INFO  [RES] SQL Server <SQL Server (ENT52)>: [sqsrvres] OnlineThread: asked to terminate while waiting for QP.

This is just a cut out of the logs but if you need more info just let me know. 

Thanks for any help!

How to configure current SQL high availability cluster using mirroring with dedicated replication NICS?

$
0
0
We have a current HA cluster at center1 which is mirrored to another HA cluster in center2.   We have several instances already installed and working which are using one NIC for data and replication.  We want to prevent mirror failovers by configuring a NIC on a replication network which has no DNS server.   What are the steps to configure the current SQL instances to use this dedicated NIC for mirror replication? 

Proper steps to fail over to another host in a cluster

$
0
0

Hello,

Pardon my ignorance.  What is the proper steps to force a fail over to the standby host in a cluster with two nodes?

My secondary host is the currently the active host for custer name. I would like to force it to fail to the primary, which is acting as a standby.  Thank you in advance.

Windows Server 2012 R2 Failover Cluster

$
0
0

Hello!

Is it supported configuration to separate File Witness Server and Windows 2012 R2 Cluster nodes in Dynamic Quorum by NAT/PAT/Firewall in internal network?

Thanks in advance.

Host name changes in Failover clustering

$
0
0

Hello all,

We have following setup,

2 servers of windows server 2012 standard...they are in failover clustering

SQL server 2012 is also there.

The customer has asked us to change hostname of servers.so we changed the hostname of servers and now we r unable to see disk(SAN),unable to connect CLUSTER ,unable to add  theses nodes in cluster and as no disk in computer management no SQL server database.

Please suggest what to do??

HOw to change hostname of servers without affecting failover clustering.


How to perform disk validation against a Single/Selected LUNs in Windows clustering ??

$
0
0

Hello All,

I have a scenario here where I should need to perform a disk validation(only disk, not other tests)  against newly added LUNs in windows cluster(which are in available storage) without taking existing LUN or SQL services Offline. When I check in failover cluster manager, I don't see any options like running validation check against a single LUN. Can someone please guide on this. Appreciate your time and help on this. Didn't get accurate information on googling.

Environment : Windows Server 2008R2 SP1

Thank you.


CAU: The plug-in argument HotfixRootFolderPath has invalid value

$
0
0

I am trying to apply a hotfix using CAU and have configured the self-updating options.  When I come to preview the update for the cluster and select Microsoft.Hotfixplugin as the plugin, I get the following error:

I am unsure why this is being generate or how to correct the problem.

Any ideas?

Unable to bring cluster group online

$
0
0

Hi,

I have a Windows 2008 2 node cluster.

We have SQL on node 1.  The cluster group is on node 2 right now.

Over the weekend, the cluster group failed and will not come back online. We were unable to connect via cluster manager through the name.  Able to connect on the server, but only see cluster events and cluster status 'down'.  The cluster service on this node is on.  Here is cluster GROUP output:

Cluster IP Address - Offline

Cluster Name - Offline

Quorum - Failed

When attempting to online the group we receive:

System error 5908 has occurred (0x00001714).
The group is unable to accept the request since it is moving to another node.

The cluster log contains many instances of this error:

00000b30.00001b7c::2014/08/18-16:31:07.502 ERR   [RCM] rcm::RcmResControl::DoResourceControl: ERROR_RESOURCE_CALL_TIMED_OUT(5910)' because of 'Failed to wait for pending resource control call to Quorum.

The Q: drive is active on node 2 and I am able to browse and write files to it.

Here is the last 10 minutes of cluster log.  I tried to online the group during that time.  Thanks for your review.

-------------------------------------------------------------------------------------------------------------

00000b30.00002728::2014/08/18-16:24:59.737 ERR   [RCM] rcm::RcmResControl::DoResourceControl: ERROR_RESOURCE_CALL_TIMED_OUT(5910)' because of 'Failed to wait for pending resource control call to Quorum.

'
00000b30.00002728::2014/08/18-16:24:59.737 WARN  [RCM] ResourceControl(GET_COMMON_PROPERTIES) to Quorum returned 5910.
00000b30.0000138c::2014/08/18-16:25:17.883 ERR   [RCM] rcm::RcmResControl::DoResourceControl: ERROR_RESOURCE_CALL_TIMED_OUT(5910)' because of 'Failed to wait for pending resource control call to Quorum.

'
00000b30.0000138c::2014/08/18-16:25:17.883 WARN  [RCM] ResourceControl(STORAGE_GET_DISK_INFO_EX) to Quorum returned 5910.
000012c4.000018c0::2014/08/18-16:26:07.689 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqclus.dll
00000b30.000022f8::2014/08/18-16:26:07.689 INFO  [RCM] rcm::RcmResType::LoadDll: Got error 126; will attempt to load mqclus.dll via Wow64.
00001330.00001968::2014/08/18-16:26:07.689 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqclus.dll
00000b30.000022f8::2014/08/18-16:26:07.689 WARN  [RCM] Failed to load restype MSMQ: error 126.
000012c4.000018c0::2014/08/18-16:26:07.689 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqtgclus.dll
00000b30.000022f8::2014/08/18-16:26:07.689 INFO  [RCM] rcm::RcmResType::LoadDll: Got error 126; will attempt to load mqtgclus.dll via Wow64.
00001330.00001968::2014/08/18-16:26:07.689 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqtgclus.dll
00000b30.000022f8::2014/08/18-16:26:07.689 WARN  [RCM] Failed to load restype MSMQTriggers: error 126.
00000b30.00000f74::2014/08/18-16:26:25.305 ERR   [RCM] rcm::RcmResControl::DoResourceControl: ERROR_RESOURCE_CALL_TIMED_OUT(5910)' because of 'Failed to wait for pending resource control call to Quorum.

'
00000b30.00000f74::2014/08/18-16:26:25.305 WARN  [RCM] ResourceControl(GET_CLASS_INFO) to Quorum returned 5910.
00000b30.000006dc::2014/08/18-16:26:41.844 INFO  [RCM] rcm::RcmApi::OnlineGroup: (Cluster Group)
00000b30.000006dc::2014/08/18-16:26:41.844 INFO  [GUM] Node 2: Processing RequestLock 2:207
00000b30.00002618::2014/08/18-16:26:41.922 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285421)
00000b30.000006dc::2014/08/18-16:26:41.922 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:42.031 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:26:43.046 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:43.046 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:26:44.060 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:44.060 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:26:45.074 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:45.074 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:26:46.088 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:46.088 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:26:47.102 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:47.102 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:26:48.117 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:48.132 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:26:49.146 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:49.146 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:26:50.161 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:50.176 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:26:51.190 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:26:51.206 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:26:54.295 INFO  [GUM] Node 2: Processing RequestLock 1:22512
00000b30.00002618::2014/08/18-16:26:54.295 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285431)
00000b30.000006dc::2014/08/18-16:27:01.223 INFO  [GUM] Node 2: Processing RequestLock 2:217
00000b30.00002618::2014/08/18-16:27:01.223 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285432)
00000b30.000006dc::2014/08/18-16:27:01.223 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:27:01.223 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:27:11.241 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:27:11.241 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:27:21.258 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:27:21.289 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:27:23.614 INFO  [GUM] Node 2: Processing RequestLock 1:22513
00000b30.00002618::2014/08/18-16:27:23.614 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285435)
00000b30.000006dc::2014/08/18-16:27:31.306 INFO  [GUM] Node 2: Processing RequestLock 2:220
00000b30.00002618::2014/08/18-16:27:31.306 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285436)
00000b30.000006dc::2014/08/18-16:27:31.306 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:27:31.306 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:27:41.323 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:27:41.355 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:27:51.372 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:27:51.372 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:27:54.305 INFO  [GUM] Node 2: Processing RequestLock 1:22514
00000b30.00002618::2014/08/18-16:27:54.305 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285439)
00000b30.000006dc::2014/08/18-16:28:01.389 INFO  [GUM] Node 2: Processing RequestLock 2:223
00000b30.00002618::2014/08/18-16:28:01.389 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285440)
00000b30.000006dc::2014/08/18-16:28:01.389 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:28:01.389 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:28:11.406 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:28:11.422 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:28:21.439 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:28:21.455 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:28:23.624 INFO  [GUM] Node 2: Processing RequestLock 1:22515
00000b30.00002618::2014/08/18-16:28:23.624 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285443)
00000b30.000006dc::2014/08/18-16:28:31.472 INFO  [GUM] Node 2: Processing RequestLock 2:226
00000b30.00002618::2014/08/18-16:28:31.472 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285444)
00000b30.000006dc::2014/08/18-16:28:31.472 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:28:31.488 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:28:41.504 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:28:41.504 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:28:51.520 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:28:51.520 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:28:54.328 INFO  [GUM] Node 2: Processing RequestLock 1:22516
00000b30.00002618::2014/08/18-16:28:54.328 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285447)
00000b30.000006dc::2014/08/18-16:29:01.536 INFO  [GUM] Node 2: Processing RequestLock 2:229
00000b30.00002618::2014/08/18-16:29:01.536 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285448)
00000b30.000006dc::2014/08/18-16:29:01.536 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:29:01.552 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:29:11.568 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:29:11.568 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:29:21.584 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:29:21.600 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:29:23.643 INFO  [GUM] Node 2: Processing RequestLock 1:22517
00000b30.00002618::2014/08/18-16:29:23.643 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285451)
00000b30.000006dc::2014/08/18-16:29:31.616 INFO  [GUM] Node 2: Processing RequestLock 2:232
00000b30.00002618::2014/08/18-16:29:31.616 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285452)
00000b30.000006dc::2014/08/18-16:29:31.616 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:29:31.616 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:29:41.632 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:29:41.632 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:29:51.648 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:29:51.695 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:29:54.331 INFO  [GUM] Node 2: Processing RequestLock 1:22518
00000b30.00002618::2014/08/18-16:29:54.331 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285455)
00000b30.000006dc::2014/08/18-16:30:01.711 INFO  [GUM] Node 2: Processing RequestLock 2:235
00000b30.00002618::2014/08/18-16:30:01.711 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285456)
00000b30.000006dc::2014/08/18-16:30:01.711 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:30:01.711 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:30:11.727 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:30:11.742 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.0000138c::2014/08/18-16:30:17.889 ERR   [RCM] rcm::RcmResControl::DoResourceControl: ERROR_RESOURCE_CALL_TIMED_OUT(5910)' because of 'Failed to wait for pending resource control call to Quorum.

'
00000b30.0000138c::2014/08/18-16:30:17.889 WARN  [RCM] ResourceControl(GET_COMMON_PROPERTIES) to Quorum returned 5910.
00000b30.00001fdc::2014/08/18-16:30:17.905 INFO  [NM] Received request from client address 10.12.13.8.
00000b30.00000f74::2014/08/18-16:30:17.905 INFO  [NM] Received request from client address 10.12.13.8.
00000b30.0000115c::2014/08/18-16:30:17.952 INFO  [NM] Received request from client address 10.12.13.8.
000012c4.000018c0::2014/08/18-16:30:18.326 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqclus.dll
00000b30.00000f74::2014/08/18-16:30:18.326 INFO  [RCM] rcm::RcmResType::LoadDll: Got error 126; will attempt to load mqclus.dll via Wow64.
00001330.00000a10::2014/08/18-16:30:18.326 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqclus.dll
00000b30.00000f74::2014/08/18-16:30:18.326 WARN  [RCM] Failed to load restype MSMQ: error 126.
00000b30.00000f74::2014/08/18-16:30:18.326 WARN  [RCM] rcm::RcmApi::ResTypeControl: ResType MSMQ's DLL is not present on this node.  Attempting to find a good node...
000012c4.000018c0::2014/08/18-16:30:18.326 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqclus.dll
00000b30.00002120::2014/08/18-16:30:18.326 INFO  [RCM] rcm::RcmResType::LoadDll: Got error 126; will attempt to load mqclus.dll via Wow64.
00001330.00000a10::2014/08/18-16:30:18.326 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqclus.dll
00000b30.00002120::2014/08/18-16:30:18.326 WARN  [RCM] Failed to load restype MSMQ: error 126.
000012c4.000018c0::2014/08/18-16:30:18.326 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqtgclus.dll
00000b30.00000f74::2014/08/18-16:30:18.326 INFO  [RCM] rcm::RcmResType::LoadDll: Got error 126; will attempt to load mqtgclus.dll via Wow64.
00001330.00000a10::2014/08/18-16:30:18.326 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqtgclus.dll
00000b30.00000f74::2014/08/18-16:30:18.326 WARN  [RCM] Failed to load restype MSMQTriggers: error 126.
00000b30.00000f74::2014/08/18-16:30:18.326 WARN  [RCM] rcm::RcmApi::ResTypeControl: ResType MSMQTriggers's DLL is not present on this node.  Attempting to find a good node...
000012c4.000018c0::2014/08/18-16:30:18.326 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqtgclus.dll
00000b30.00002120::2014/08/18-16:30:18.326 INFO  [RCM] rcm::RcmResType::LoadDll: Got error 126; will attempt to load mqtgclus.dll via Wow64.
00001330.00000a10::2014/08/18-16:30:18.326 WARN  [RHS] ERROR_MOD_NOT_FOUND(126), unable to load resource DLL mqtgclus.dll
00000b30.00002120::2014/08/18-16:30:18.326 WARN  [RCM] Failed to load restype MSMQTriggers: error 126.
00000b30.000006dc::2014/08/18-16:30:21.759 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:30:21.759 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:30:23.646 INFO  [GUM] Node 2: Processing RequestLock 1:22519
00000b30.00002618::2014/08/18-16:30:23.646 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285459)
00000b30.000006dc::2014/08/18-16:30:31.775 INFO  [GUM] Node 2: Processing RequestLock 2:238
00000b30.00002618::2014/08/18-16:30:31.775 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285460)
00000b30.000006dc::2014/08/18-16:30:31.775 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:30:31.775 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000021dc::2014/08/18-16:30:33.974 INFO  [NM] Received request from client address 10.12.13.8.
00000b30.000006dc::2014/08/18-16:30:41.791 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:30:41.822 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:30:51.838 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:30:51.838 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:30:54.334 INFO  [GUM] Node 2: Processing RequestLock 1:22520
00000b30.00002618::2014/08/18-16:30:54.334 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285463)
00000b30.000006dc::2014/08/18-16:31:01.854 INFO  [GUM] Node 2: Processing RequestLock 2:241
00000b30.00002618::2014/08/18-16:31:01.854 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285464)
00000b30.000006dc::2014/08/18-16:31:01.854 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:31:01.870 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00001b7c::2014/08/18-16:31:07.502 ERR   [RCM] rcm::RcmResControl::DoResourceControl: ERROR_RESOURCE_CALL_TIMED_OUT(5910)' because of 'Failed to wait for pending resource control call to Quorum.

'
00000b30.00001b7c::2014/08/18-16:31:07.502 WARN  [RCM] ResourceControl(STORAGE_GET_DISK_INFO_EX) to Quorum returned 5910.
00000b30.000019f0::2014/08/18-16:31:07.861 ERR   [RCM] rcm::RcmResControl::DoResourceControl: ERROR_RESOURCE_CALL_TIMED_OUT(5910)' because of 'Failed to wait for pending resource control call to Quorum.

'
00000b30.000019f0::2014/08/18-16:31:07.861 WARN  [RCM] ResourceControl(GET_COMMON_PROPERTIES) to Quorum returned 5910.
00000b30.000006dc::2014/08/18-16:31:11.886 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:31:11.886 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000006dc::2014/08/18-16:31:21.902 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:31:21.902 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.00002618::2014/08/18-16:31:23.665 INFO  [GUM] Node 2: Processing RequestLock 1:22521
00000b30.00002618::2014/08/18-16:31:23.665 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285467)
00000b30.000006dc::2014/08/18-16:31:31.918 INFO  [GUM] Node 2: Processing RequestLock 2:244
00000b30.00002618::2014/08/18-16:31:31.918 INFO  [GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 285468)
00000b30.000006dc::2014/08/18-16:31:31.918 INFO  [RCM] rcm::RcmGum::SetGroupPersistentState(Cluster Group,1)
00000b30.000006dc::2014/08/18-16:31:31.949 WARN  [RCM] rcm::RcmApi::OnlineGroup: retrying: Cluster Group, 5908.
00000b30.000022f8::2014/08/18-16:31:46.427 ERR   [RCM] rcm::RcmResControl::DoResourceControl: ERROR_RESOURCE_CALL_TIMED_OUT(5910)' because of 'Failed to wait for pending resource control call to Quorum.

'
00000b30.000022f8::2014/08/18-16:31:46.427 WARN  [RCM] ResourceControl(STORAGE_GET_DISK_INFO_EX) to Quorum returned 5910.
00000b30.00002618::2014/08/18-16:31:54.337 INFO  [GUM] Node 2: Processing RequestLock 1:22522
00000b30.00002618::2014/08/18-16:31:54.337 INFO  [GUM] Node 2: Processing GrantLock to 1 (sent by 2 gumid: 285469)


How to schedule Cluster logs to be generated for Microsoft Failover Clusters 2008 R2

$
0
0

Hi, As per my understanding we always have to generate Cluster logs manually on the cluster nodes to get these generated.

Is there any way we can schedule Cluster Logs to be generated every time so that it would be easy for us to analyze the issue?

Kevin


Hyper-V Fail-over cluster problem - "stuck" VM

$
0
0

I'm working on testing a Hyper-V 2-node cluster for my employer, and have run into a problem.  I suspect the problem stems from a reboot after installing some updates on the nodes and storage (which has led me to create a more stringent update policy for the cluster.)

The problem is that since the cluster has come back up, I had a VM that was "stuck" in a Pending state.  So I began hammering at it trying to "unstick" it, finally going into the Dependencies tab for the VM, and removing the dependency for the Virtual Machine Configuration resource.  This has led to my current problem.

Now, in the Services and applications section of the Cluster Manager, I have the following:

In the left-hand pane, I have the old VM still listed, with a red X over it.  Clicking it displays <unavailable> for all the items in the Summary, and "Failed to retrieve the resources in this service or application."  Attempting to delete this VM gives "Could not delete Services and Applications {VM Name}" and the details shows "The object has been deleted from the cluster."

If anyone could help me with the following, it would be appreciated:

  1. How do I get rid of the "Failed to load the item" and the "Single Client SQL Server" items?
  2. In the future when we install updates (manually, I'm not turning on auto-updating in any of the cluster devices) do weproperly reboot?

I'm thinking the answer to #2 might be to first, migrate all the VMs to one host, update the now unused host and reboot, then migrate all the VMs back to the now updated host and repeat.  Then when updating the storage system, take the CSV offline, then reboot the storage array.  Once it comes back up, online the array, then bring the VMs back online.

Thank you all!

Jason A.


Jason A.

CAU - Cluster Aware Updating Computer Object

$
0
0

Hello,

recently I installed 2 Windows Server 2012 R2 Failover Clusters.
I prestaged the CAU Computer Objects.

After configuring the Role on the first Cluster, I went to the second and specified the wrong computer object for the CAU Role.

Basically I gave the second Cluster the same computer object than the first.
This resulted in a failure, so I removed the CAU Role and installed it again with the correct Computer Object.

However, for some reason it is still referencing to the wrong computer object, therefore CAU cannot run.

Do you know how to clean the information in the cluster re. CAU?

Thanks,
Jens


jensit.wordpress.com

The lease timeout between avaiability group and the Windows Server Failover Cluster has expired

$
0
0

Hi,

I am having some issues where I get a lease timeout from time to time.  I have a Windows 2012 Failover Cluster with 2 nodes and 2 SQL 2012 Always-on Availability Groups.  Both nodes are a physical machines and each node is the primary for an AG. 

From what I understand ifthe HealhCheckTimeoutis exceeded without the signal exchange the lease is declared 'expired' and the SQL Server resource dll reports that the SQL Server availability group no longer 'looks alive' to the Windows cluster manager.  Here are the properties I have setup which are the default settings:

LeaseTimeout - 20000

HealthCheckTimeout - 30000

VerboseLoging - 0>

FailureConditionLevel – 3

Here are the events that occur in the Application Event Viewer:

Event ID 19407:

The lease between availability group 'AG_NAME' and the Windows Server Failover Cluster has expired. A connectivity issue occurred between the instance of SQL Server and the Windows Server Failover Cluster. To determine whether the availability group is failing over correctly, check the corresponding availability group resource in the Windows Server Failover Cluster.

Event ID 35285:

The recovery LSN (120881:37533:1) was identified for the database with ID 32. This is an informational message only. No user action is required.

SQl server logs are too long to post in this box but I can send them if you request.

The AG is setup to failover automatically but it did not failover.  I am trying to figure out why the lease timed out.  Thanks.

Server 2003 Cluster service will not start

$
0
0

I have a server 2003 2-node cluster that runs an Oracle Database. One of the nodes got an McAfee update and was rebooted. Node A will not start the cluster service after the update but Node B does not have any issues in the cluster.

The Cluster log shows issues with not able to access Node B showing Network adapter issues. When I open cluster admin all the network connections show an exclamation point. 

I have done the following:

Verified all dependent services have started

Reinstalled the network drivers and rebooted.

I have also ran net stat cluster service /fq

Thanks

Bill 


failover cluster - ISCSI shared disk E:/ moved to C/ClusterStorage/Volume1 is correct behaviour or not ?

$
0
0

hi ,

i just created 2 iscsi disk for quorum disk and shared disk.

1. The moment i created cluster the quorum was automatically choosen, is that correct behaviour. Cause i didt do it by myself ?

2. Also the ISCSI shared disk which was my [E: Drive] moved to [C/ClusterStorage/Volume1]  is it correct ?

Thanks

Sid


sid

Viewing all 5648 articles
Browse latest View live




Latest Images