SQL Server FC 2016 on Windows 2016 - Random Cluster Failure

  • Hi,

    For about a week my cluster is failing. It starts with event log:

    svchost (1712) SoftwareUsageMetrics-Svc: A request to write to the file "C:\Windows\system32\LogFiles\Sum\Svc.log" at offset 1843200 (0x00000000001c2000) for 4096 (0x00001000) bytes has not completed for 36 second(s). This problem is likely due to faulty hardware. Please contact your hardware vendor for further assistance diagnosing the problem.

    and immediately:

    Fault bucket , type 0

    Event Name: FSC_watchdog_timeout_all_cluster_nodes

    Response: Not available

    Cab Id: 0

    Problem signature:

    P1: GumLockIsStuck

    After that is down the drain. One of the nodes freezes, SQL instances/roles are green but inaccessible. The server isn't releasing them unless, I just turn it off. This is when the roles migrate. It is not hardware as it can happen on any of the three nodes.

    I have seen similar problem on various searches but none looks to have a solution.

    Did anyone experience such a thing? How can it be fixed?

    Any hints are highly appreciated,

     

    dg

     

  • This was removed by the editor as SPAM

  • Thanks for posting your issue and hopefully someone will answer soon.

    This is an automated bump to increase visibility of your question.

Viewing 3 posts - 1 through 2 (of 2 total)

You must be logged in to reply to this topic. Login to reply