Intermittent Witness Failures

  • Hi guys,

    Any one experienced in working in GCP with SQL Clusters? My File Share Witness intermittently failing but I cannot work would why. It will be in failed state and then I'll bring it online and its fine for a while before going off again.

    Below the the only error I can fine. I have multiple Clusters pointing at a couple of FSW clusters(SMB role in the cluster hosting the disk with Vip and VClient end point). I use internal load balancers to identify the active cluster node.

    Witness Client failed to connect to the witness server at IP address [10.xx.xxx.xx] with error (The RPC server is unavailable.). This event may be suppressed for this resource for the next 12 hours if the condition persists.

    Appricate the "RPC server is unavailable" might sound obvious but services are running and I cannot see anything else to check.

    Any ideas or tests would be great!

    Thanks

    V

  • Thanks for posting your issue and hopefully someone will answer soon.

    This is an automated bump to increase visibility of your question.

  • This was removed by the editor as SPAM

  • Have you tried changing the timeouts and restart values so it takes longer to fail and gets brought back on line automatically?

  • Thanks @CC-597066 I played with this but to be honest it wasnt sufficient to have failures occuring

     

    By way of update this was resolved. We identified an issue on the probe port for the internal load balancer. This was corrected by our Network engineers and the issue has abated

Viewing 5 posts - 1 through 4 (of 4 total)

You must be logged in to reply to this topic. Login to reply