July 26, 2019 at 12:29 am
During rebuild process we change our AG sync mode settings to manual failover and async mode on both primary and secondary using queries.
ALTER AVAILABILITY GROUP *group_name* MODIFY REPLICA ON '*server_name*' WITH ( { AVAILABILITY_MODE = { SYNCHRONOUS_COMMIT | ASYNCHRONOUS_COMMIT } | FAILOVER_MODE = { AUTOMATIC | MANUAL } } )
But for some reason the above queries took almost 25 minutes to update the below AG to manual and async mode on both replicas yesterday. During this time we observed the function to check primary replica for jobs also took long(close to 10 mins) for few jobs which were scheduled to run around that time. After 25 minutes, we saw a sudden spike in worker threads which in turn caused the lease failover and cluster offline. Not much info in logs, extended events or cluster events to help why it took long.
Appreciate if any one can guide me to the right direction.
July 26, 2019 at 7:40 pm
lifting the thread.
Viewing 2 posts - 1 through 1 (of 1 total)
You must be logged in to reply to this topic. Login to reply