November 23, 2021 at 4:45 pm
Hi All,
So I want to test failover to my DR site. My current set up is 2 x replica in site A (synchronous), 1 in site B (async). It's a busy live server. How would you do this?
Options I see:
Open to any other ideas?
November 23, 2021 at 6:16 pm
Do the clients fully understand the risk of lost data and uptime in testing on the live system? Would a test on duplicate environments suffice. Variables (IP addresses, server names, permissions if you aren't careful, maybe performance if specs aren't identical) differ in a duplicated environment, so it's not a perfect test of the live system, but the client needs to be willing to take the risk.
What are the client's RPO (recovery point objective -- how many minutes of data are they willing to lose?) & RTO (recovery time objective -- how quickly do they require failover to complete?)? That should determine whether (and how long) you wait to synchronize vs. failing over immediately.
What DR approach are you using? Availability groups? mirroring? Log shipping? 3rd party/custom?
If client wants to avoid risk to live system:
November 24, 2021 at 9:28 am
So I have an AOAG set up. I'm not keen to go for it. Duplication isn't really an issue - (don't ask! there are lots of constraints here).
I'm presuming it's possible to:
Unless there's a proper prescribed way of testing. I'm not suggesting this is a good idea. Just that it's presumably logically possible. I'm just prepping for all my questions from the client.
Totally take on board that there is a risk of data loss & this should be done on a lovely test rig that duplicates all on live.
November 24, 2021 at 10:36 am
What are you testing here? A nice, safe, planned failover from A to B to check the DR site works? If that is the case then I would switch to synchronous, take a possible peformance hit and switch sites with no data loss. You will then be able to switch back, again cleanly with no data loss. This will show the DR site works once any apps are connected to it and tested.
However, if you are testing something more realistic where you've suddenly lost site A for some reason then you'd have to do a failover with (potential) dataloss. Then work out the plan to get back to site A.
November 24, 2021 at 11:20 am
Fair question. Good point made. Thanks
Viewing 5 posts - 1 through 4 (of 4 total)
You must be logged in to reply to this topic. Login to reply