Software Updates, Outages, Processes, and Protocols

This past Thursday, February 22, AT&T had a major outage on their U.S. network. For upwards of 10 hours, hundreds of thousands of customers could not make phone calls, send or receive texts, or use mobile data for apps or browsing websites. Beyond disrupting normal communication, the outage also appeared to knock out users' ability to reach emergency services like 911, raising the level of alarm. It was such a major communications event that the FCC and the U.S. Cybersecurity and Infrastructure Security Agency are now involved in the investigation.

Thankfully, the initial investigation does not appear to show a cyber-attack as the cause. Instead, AT&T's response on Friday was that the incident was caused “by the application and execution of an incorrect process used as we were expanding our network…” Inside sources have told some news outlets that it was specifically a software update that didn’t complete correctly and took essential equipment offline. In other words, somebody (or some team) didn’t follow the playbook. All I could think about on Thursday in relation to this story was #HugOps. :)

Here's the thing. In more than 20 years of IT operations, software and database development, and building multiple SaaS applications, I have personally never had to worry about a mistake, even a failed upgrade, putting people’s lives in danger. Although my teams and I have always striven to provide an excellent experience for our customers at every level, nobody lost their job if utility bill data couldn’t be entered into the application for a few hours. When we did fall short of our goal and our customer SLA was breached, it was always a learning opportunity to prepare better for next time.

The longer you work in this field, the more you recognize that preparation is key. Checklists, playbooks, and planning for how to react when the unexpected happens are essential skills to learn and build within a team. I’m positive that the folks at AT&T had all of that, and still a major outage occurred. Thankfully it appears to have been remedied in less than 10 hours, which honestly feels like a speedy recovery considering the size and scope of the network they deal with. Jokes about DNS, BGP, or an expired certificate aside, it was a good reminder for us all that thinking about, and planning for, the unexpected is always a part of our jobs. I mean, none of us want to end up in the headlines when data loss occurs, accidentally or not, right?

If you were affected by, or aware of, the outage this past week, what did it make you think about in relation to your job? Do you see opportunities to prepare in a new way the next time you release an update or modify the database schema? Give it some thought if you haven’t. And remember, in times like Thursday, it never hurts to send out some #HugOps.

.NET Framework Service Pack 1 Released

by Additional Articles

Microsoft SQL Home

Microsoft .NET Framework Service Pack 1 provides the latest updates to the .NET Framework. Service Pack 1 is highly recommended for all users of the .NET Framework, including customers of Visual Studio .NET.

2002-03-27

3,235 reads

Technical Chat: Inside with Jim Gray--The Search for Petabyte Storage

by Additional Articles

Microsoft MSDN

Please join Jim Gray, Distinguished Engineer at Microsoft Research, for this Q&A Session. Jim, the father of Structured Query Language, has been looking at LARGE databases like Google, Hotmail, BaBar, CERN, EOS/DIS, Internet Archive, and others that are either at a petabyte or will grow to a petabyte scale in the next year or so.

2002-02-28

3,353 reads

Beta Nominations for 64bit SQL Server

by Additional Articles

Microsoft MSDN

Currently, Microsoft is accepting nominations for the 64-bit version of the SQL Server beta program, code named "Liberty." The Liberty beta program is slated to start in February 2002. Important: You must have access to 64-bit hardware running Windows LE to participate in the Liberty beta program. This beta does not install on 32-bit hardware. Before signing up for the beta, verify that you have access to the appropriate hardware.

2002-01-08

3,229 reads

A New Way to View and Analyze Your Data

by Additional Articles

Microsoft SQL Home

With Data Analyzer—the brand-new Office data analysis solution—you can quickly and easily view, analyze, and share business data, giving you the power to make better business decisions.

2001-11-30

4,798 reads

SQLXML 2.0 (XML for SQL Server 2000)

by Additional Articles

Microsoft MSDN

Microsoft® SQL Server 2000 introduced several new features for querying database tables and receiving the results as an XML document. Web release 1 of SQLXML (XML for SQL Server) added Updategrams and XML Bulk Load functionality, as well as a host of other features to the SQL Server 2000 base.

2001-10-29

6,623 reads
