For many of us, SQL Server just works. We might get some syntax errors if we mistype things, but for the most part, SQL Server runs smoothly in many environments. However, there are some common situations that do occur regularly, and I wonder if you can guess which errors often occur?
I saw a blog this week from the SQL Server Support group where they covered the top 25 errors that come in support calls. Their goal was to see if they could document and help people better solve problems themselves and reduce the support load.
Can you guess what the first error was? I'm assuming these are in descending order, but that's not clear. In any case, the top error was #18456, which I didn't recognize at first. Reading the documentation page shows this is the "login failed for xxx" error, which is probably my most common error. Often because I can't type a password correctly, but also because of an inability to select the right instance or user name. There are other causes, and it's nice to see a long list of things people can check.
The next error was 19407, which is a cluster communication error. If that's the second most common error, then maybe clusters and AGs need a bit more resiliency or better setup guidance. Third is an OS error with NTFS, which I've never run into.
If you flip through the list, I wonder how many of these errors are common for you. Do they come up often? I know I've seen people post on 912, which is an upgrade error and very annoying. I think some of the upgrade scripts for CUs aren't that well written and should have better error handling inside them. That would seem like an easy one to fix and reduce call volume.
There are plenty of network errors, including the "error occurred while establishing a connection" one. That one is usually is a typo from me or a misconfiguration of an instance after installation. Lots of other errors seem network or backup related, which may not be common, but those are errors that likely cause people to call Microsoft Support.
Maybe the most interesting one is 9002, log out of space. While I know lots of people might not know how to manage space, I also see lots of accidental DBAs get caught here because they set up full backups and not log backups. Their databases are small, storage is cheap, and they encounter this a year or so after they've set things up. To me, this is really low-hanging fruit by making it really easy to have an automatic backup process added for each database. Just add tooling to help make this easier, or create a job when a new database is created. If this isn't needed, let it be disabled, but for those that are installing SQL Server for some COTS application, make this a easy.
A lot of these errors are ones I'd never call support for, but I can imagine others not feeling that way. Plenty of these are errors I've never seen, but I'm glad the documentation is more than just a description of what happens. These updated pages give some possible causes and things that the user can do. That's something all of us would like in documentation when something goes wrong.