SQLServerCentral Editorial

Send Metrics Not Logs


This is part of a series on observability, a concept taking hold in modern software engineering.

One of the interesting things I saw in an engineering presentation on observability from Chick-fil-A was that their remote sites are sometimes bandwidth constrained. In an early version of their platform, they sent logs back to HQ, and the logs consumed all the available bandwidth, leaving the sites unable to process credit card transactions.

While most of us don't deal with lots of remote offices sending data back to a central data warehouse, we do often work in distributed environments, and we may send data to and from the cloud, or even to and from employees' remote offices. Bandwidth is very good in many parts of the world, but it isn't infinite.

In the presentation, they talked about a tool called Vector, which can work with lots of data; slice, dice, aggregate, or sample it; and then send the results to a sink. It works like many other ETL tools, with a source and a sink, along with various transforms that operate on the data in between.
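As a rough illustration of that source/transform/sink shape, a minimal Vector pipeline that tails application logs, turns lines into a counter metric, and exposes the result for scraping might look something like the TOML below. The log path, component names, and port are placeholders, so check Vector's documentation for the exact options before relying on this.

```toml
# Source: tail application log files at the remote site
[sources.app_logs]
type = "file"
include = ["/var/log/app/*.log"]   # placeholder path

# Transform: count log lines as a metric instead of
# shipping every raw line over the WAN
[transforms.log_counts]
type = "log_to_metric"
inputs = ["app_logs"]

[[transforms.log_counts.metrics]]
type = "counter"
field = "message"
name = "app_log_lines_total"

# Sink: expose the aggregated metrics for collection at HQ
[sinks.metrics_out]
type = "prometheus_exporter"
inputs = ["log_counts"]
address = "0.0.0.0:9598"
```

The net effect is that only small, aggregated metric samples leave the site, rather than every raw log line.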

It's an interesting philosophy to try to send back only the metrics that might be useful to developers or operations staff in understanding the performance of their systems. By sending only metrics, the load on downstream systems is reduced. It also lets us store less data and read metrics sooner, rather than storing all the data and processing it each time someone needs a metric.

The flip side of this approach is that the consumers of the metrics need to ensure they are getting useful, actionable information. Determining what is needed will be like any development project: something gets built, tested, iterated on, and repeated. This might even be an ongoing part of building software, as new features and new logging are added to your software or system.

In general, I prefer to have more data rather than less, but the volumes of logging and instrumentation data have grown dramatically. Some systems produce more log data than actual business data on a daily basis. As with audit data, we likely need to reduce and limit the amount of data stored long-term, while keeping the important data we find useful.

I am looking forward to trying out Vector and seeing what's possible. Good CLI-based tools that can work with data are becoming more important all the time, especially as more of us move to DevOps flows: coding our systems' operation as text, storing it in version control, and deploying on demand.
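For instance, assuming Vector's documented CLI, a pipeline definition kept in version control could be checked and run as part of a deployment script; the file path here is hypothetical.

```bash
# Validate the pipeline definition pulled from version control
vector validate ./observability/vector.toml

# Run the pipeline with that configuration
vector --config ./observability/vector.toml
```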

If you've used Vector, let us know what you think, and if you prefer another tool, share why today.
