Operations | Monitoring | ITSM | DevOps | Cloud

Observability

The latest News and Information on Observabilty for complex systems and related technologies.

Discord Bot Part 2: More Observability

I’ve recently started working on a new project to build a Discord bot in Go, mostly as a way to learn more Go but also so I can use it to manage various things in Azure and potentially elsewhere. I figured it’d be useful to document some of this project to give some insights as to what I’ve done and why. Next up is the bot itself and how I integrated it into Honeycomb to get some visibility on how different commands are running.

Achieving the Observability Imperative Requires AI

The shift to Observability Over the last six months, unified monitoring, log management, and event management vendors have reoriented their technology portfolios (often without any change to the underlying functionality) towards Observability. In so doing, a fair amount of confusion has been generated in the market.

LeadDev Live 2021- Habits of highly-performing teams

There is a yawning gap opening up between the best and the rest — the elite top few percent of engineering teams are making incredible gains year over year in reliability and lack of technical drag forces, while the bottom 50% are losing ground. Take an engineer out of an elite-performing team and place them in the bottom 50%, and they become subpar too; take an engineer out of a mediocre team and embed them in an elite team, and they are pulling their weight within the year. I will share with you everything I know — everything that went into building a high-performing team at Honeycomb.

Take the first step toward SRE with Cloud Operations Sandbox

At Google Cloud, we strive to bring Site Reliability Engineering (SRE) culture to our customers not only through training on organizational best practices, but also with the tools you need to run successful cloud services. Part and parcel of that is comprehensive observability tooling—logging, monitoring, tracing, profiling and debugging—which can help you troubleshoot production issues faster, increase release velocity and improve service reliability.

Level Up 2020 Highlights

Hear from LogicMonitor leadership on some of the biggest announcements and additions to the LM product suite in 2020. We release an array of features that allow IT and Dev Ops teams to have full visibility into every corner their infrastructure, and with the addition of LM Logs we're on a mission to provide an extensible, fully unified observability platform.

How Prometheus monitoring mixins can make effective observability strategies accessible to all

Three years ago, Tom Wilkie and Frederic Branczyk sketched out the idea for Prometheus monitoring mixins. This is a jsonnet-based package format for grouping and distributing logically related Grafana dashboards with Prometheus alerts and rules. The premise was that the observability world needed a way for system authors to not only emit metrics, but also provide guidance on how to use those metrics to monitor their systems properly.

Building w/ Observability- Honeycomb & CircleCI

Do you know exactly what your builds are doing at every step of the way to prod and after they’ve been deployed? A key part of what lets you ship code to production often and quickly is having observability in your builds. Together, CircleCI and Honeycomb can help you get both speed and quality when shipping code to production. In this webinar, we’ll not only examine how CircleCI and Honeycomb work well together, we’ll also look at how Honeycomb used both products together to identify changes that impacted their build times and reduced them by 25%.

5 Tips for Observability Success

In 2020, the concept of observability in IT operations gained mindshare as IT leaders looked for new ways to rein in the complexity that’s grown organically with cloud computing and rapid digitization. Observability differs from IT monitoring in that it focuses on the development of the application and rich instrumentation so that operators can ask meaningful questions about how the software works or is working in production.

Learn How to Get Observability into Your Applications with New KubeAcademy Course

When developing applications for Kubernetes, which is a distributed system, developers and platform engineers need to know both how to monitor them and understand how they impact their business. There are several tools available to instrument Kubernetes clusters and code, but figuring out which options are the right ones can be challenging. That’s why we created the Introduction to Observability course on KubeAcademy.