Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

All Together Now: Better Debugging With Multiple Visualizations

“Nines don’t matter when users aren’t happy” is something you may have heard a time or two from folks here at Honeycomb. We often emphasize the fact that while your system can look healthy at a high level, deep down something is likely broken in ways that cause pain for users. If you are empowered to ask detailed questions about your services, you can find and understand these problems more easily.

Best serverless video tutorials

Now I’m sure this is not the case for everyone but for me personally, I like watching video tutorials on subjects that I need help with. I spent a good amount of time searching for these tutorials and while there might be a lot more of them out there I believe these are some of the best and easiest tutorials to follow but nevertheless, I’d like to keep this list updated so if you have any suggestions please let me know.

How To Give Your Employees A Voice During Digital Transformation

I recently read an article in the Wall Street Journal about the need for companies to take into account what their employees’ needs are when rolling out new technologies. Many new technology initiatives fail because employees aren’t involved in the process at some level — whether it’s helping design a solution to their problems or providing regular feedback to their leadership teams on what’s working and what they need to be successful.

Fireside chat with Kelsey Hightower, part two: on business models, the evolution of operations, and bringing value to your organization

In part one of this series, I recapped a good bit of the fireside chat with Kelsey Hightower, Caleb Hailey, and Sean Porter at Sensu Summit 2018. They discussed the evolution of monitoring tools and related DevOps technologies, and how to incorporate new tools into both modern and legacy infrastructure.

How to monitor Google Kubernetes Engine with Datadog

Google Kubernetes Engine (GKE), a service on the Google Cloud Platform (GCP), is a hosted platform for running and orchestrating containerized applications. Similar to Amazon’s Elastic Container Service (ECS), GKE manages Docker containers deployed on a cluster of machines. However, unlike ECS, GKE uses Kubernetes, an increasingly popular open source orchestrator that can deploy, schedule, and scale containers on the fly.

How Grafana Labs Effectively Pairs Loki and Kubernetes Events

As we’ve rolled out Loki internally at Grafana Labs, we wanted logs beyond just simple applications. Specifically while debugging outages due to config, Kubernetes, or node restarts, we’ve found Kubernetes events to be super useful. The Kubernetes events feature allows you to see all of the changes in a cluster, and you can get a simple overview by just retrieving them: This also captures when nodes go unresponsive and when a pod has been killed along with the reason.