Operations | Monitoring | ITSM | DevOps | Cloud

Announcing Cloud Pathfinder: Network GPS for Infrastructure Teams

Today, we’re excited to launch Cloud Pathfinder, an AI-powered path assessment service built into Kentik Journeys. Read on to learn how Cloud Pathfinder gives you instant, turn-by-turn insight into cloud routing—mapping out every hop, gateway, VPC/VNet, and attachment along the way.

How DevSecOps Automation Should (and Will) Change Your DevOps Security Practices in 2025

Learn how to use automation to combat this year's emerging DevSecOps trends. The dream of DevSecOps has always been resilience. To focus on proactive strategies rather than reactive firefighting; to learn from failures and build something stronger and more flexible in their wake. DevSecOps adoption may have grown, but implementation remains uneven: Many teams struggle to align their security and development workflows into a cohesive system.

An Ode to OpsGenie: A Look Back at One of Ops' Most Loved Tools

With the news of OpsGenie shutting down and everyone looking for possible alternatives, we wanted to take a moment—not just to acknowledge the end, but to rewind and revisit the journey that brought them here. Over the years, it carved out a meaningful place in a competitive market, and in the workflows of thousands of teams. This is a look back at where it all began, what made OpsGenie different, and the mark it leaves behind.

Understanding Docker monitoring: A comprehensive list of key Docker metrics

In today’s fast-paced development landscape, containerization has become a cornerstone for deploying scalable and efficient applications. Docker, as one of the most popular container platforms, offers a robust environment for building and running containers. However, with great power comes the need for greater scrutiny, i.e., Docker monitoring or observability. Understanding Docker metrics is key to maintaining optimal performance and ensuring your containerized applications run smoothly.

Prometheus Monitoring in 5 Minutes: Set Up Your First Alert

Prometheus is an open-source toolkit for systems monitoring and alerting, designed to collect and store metrics as time-series data. It was initially created at SoundCloud, and has since become essential in the cloud-native ecosystem, benefiting from a powerful query language, dependable alerting functionality, and a pull-based architecture. Prometheus effectively monitors rapidly changing container environments, microservices, and cloud infrastructure. Its main benefits include.

Maximizing ROI in server monitoring: A strategic approach for businesses

According to the 2024 Statista report on global crucial data center IT outages from 2020-2023 , power disruptions have become the leading cause of outages, rising from 37% in 2020 to 52% in 2023. This shift highlights an increasing vulnerability in infrastructure reliability, making proactive server monitoring more critical than ever. Want to see real-world examples? Check out our blog on major outages in 2024 , what caused them, and key lessons for businesses.

What Grafana OnCall's Maintenance Mode Means for On-Call Teams

If you’ve been using Grafana OnCall OSS for incident management, you may have already heard the news—it’s now in maintenance mode and will be archived within one year. Grafana Labs recently announced that Grafana OnCall OSS is now in maintenance mode and will be archived in 2026. This means no new features, limited updates, and eventually, no support.

Deployment Tracking with Mezmo Live Streaming Tail

You've deployed a new feature into production. You've done your unit testing, fixed lots of bugs, your code is awesome. Now it's time for hundreds/thousands/millions of users to break...err...use your feature. You're diligent about tracking usage in real-time, and getting customer feedback when something goes wrong. You track the performance and response time impacts on the server. All is good...except...that feature isn't quite working for a specific group of users. Now what?

Calico Whisker, Your New Ally in Network Observability

With the upcoming release of Calico v3.30 on the horizon, we are excited to introduce Calico Whisker, a simple yet powerful User Interface (UI) designed to enhance network observability and policy debugging. If you’ve ever struggled to make sense of network flow logs or troubleshoot policies in a complex Kubernetes cluster, Whisker is your friend!

LogicMonitor Achieves FedRAMP "In Process" Status: AI-powered Hybrid Observability for Government Agencies

Throughout my career working with government agencies, I’ve seen firsthand how critical it is to have monitoring solutions that meet federal security requirements while delivering the visibility needed to manage complex IT environments. That’s why I’m particularly proud to announce that LogicMonitor has reached a significant milestone in its commitment to serving government agencies and public sector organizations.