Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

GrafanaCONline: Slicing Kubernetes: Raspberry Pis, monitoring and chaos

Do you ever feel like your systems lack unnecessary complexity and overengineering? This is the talk for you. Join me as I detail the creation of a Raspberry Pi-based desktop Kubernetes cluster in an attempt to overcomplicate the already complex world of stateless web application deployment. We’ll walk through building an observability platform using open source tools like Prometheus, Grafana, and Jaeger to keep a close watch on our tiny fragile microservices and then break them intentionally for our own amusement.

GrafanaCONline: Industrial process monitoring: oil and gas industry

We have been utilizing Grafana at Whiting Oil and Gas for approximately three years. In that time, we have greatly impacted the way our field assets are managed through dashboards and alerting. Instead of servers and network data, Whiting is using the application to monitor oil and gas wells in remote locations. Historically, the energy industry has deployed very expensive tools to monitor assets and visualize data.

AppOptics Application Service Map

Announcing automatic dependency mapping in SolarWinds AppOptics When your team responds to a latency alert on a single service, how do they know how (or if) the increased latency in one service affects the end user? With the SolarWinds® AppOptics™ service map, teams can easily view the relationships between services, and see how (or if) a service is connected to the end user.

Building an Effective Alert Strategy

Alerts are an essential part of performance monitoring. Alerts and notifications need to be sent out as soon as an issue is identified, allowing you to know about any problems before your customers do. In this week’s Tip Tuesday, we look at building an effective alert strategy and how to utilize Catchpoint Alerts so that you can quickly and effectively leverage the information provided to take carefully targeted action and improve your MTTR. Building an effective alert strategy is important.

Building the Future of Icinga

When we started the development of Icinga 2 we had a clear picture. We wanted a great monitoring solution for small and large environments. Back then, we imagined Icinga 2 to be a scalable monitoring tool with a dynamic configuration, enriched with a slight and fast web interface. In fact, there were some more details to that. But let’s keep it short. We worked hard to achieve our goals.

Exploring Jaeger traces with Elastic APM

Jaeger is a popular distributed tracing project hosted by the Cloud Native Computing Foundation (CNCF). In the Elastic APM 7.6.0 release we added support for ingesting Jaeger traces directly into the Elastic Stack. Elasticsearch has long been a primary storage backend for Jaeger. Due to its fast search capabilities and horizontal scalability, Elasticsearch makes an excellent choice for storing and searching trace data, along with other observability data such as logs, metrics, and uptime data.

Adopting Distributed Tracing: Finding the Right Path

Here at Sumo Logic, we share a lot of thoughts about managing data at scale, and the innovative ways we help customers address their unique use cases. It’s not just about analysis of logs. In this article, I will talk about another important observability signal: distributed traces. I will share a few observations about how we at Sumo think about the future of adoption of distributed traces, a very important concept, taking from our own experience.

Pre-Cache CDN Edge Servers with Synthetic Monitoring

A Content Delivery Network (CDN) is a collection of distributed nodes, called edge servers, connected to the same origin servers and placed local to the users’ location. If you are using a CDN, your website content is delivered to the user from the nearest edge server to their location. Without a CDN, you are putting stress on the origin server every time a user requests something.