
Latest Posts

What is the Difference Between Distributed and Centralized Version Control Systems

Safe and efficient source code management is crucial during application development. The code should be stored securely, with all changes meticulously documented to catch and fix any errors, especially when multiple developers are involved. This is typically done using version control systems (VCS). Version control enables teams to collaborate effectively, reduce risks, and maintain stability.

Tackling the Unsustainable Skills Challenge in Cybersecurity and Observability

This is the third and final post in a series of blog posts about the disconnect between modern IT and security teams and the vendors they’re forced to work with. The first and second posts appear earlier in the series.

Establishing and Enabling a Center of Production Excellence

Software is in a crisis. This is nothing new. Complex distributed systems are perpetually in a state far from equilibrium, operating in what Richard Cook has called a “degraded mode.” It’s through a combination of technical artifacts, organizational practices and policies, and pure gumption that they manage to maintain themselves through time. However, there are some organizations that seem to have an easier time of it than others.

Major Improvements For Linux Users In Tracealyzer v4.9

Installation on Linux has been greatly simplified in the upcoming Tracealyzer v4.9. The installation package now includes everything needed to run the software. Linux users no longer need to install dependencies like Mono or libgconf. Instead, a native Linux binary is provided (for x86-64) with the Mono runtime integrated. Most other dependencies have been replaced or removed. You are up and running in a few minutes. We have also spent a lot of time improving the overall user experience on Linux.

Making Sense of Your IoT data with AWS and MetricFire

The Internet of Things (IoT) is all the rage these days, and for good reason. It lets us connect all sorts of devices to the internet, opening up a world of possibilities. However, managing all those devices and the data they generate can be a challenge. That's where AWS and MetricFire come in. AWS offers a robust suite of cloud services called AWS IoT that makes it easy to develop and manage IoT applications. MetricFire is a platform that helps you monitor your AWS services, including your IoT devices.

AI-powered insights for continuous profiling: introducing Flame graph AI in Grafana Cloud

Like many in the observability space, we see a lot of potential in harnessing AI to enhance the developer experience. As we continue to evolve and expand our observability platform, we strive to develop features that not only solve complex problems, but make it easier to access and derive value from tools like Grafana Pyroscope.

Log Monitoring: Challenges and Best Practices for Modern Applications

Almost everyone acknowledges that log monitoring is essential for maintaining the reliability, security, and performance of modern applications. However, the complexities increase as organizations adopt diverse architectures to effectively manage the various log data challenges they encounter. In our previous blog post, we discussed the significance of log monitoring alongside a few popular log monitoring tools available in the market today.

Accelerate root-cause analysis with AIOps

The digital landscape is constantly evolving, and so is its complexity. Organizations need more efficient and effective ways to sort through high volumes of IT noise to identify the root cause of incidents. In a recent webinar with BigPanda CIO Jason Walker and Waste Management Principal Architect Udo Strick, Joe Connelly, director of monitoring, observability, and service reliability at Chipotle Mexican Grill, shared his perspective on…

Efficient data/file copying on modern Linux

Editing and copying large files or large numbers of files is slow. For a configuration management tool, it is probably one of the slowest things we do, apart from waiting for other programs to finish or waiting for network communication. In this blog post, we look at how to copy files, specifically the most performant approaches available on modern Linux systems. We are working on implementing these techniques so CFEngine and all your policy will copy files more efficiently.

5 Top Kubernetes Observability Challenges and Solutions

Observability in IT refers to the ability to measure a system's internal functioning by studying its signals from the outside. Modern IT observability is achieved through three kinds of telemetry: metrics, traces, and logs. Metrics aggregate events to gauge a system’s current state. Traces follow each transaction through the system, both to measure performance and to debug problems. Logs record each individual event, which helps during troubleshooting.
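
The difference between the three kinds of telemetry can be sketched in a few lines of Python. This is a toy, in-memory illustration only: the `Telemetry` class, the `handle_request` function, and the metric and step names are all hypothetical and do not come from the post or from any real observability library.

```python
from collections import Counter


class Telemetry:
    """Toy in-memory store for the three telemetry kinds (illustrative only)."""

    def __init__(self):
        self.metrics = Counter()  # metrics: events aggregated into counts
        self.spans = []           # traces: timed steps tagged with a transaction id
        self.logs = []            # logs: one record per event

    def count(self, name):
        self.metrics[name] += 1

    def span(self, trace_id, step, duration_ms):
        self.spans.append((trace_id, step, duration_ms))

    def log(self, message):
        self.logs.append(message)

    def trace(self, trace_id):
        # Rebuild the path of a single transaction from its recorded spans.
        return [(step, ms) for tid, step, ms in self.spans if tid == trace_id]


def handle_request(telemetry, trace_id, ok=True):
    """Hypothetical request handler that emits all three signals."""
    telemetry.span(trace_id, "auth", 2.0)
    telemetry.span(trace_id, "db_query", 15.0)
    telemetry.count("requests_total")
    if ok:
        telemetry.log(f"trace={trace_id} level=info msg='request served'")
    else:
        telemetry.count("requests_failed")
        telemetry.log(f"trace={trace_id} level=error msg='db_query failed'")


telemetry = Telemetry()
handle_request(telemetry, "a")
handle_request(telemetry, "b", ok=False)
```

After the two calls, `telemetry.metrics` holds the aggregate view (two requests, one failure), `telemetry.trace("b")` returns the steps of the failed transaction, and `telemetry.logs` holds one line per event.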