Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Grafana Incident: new tools for faster, simpler incident response

At Grafana Labs, we’re committed to helping teams dramatically improve how they manage and respond to incidents. Through Grafana Incident Response & Management (IRM), we provide tools to empower teams, streamline processes, and enhance the effectiveness of incident management strategies—and we’re constantly looking for ways to make our solution even better.

Track changes in your containerized infrastructure with Container Image Trends

Datadog’s Container Images view provides key insights into every container image used in your environment, helping you quickly detect and remediate security and performance problems that can affect multiple containers in your distributed system. In addition to having a snapshot of the performance of your container fleet, it’s also critical to understand large-scale trends in security posture and resource utilization over time.

Speedrun to Signals: automated migrations are here

When we launched Signals to the world, we were excited to hear how our product resonated with many teams. But with that excitement came an understandable concern: how much time and effort will I have to put in to move from my existing provider to Signals? We hear you — that’s why we built the Signals Migrator tool. And we’re open sourcing it.

Aggregate, correlate, and act on alerts faster with AIOps-powered Event Management

Maintaining service availability is a challenge in today’s complex cloud environments. When a critical incident arises, the underlying cause can be buried in a sea of alerts from interconnected services and applications. Central operations teams often face an overload of disparate alerts, causing confusion, delayed incident response, alert fatigue, and redundant resolution efforts. These issues can negatively impact revenue and customer experience, especially during an outage.

Unveiling the power of AI in incident management

The emergence of AI opens new and innovative possibilities, simplifies operations, and boosts overall success. With AIOps, your technical organization can achieve unparalleled efficiency, productivity, and profitability. This cutting-edge technology leads us toward a brighter, more prosperous future with exciting opportunities to grow and thrive.

Easily configure which hosts can participate in CFEngine infrastructure management

Two modules are available for this task: allow-all-hosts and allow-hosts. The first module, allow-all-hosts, configures the most open situation which is to accept hosts from anywhere. This is only recommended in network restricted environments such as a local machine’s virtual machine network or other such closed down situations.

10 Best Tools to Manage Kubernetes Clusters

Managing multiple Kubernetes clusters presents a significant challenge, primarily due to operational overheads, complexity, and the steep learning curve associated with Kubernetes' complex ecosystem. As organizations scale and deploy across various environments; production, staging, and development, the need for a robust tool to streamline management becomes crucial.

Data source security in Grafana: Best practices and what to avoid

Recently, an incorrect security report was published, claiming that there’s a SQL injection attack in Grafana. As we have communicated to the security researcher, this report is wrong. Authenticated users in Grafana have the same permissions as the user configured for the underlying data source.