Latest News

Make the most of your ITSM platform for Knowledge Management in 3 easy steps

Many organizations treat knowledge management as a technology problem. As a result, their implementations often fail because the processes and the articles they produce are too complicated for people to use. ITIL knowledge management is the process of capturing, processing, storing, and sharing knowledge across the enterprise.

Announcing the ServiceNow Service Graph Connector for LogicMonitor

We are pleased to announce another integration with one of our strategic partners, ServiceNow. Available today with the launch of ServiceNow’s Paris release, LogicMonitor supports the Service Graph Connector! LogicMonitor was selected by ServiceNow as a key player and one of the first monitoring platforms invited to this new, exclusive program.

Manage AppArmor profiles in Kubernetes with kube-apparmor-manager

Discover how kube-apparmor-manager can help you manage AppArmor profiles on Kubernetes to reduce the attack surface of your cluster. AppArmor is a Linux kernel security module that supplements the standard Linux user and group-based permissions to confine programs to a limited set of resources. AppArmor can be configured for any application to reduce its potential attack surface and provide greater defense in depth.

Netdata named to the Forbes Cloud 100 Rising Stars

We’re excited to announce that we’ve been named to the Forbes 2020 Cloud 100 Rising Stars. The Cloud 100 is a list of the top 100 private cloud companies in the world, published by Forbes in partnership with Bessemer Venture Partners and Salesforce Ventures. The 20 Rising Stars are young, high-growth, category-leading cloud companies poised to join the Cloud 100 ranks.

Netdata Agent v1.25 and Cloud enhancements

The v1.25.0 release of the Netdata Agent delivers on our commitment to make our metrics collection, visualization, and troubleshooting platform more stable and usable. We enhanced our recently added Prometheus collector with user-configurable filtering and grouping, made dramatic improvements to the reliability of the Agent-Cloud link that streams metrics on demand to your browser when you use Netdata Cloud, and more. Let’s jump in and look at each improvement.

Netdata versus Datadog: root cause analysis with metric correlations

When an incident strikes, every minute spent on root cause analysis delays resolution, and the real-world consequences can be dire. Troubleshooting an event requires a certain data set: every metric, at the greatest granularity, in one place, available in real time. Limits on the number or type of metrics, collection frequency, or time to visualization can mean the difference between timely resolution and unacceptable losses in time, money, and productivity.

Introducing our first Netdata Cloud Insights feature: Metric Correlations for faster root cause analysis

Today, we are excited to launch our first Netdata Cloud Insights feature, Metric Correlations, developed to help you discover underlying issues more quickly and identify the root cause more efficiently. Read on to learn more about our approach to developing this new feature, how it works, and the benefits of incorporating it into your team’s troubleshooting workflow.

How we're making it easier to use the Loki logging system with AWS Lambda and other short-lived services

There are so many great things that can be said about Loki, and I recently wrote about them. But today, I want to talk about something technical that has been difficult for Loki users, and how we might make it easier: using Loki for short-lived services. Historically, one of Loki’s blind spots has been ingesting logs from infrastructure you don’t control, because you can’t co-locate a forwarding agent like promtail with your application logs.

CIO Insights: The New Normal and Cloud Mobility

As I wrote in a previous blog post, the world has recently undergone unprecedented changes that have wreaked havoc on CIOs as they struggle to ensure operational continuity, especially in scenarios where extreme changes happen overnight. What does operational continuity look like as businesses move forward under the new normal? In the earlier blog post, I highlighted some significant paradigm shifts the new normal includes, with a specific focus on these areas.

Detecting CVE-2020-14386 with Falco and mitigating potential container escapes

On September 14, CVE-2020-14386 was reported as a “high” severity threat. This CVE is a kernel security vulnerability that enables an unprivileged local process to gain root access to the system. CVE-2020-14386 results from a bug in the packet socket facility in the Linux kernel. It allows a bad actor to trigger memory corruption that can be exploited to hijack data and resources and, in the most severe case, to take over the system completely.