
When More Incident Commanders are Better

This article has been lightly revised and reposted with the author's permission from the original article on Medium. Leading a major incident response can be extremely stressful. You have to quickly gather an ad hoc team, figure out what went wrong, identify a fix, and make sure the fix doesn't make things worse, all while senior leadership breathes down your neck. Are we having fun yet? Many people think having a dedicated incident commander role will solve the problem.

Experience Everywhere Wrap Up: Electric Energy Around the World

The Experience Everywhere tour is a wrap, and what a tour it was! We had an incredible time meeting up with our customers, partners, and DEX practitioners from all around the world to share expertise, learn, and grow. If you couldn’t make it (or even if you could), you can relive all the action now on our Experience Replays. Below, we asked a few Nexthinkers to send us their thoughts on each of the four Experience locations.

Sending Data to Elastic Security With Cribl Stream (And Making It Work With Elastic SIEM)

Cribl Stream is a real-time security and observability data processing pipeline that can be used to collect, transform, enrich, reduce, redact, and route data from a variety of sources to a variety of destinations. One of the popular destinations for Cribl users is Elastic SIEM. This blog post walks you through setting up Cribl Stream to normalize and forward data for use with Elastic Security for SIEM.

Multi-Cluster Observability Part 3: Practical Tips for Operational Success

This is the final article of a three-part series. To start at the beginning, read Part 1: Benefiting from multi-cluster setups requires familiarity with common variations and Part 2: Exploring the facets of a multi-cluster observability strategy. As companies scale software production, they lean on Kubernetes as a crucial container orchestration platform for managing, deploying and ensuring software availability.

Future-Proofing Resilience: How Manufacturers Are Navigating Growing Pains of IT/OT Convergence

The manufacturing industry is at a crossroads. With automation and emerging technologies like AI, organizations are eager to make operational and production processes more efficient. However, for many manufacturers, the rapid pace of digitizing legacy infrastructure and systems has also exposed many unanticipated hurdles, with one of the biggest being the convergence between IT and operational technology (OT).

How to Monitor MSP Networks for 360-Degree Visibility

MSPs (Managed Service Providers) have a lot of responsibility on their shoulders. They need to look after their customers' IT infrastructures and networks to ensure that they’re always up and running. But what happens when the MSP network itself isn’t performing as it should? Like any business organization, an MSP faces repercussions from network downtime. Even a minute of downtime can prevent an MSP from offering the necessary services to its clients.

OpenTelemetry Auto & Manual Instrumentation Explained with a Sample Python App

OpenTelemetry is an open-source observability project that provides a set of APIs, SDKs, and tooling for generating, collecting, and exporting telemetry data. It provides instrumentation libraries for all major programming languages. In this article, we demonstrate the automatic and manual instrumentation of Python applications. If you want to jump straight into implementation, start with the prerequisites section.

The case for Kubernetes resource limits: predictability vs. efficiency

This blog post by Grafana Labs Senior Software Engineer Milan Plžík was originally published on the Kubernetes.io blog on Nov. 16, 2023. There have been quite a lot of posts suggesting that not using Kubernetes resource limits might be a fairly useful thing (for example, For the Love of God, Stop Using CPU Limits on Kubernetes or Kubernetes: Make your services faster by removing CPU limits).