Operations | Monitoring | ITSM | DevOps | Cloud

%term

The IoT Boom No One Talks About: Where It's Really Growing

Everyone thinks of IoT as smart home devices and wearables, but that's just the tip of the iceberg. In reality, the biggest growth is happening in enterprise and industrial IoT—powering robotic farms, warehouse automation, and critical infrastructure. François Baldassari breaks down where IoT is thriving and why it's finally delivering on its promise.

Generating Calculated Fields From Natural Language

If you’ve been using Honeycomb for a bit, you know that Calculated Fields (otherwise known as derived columns) are a powerful way to transform your events to a format that’s easier to query and understand. However, they use a lisp-esque language that can be difficult to read and a pain to write. If you dislike making Calculated Fields and want something a little easier, here’s a generative AI prompt that can generate them from natural language.

After OpsGenie: 3 Reasons Why Industry Leaders Are Migrating to PagerDuty Over JSM

OpsGenie has served many teams well for years, but with Atlassian’s OpsGenie 2027 sunset announcement and as it enters its maintenance phase, it’s time to look forward and plan your next move. Running tomorrow’s operations on yesterday’s technology isn’t just risky – it’s holding you back. This isn’t just a transition – it’s an opportunity to leap ahead.

Stop recurring IT incidents with proactive problem analysis

ITOps and Incident Management teams must manually handle high volumes of daily alerts, tickets, and incidents. This makes it challenging to spot recurring patterns that could be addressed or prevented. Without proactive problem management, teams waste time resolving repeat issues instead of focusing on higher-priority or first-time problems. Limited visibility into incident trends forces organizations to engage in reactive firefighting, diverting valuable time from addressing the root cause.

7 Java Exception Monitoring Blind Spots That SREs Must Eliminate

It’s 2 a.m. Alerts flood your dashboard. Transactions are failing, but logs offer no clues. Your SRE team is drowning in noise—while users struggle with outages. As Java workloads shift to microservices, Kubernetes, and the cloud, this problem is compounded. Exceptions cascade across tiers, triggering blame games while the root cause remains buried under fragmented logs and scattered alerts. Legacy monitoring tools overwhelm SREs with raw data but fail to connect the dots.

Let's Encrypt Stops Expiration Emails - How to Ensure Your Certificates Stay Valid with SSL Certificate Monitoring

SSL/TLS certificates are critical for secure communication, and keeping track of their expiration is essential. Until now, Let’s Encrypt has sent email notifications when certificates were about to expire. However, as of June 2025, Let’s Encrypt will discontinue these expiration emails. This change could lead to expired certificates going unnoticed, potentially causing security risks and downtime.

Monitoring in Hyperconverged Infrastructures: Challenges and Solutions

I have a not-so-secret suspicion that the dream of everyone working with technology is the Enterprise computer from Star Trek. Controlling shields, communications, engines, and everything else from a single place—and with voice commands, no less. “One button to rule them all,” as Sauron might whisper. But until that utopia becomes a reality, at least we can implement a hyperconverged infrastructure (HCI) in our organization’s technology stack.

High Cardinality Is Eating Your Storage Budget-Here's Why

Have you noticed your storage costs rising even when you're keeping an eye on them? The reason might be something easy to overlook: high cardinality data. For data engineers and developers balancing performance and costs, understanding its impact isn’t just useful—it’s key to avoiding unnecessary spending and system slowdowns.

Retrieving All Keys in Redis: Commands & Best Practices

Need to list all the keys in your Redis database? If you're debugging an issue or just checking what's stored, retrieving all keys is a useful skill for any developer. This guide covers everything you need to know—from the basic commands to the performance implications—so you can query Redis efficiently without slowing things down.

A Guide to Fixing Kafka Consumer Lag [Without Jargon]

Have you ever looked at your monitoring dashboard and wondered, "Why is my Kafka consumer lag spiking again?" It’s a common frustration. Consumer lag isn’t just an inconvenience—it’s a sign that something’s wrong with your data pipeline. When lag builds up, you're facing delayed data processing and the risk of system failures.