Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Choosing the Right APM for Go: 11 Tools Worth Your Time

If you’re building high-performance systems, Golang has probably earned a spot in your stack. Its speed, lightweight concurrency, and quick compile times make it ideal for scalable APIs, microservices, and distributed systems. But those same qualities that make Go powerful can make performance monitoring tricky. Goroutines run fast and in parallel, which means a simple CPU or memory graph doesn’t always tell you what’s slowing things down.

Rolling Out AI Application with Confidence: How Nexthink's AI Drive + Adopt Makes AI Compliant, Insightful, and Effective

From Microsoft Copilot to ChatGPT, AI applications are quickly becoming everyday workplace tools. But for many organizations, turning on these capabilities isn’t as simple as flipping a switch. Enterprise licenses for AI tools can cost millions, yet few companies can confidently say employees are using them effectively, or safely. The reality is that most AI rollouts start strong but stall fast.

APM vs Observability: Both-and, not either-or

I'll start this, the third and final entry in my series on APM and Observability, which was originally inspired by my contribution to an APMdigest article, by once again pointing out that APM tools can be built with observability in mind. Many are, in fact. And the ones that aren’t don’t turn into a different type of tool. In my experience, it's more that there's a difference of mindset.

Introducing Cribl Insights: A central hub for monitoring and alerts

What happens when your data pipelines slow down, drop volume, or quietly change shape? Most monitoring tools won’t catch those shifts until it’s too late—when downstream systems are already impacted, dashboards are broken, or critical information is missing. That’s why we’re excited to introduce Cribl Insights, to give you real-time visibility into every part of your Cribl environment: data flows, operations, processing, user activity, configuration changes, and more.

Introducing Cribl Notebooks: One Tab For Your Entire Investigation

Investigations move fast. Data is messy. And today’s analysts are expected to connect the dots across massive datasets and various tools—while documenting every step and sharing results with stakeholders. What does that look like? A security investigation may involve 10 or more queries—each one filtering, transforming, and analyzing data from a different angle—duplicated across multiple browser tabs so nothing gets lost.

AI-First: Agentic AI needs a new architecture

At Cribl, we’ve talked a lot about epochs. A moment in time when there was a before and after. AI, and specifically agentic AI, is an epoch. The way we work is going to forever change. There have been many such events in our lifetimes: the PC, the Internet, and the smartphone. AI will change how we work forever. Prior to the PC, there were people whose jobs were literally titled “computer”.

Teams issues are inevitable - but your users don't need to know that

Our previous blog gave a quick overview of an all-too real scenario involving poor Microsoft Teams performance and frustrated VIP users. The situation, picking up on our recent Power Moves webinar, centered on a big board meeting held over Teams that suffered from multiple call quality issues — spurring the CEO to pay a stormy visit to IT. In that case, the issue had already happened, and our point was that with native Microsoft tools, it can be hard to get to a precise root cause quickly.

Observability in Fraud Detection: How Transaction Monitoring Tools Can Help Spot Money Laundering

In today's increasingly digital financial landscape, transaction monitoring has become a critical component of global fraud detection strategies. As financial crimes evolve in complexity, institutions must strengthen their ability to detect anomalies and uncover suspicious activity before it causes damage. Observability, a concept long used in IT and data operations is now emerging as a powerful approach for improving visibility into complex financial transactions.

Real-Time Outage Alerts in Slack and 4 Ways To Set It Up

When a third-party service you depend on goes down, every minute counts. The sooner your team knows about the outage, the faster you can respond and reduce downtime. Since most IT and operations teams live in Slack, it makes sense to receive real-time outage notifications directly in Slack channels where you already collaborate. There are several ways to do this, from integrating an all-in-one status page aggregator like StatusGator, to setting up RSS feeds or building your own Slack app.

Strengthen the server back end with server URL checks

In distributed architectures, the back-end service reliability of microservice endpoints and internal APIs relies on the health of local URLs. These local URLs are not exposed to the public internet and are essential for your IT infrastructure health and automation suites. Site24x7’s server URL check is engineered for operations teams that require immediate visibility into these server-level endpoints. These granular endpoints are often overlooked by traditional external monitoring tools.