
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Redefining Resilient IT: Edwin AI, Service Intelligence, and What's Next for LogicMonitor

Downtime is no longer just an inconvenience, nor is it solely a problem for the ITOps team. Since every organization is a digital business, downtime can cost millions of dollars per hour, stall innovation, and erode customer trust. Yet most IT teams are still trapped in reactive mode, scrambling across fragmented tools and drowning in alert fatigue. That model no longer works. The future of IT is about foresight, not firefighting.

Future-Proofing Your Historian with a Time Series Database

As technology scales and data volumes accelerate, organizations face a pressing challenge: how can they modernize data infrastructure without putting daily operations at risk? Data historians, specialized databases that capture and store time-stamped machine and sensor data, have long been the foundation for reliability and compliance. However, they were not designed for the openness and advanced analytics that modern workloads demand.

BYOS with Cribl Lake: Data ownership meets flexibility

Today, more than ever, organizations face a difficult balancing act: how to keep sensitive data fully under their control while still making it accessible and usable so teams can unlock the value and insights they need. Industries such as financial services, healthcare, and government agencies often must comply with strict regulations that require data to remain in environments they directly own and manage.

Cribl.Cloud Goes to Washington: Cribl.Cloud Government FedRAMP Authority to Operate Milestone

Way back in 2009, when I was serving as a second lieutenant in the U.S. Army, I worked in a network operations center for a deployed Army unit. Our mission was to provide network connectivity across central and northern Iraq. Our observability tools were incredibly limited. We had a network map that would turn nodes and network links red, yellow, or green depending on their status. We had to write down any status changes, and what we did about them, in a physical logbook.

Cribl.Cloud Government Is a New Era of Secure Cloud Telemetry for Federal Agencies

As a Co-founder and CPO at Cribl, I'm genuinely stoked that our new federal suite, Cribl.Cloud Government, has achieved an “In Process” designation under the Federal Risk and Authorization Management Program (FedRAMP). This isn’t any old milestone. We’re bringing all of Cribl’s kickass capabilities to government agencies, even those that require the strictest compliance and security standards. Because, who doesn’t love a good set of rules?

Icinga Experience: Insights from Real-World Icinga Deployments Across Industries

Modern IT environments are hybrid, distributed, and constantly growing. To keep them reliable, organizations rely on monitoring that scales, automates, and integrates seamlessly into existing workflows. We collected 24 Icinga customer stories from industries including finance, telecom, manufacturing, and public services. What unites them is the choice of Icinga as a flexible and cost-efficient alternative to proprietary monitoring tools.

Faster, more memory-efficient performance in Grafana Mimir: a closer look at Mimir Query Engine

Until recently, Grafana Mimir — our open source, horizontally scalable, multi-tenant time series database (TSDB) — has exclusively used Prometheus’ PromQL engine to evaluate queries. While the PromQL engine works great, it sometimes needs a lot of memory to run, specifically in the Mimir querier component. To address this memory consumption issue, we recently introduced Mimir Query Engine (MQE).

What is Asynchronous Job Monitoring?

Modern applications don’t process everything inside the request/response path. To keep APIs responsive, time-consuming work like image resizing, payment processing, or data syncs is moved into background queues. Workers then pick up these asynchronous jobs and run them outside the main thread. Asynchronous job monitoring is the practice of tracking these background tasks. Without this visibility, background workers become a blind spot.
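
As a rough illustration of the idea, the sketch below runs jobs on a background worker thread and records the outcome and duration of each one. It uses only the Python standard library. The names `JobStats`, `worker`, `run_jobs`, `resize_image`, and `sync_data` are hypothetical and chosen for this example; a real deployment would more likely use a job queue library and export these numbers to a monitoring backend instead of keeping them in memory.

```python
import queue
import threading
import time
from dataclasses import dataclass, field


@dataclass
class JobStats:
    """Hypothetical in-memory counters standing in for real metrics."""
    succeeded: int = 0
    failed: int = 0
    durations: list = field(default_factory=list)  # (job name, seconds)
    errors: list = field(default_factory=list)     # (job name, error text)


def worker(jobs: queue.Queue, stats: JobStats, lock: threading.Lock) -> None:
    """Pull jobs off the queue and record how each one ended."""
    while True:
        item = jobs.get()
        if item is None:  # sentinel: no more work
            jobs.task_done()
            break
        name, func = item
        started = time.monotonic()
        error = None
        try:
            func()
        except Exception as exc:  # a failed job must not kill the worker
            error = exc
        elapsed = time.monotonic() - started
        with lock:
            stats.durations.append((name, elapsed))
            if error is None:
                stats.succeeded += 1
            else:
                stats.failed += 1
                stats.errors.append((name, repr(error)))
        jobs.task_done()


def run_jobs(job_list) -> JobStats:
    """Run (name, callable) pairs on one background worker and return stats."""
    jobs: queue.Queue = queue.Queue()
    stats = JobStats()
    lock = threading.Lock()
    thread = threading.Thread(target=worker, args=(jobs, stats, lock))
    thread.start()
    for job in job_list:
        jobs.put(job)
    jobs.put(None)  # tell the worker to stop once the queue drains
    jobs.join()
    thread.join()
    return stats


def resize_image() -> None:
    time.sleep(0.01)  # stand-in for slow background work


def sync_data() -> None:
    raise RuntimeError("upstream timeout")  # simulated failure


stats = run_jobs([("resize_image", resize_image), ("sync_data", sync_data)])
print(stats.succeeded, stats.failed, stats.errors)
```

Without the bookkeeping in `worker`, the failure in `sync_data` would pass unnoticed by the code that enqueued it, which is the blind spot described above.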