Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

2.5X faster and 88% cheaper error resolution with GPT-4o mini and Raygun

In May, GPT-4o was released, refining the GPT-4 architecture with native multi-modal input support, faster speeds, and a cheaper price per token. This week, with the release of GPT-4o mini, it’s even more cost-effective and quicker. This model is considered better than GPT-3.5 Turbo, being faster and smarter—a win all around. Let’s put it to the test in a real-world application to see just how good it is for software developers.

Monitoring Third Party Vendors as an Ops Engineer/SRE

Why should you monitor your third-party Cloud and SaaS vendors if you are in SRE/Ops? As part of an SRE team, your primary responsibility is ensuring the reliability of your applications. What makes you responsible for monitoring services that you don't even manage? Third-party services are just like yours - with SLAs. And outages happen, affecting you as well as many others who depend on them.

The Microsoft-CrowdStrike Outage: An In-Depth Analysis

On July 19, 2024, a significant outage impacted globally, causing widespread disruptions across various industries. This outage was primarily linked to a faulty update from CrowdStrike’s Falcon Sensor, which led to severe issues on Windows systems. CrowdStrike is a leading cybersecurity company that specializes in protecting businesses from online threats.

Securing the Foundation of Cribl Copilot

Integrations are the bread and butter of building vendor-agnostic software here at Cribl. The more connections we provide, the more choice and control customers have over their unique data strategy. Securing these integrations has challenges, but a new class of integrations is creating new challenges and testing existing playbooks: large language models. In this blog, we are going to explore why these integrations matter, investigate an example integration, and build a strategy to secure it.

How to Build a Custom OpenTelemetry Collector

Telemetry data collection and analysis are important for businesses. We're diving right in to explain the ins and outs of the OpenTelemetry Collector, including its core components, distribution selection, and customization tips for optimal data collection and integration. Whether you're new to OpenTelemetry or expanding your capabilities, this will help you effectively use the OpenTelemetry Collector in your observability strategy.

Streamlining Debugging with Lightrun Snapshots: A Superior Alternative to Trace Logging

According to a recent study, failing tests alone cost the enterprise software market an astonishing $61 billion annually. This figure mirrors the vast number of resources devoted to rectifying software failures, translating into about 620 million developer hours lost each year. On average, engineers spend 13 hours to resolve a single software failure, a statistic that paints a stark picture of the current state of debugging efficiency.

Deep Dive Into The 2024 State Of Cloud Cost Report

In April of this year, we released “The State of Cloud Cost in 2024” report. As with our previous cloud cost survey from 2022, this one was packed with data on how finance and engineering professionals use the cloud and how they manage the costs associated with that use. If you’re in the SaaS world, you’ll definitely want to tune in to this webinar we’ve created to discuss the results of the survey: Deep Dive Into The 2024 State Of Cloud Cost Report.

Status page examples

The visual presentation and aesthetics of a company's online presence are crucial in shaping the company's reputation and customer trust. One such vital aspect is the status page, which is often overlooked yet highly influential. By examining the best status page examples, we can see how a well-designed status page not only conveys reliability and professionalism but also builds users' confidence, reassuring them of the organization's dedication to maintaining transparency and excellence.