Testing GenAI: How to approach nondeterministic software development

Michael Webster, principal engineer at CircleCI, talks to Rob about testing AI-enabled applications. In this episode, learn how to face the unique challenges posed by the probabilistic, nondeterministic nature of AI output, and why subjective evaluation criteria matter. Webster covers how model-graded evals can be used to test AI applications, and why this approach calls for caution.
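
As a rough illustration of the idea, and not the approach described in the episode, a model-graded eval can be sketched as sampling the application several times and asserting on the fraction of outputs a grader accepts, rather than on one exact string. Every name below (`pass_rate`, `fake_generate`, `fake_grade`) is hypothetical, and both the application and the grader are replaced by canned stand-ins so the sketch runs without calling any model.

```python
from itertools import cycle
from typing import Callable


def pass_rate(generate: Callable[[str], str],
              grade: Callable[[str, str], bool],
              prompt: str,
              samples: int = 8) -> float:
    """Sample the application `samples` times and return the fraction the grader accepts."""
    passed = sum(1 for _ in range(samples) if grade(prompt, generate(prompt)))
    return passed / samples


# Stand-in for a nondeterministic application: cycles through canned answers.
_canned = cycle([
    "Paris is the capital of France.",
    "The capital is Paris.",
    "It's Paris!",
    "I am not sure.",
])


def fake_generate(prompt: str) -> str:
    return next(_canned)


# Stand-in for the grader. In a real model-graded eval this would be a call to
# a second model with a grading rubric; here it is a simple keyword check.
def fake_grade(prompt: str, output: str) -> bool:
    return "paris" in output.lower()


rate = pass_rate(fake_generate, fake_grade, "What is the capital of France?")
# Assert on a threshold rather than on an exact output string.
assert rate >= 0.7, f"pass rate too low: {rate:.2f}"
```

Because the grader in a real setup is itself a model, its verdicts can be wrong or inconsistent, so it is worth spot-checking them against human judgment before trusting the threshold.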

How Flexcity used Grafana Cloud to help balance the national power grid in France

Last winter, Flexcity — a market leader in electric flexibility — faced an unprecedented challenge: Help stabilize the French national power grid, in the midst of a widespread energy crisis that loomed over Europe. As a byproduct of the Russian invasion of Ukraine, energy prices in the EU soared in 2022. And France, meanwhile, faced a nuclear power outage that winter that threatened to significantly disrupt its energy supply and increase the risk of electricity shortages.

How does SIGNL4 provide for truly reliable alerting?

An alerting solution is expected to be reliable, because a missed alert can have a significant impact on the business. At stake are IT uptime, production continuity, and other critical system conditions, and with them business processes, production workflows, revenue, the company's reputation, and even the health of employees. But what does reliable alerting actually mean, and how is it achieved?

Choosing Azure Database Services - What are the options?

Microsoft Azure offers a choice of relational and non-relational database services to support a wide range of application needs and demands. Built-in intelligence helps automate management tasks like high availability, scaling, and query performance tuning, so that applications stay available and performant. Many services offer essentially limitless database scale, and service level agreements (SLAs) usually range from 99.9% to 99.999% availability.
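
To make those availability figures concrete, the short sketch below converts an SLA percentage into the maximum downtime it permits per year. It is plain arithmetic and not tied to any Azure API; the function name and the 365-day year are assumptions made for illustration.

```python
# Minutes in a 365-day year (leap years ignored for simplicity).
MINUTES_PER_YEAR = 365 * 24 * 60


def allowed_downtime_minutes(availability_percent: float) -> float:
    """Return the maximum yearly downtime, in minutes, that an availability SLA permits."""
    return MINUTES_PER_YEAR * (1 - availability_percent / 100)


for sla in (99.9, 99.99, 99.999):
    print(f"{sla}% -> {allowed_downtime_minutes(sla):.1f} minutes/year")
```

Under these assumptions, 99.9% allows roughly 525.6 minutes (under nine hours) of downtime a year, while 99.999% allows only about 5.3 minutes.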

How To Profile and Optimize Telemetry Data: A Deep Dive

We recently had the privilege of presenting our telemetry data pipelining platform at Cloud Field Day. Today, we'd like to share a recap of our demo with you. In this demo, we explore the transformative potential of data profiling, telemetry pipeline optimization, and incident response. Foundationally, we follow an Understand, Optimize, and Respond workflow.

ServiceNow Integration

In this video, we'll cover the benefits of the ServiceNow integration and walk through the configuration process so you can follow along and set it up quickly. The integration lets you automatically create and resolve incidents based on alarms triggered in Exoprise. With incidents being sent to ServiceNow, you can automatically assign them to the correct teams based on criteria you define, or automatically escalate issues that have not yet been addressed.