Operations | Monitoring | ITSM | DevOps | Cloud

Architecting for Reliability

As modern systems become increasingly more complex, the risk of incidents and outages increases. Old approaches to reliability can sometimes be adapted to novel system designs, but other times new methods need to be invented. In this panel session moderated by Datadog’s Jason Yee, you’ll hear from SRE leaders and systems architects across the industry about how they’re designing and operating systems to achieve greater reliability.

Democratizing Observability

DevOps principles have helped many organizations improve cross-team collaboration, which has in turn led to increased reliability and velocity in the development lifecycle. In this session moderated by Jason Yee, we hear from panelists who have applied these same DevOps principles to observability, helping them unlock data-based insights and empower teams to make smarter, more informed decisions.

How to centralize thousands of data sources with Grafana: Inside Adform's observability system

Over the course of two decades, Adform grew from a dream between friends huddled in a basement to a leading advertising tech platform powering more than 25,000 clients worldwide. Success brought external accolades, but it also created the need for internal innovation to support the company’s continued growth. In 2018, Adform was still operating in startup mode, which meant developers and teams cherry-picked the tools that worked best for them.

How Automation Playbooks Double Down on the Value of SOARs

So you’ve set up a Security Orchestration, Automation and Response (SOAR) platform. You’re now ready to detect, respond to and remediate whichever threats cyberspace throws at you, right? Well, not necessarily. In order to deliver their maximum value, SOAR tools should be combined with playbooks, which can be used to drive SOAR systems and ensure that SOARs remediate threats as quickly as possible — in some cases, without even waiting on humans to respond.

Optimizing Your Kubernetes Load Testing with Speedscale

One of the major factors that come into play when deciding on a load testing tool is whether it can perform as you expect it to. There are many ways to measure how well a load testing tool performs, with the amount of requests per second undoubtedly being one of the main ways. Speedscale creates load tests from recorded traffic, so generating load is at the core of the tool.

Installing the HG Heroku Monitoring & Dashboards Add-on

HG or HostedGraphite provides a complete infrastructure and application monitoring platform from a suite of open-source monitoring tools. Depending on the setup, you can choose Hosted Graphite as your data source and view all required metrics on beautiful Grafana dashboards in real time. Hosted Graphite offers a wide range of tools, add-ons, and plugins that make it possible to measure, analyze, and visualize large amounts of data about your applications with ease.

Is Your Ecommerce Site Ready for Black Friday and Cyber Monday?

The holiday shopping season is one of the most stressful periods for operators of retail and ecommerce businesses, as the seasonal surge of holiday shoppers can put massive amounts of stress and strain on even the most well-architected websites. Here’s a recent example from 2021: The Office Depot website suffered an outage during Cyber Monday that knocked the online shop offline for hours, impacting the ability of customers to place orders online.

Sunbird Named a Top 10 Most Innovative Data Center Company to Watch by CIO Insights

We are proud to share that Sunbird was recognized by CIO Insights as one of their 10 Most Innovative Data Center Companies to Watch. "We will present the leading data center companies in the world and their contribution to making the world more digital-friendly," said Richard Thomas, Editor, CIO Insights. "They are constantly innovating and disrupting the data center space with their ability to look at the future.