Operations | Monitoring | ITSM | DevOps | Cloud

DevOps

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

The Most Important KPIs for Monitoring Mobile Games

Managing modern mobile games involves measuring and tracking dozens of metrics. Each value lets you know how well your game is doing in a specific area, such as user experience, infrastructure, and monetization, to name but a few. But not all KPIs are created equally. Some metrics are more valuable than others in terms of helping you make informed business and technical decisions about your game. That’s what we’ll take a look at in this article.

AWS And Azure Dedicated Hosts Monitoring With VirtualMetric

Amazon Web Services and Microsoft Azure do not need any formal introduction. They are two major players in the cloud computing and virtualization industry. AWS leads with about 32% market share. On the other hand, Azure is the next closest with about 17% market share. Both the tech giants, AWS and Azure, have been on an onward growth trajectory with revenue boosts coming in every quarter.

vSphere with Tanzu Makes It Easier to Create Kubernetes Clusters Using vSphere 7.0 U1

The release of vSphere 7.0 U1 brings with it a lot of new enhancements to the platform’s core, while the four new Tanzu editions make it easier to package a complete Kubernetes solution. In the meantime, using vSphere with Tanzu has been simplified even more with use of the vSphere Distributed Switch as its main networking construct. Now your vSphere environment can be transformed into a Kubernetes powerhouse.

Managing Datadog with Terraform

Terraform is an increasingly popular infrastructure-as-code tool for teams that manage cloud environments spanning many service providers. New users are often drawn to Terraform’s ability to quickly provision compute instances and similar resources from infrastructure providers, but Terraform can also manage platform-as-a-service and software-as-a-service resources.

Breaking Serverless Things on Purpose: Chaos Engineering in Stateless Environments - Emrah Samdan

Serverless enabled us to build highly distributed applications that led to more granular functions and ultimate scalability. However, it also brought the risk of failure from a single microservice to many serverless functions and resources. You might be able to predict and design for certain troublesome issues but there are many, many more that you probably will not be able to easily plan for. How do you build a resilient system under these highly distributed circumstances? The answer is Chaos Engineering: Breaking things on purpose just to experience how the whole system will react.

Chaos Engineering: The Path to Reliability - Kolton Andrus

We’re all here for the same purpose: to ensure the systems we build operate reliably. This is a difficult task, one that must balance people, process and technology during difficult conditions. We operate with incomplete information, assessing risks and dealing with emerging issues. We’ve found Chaos Engineering to be a valuable tool in addressing these concerns. Learn from real world examples what works, what doesn’t, and what the future holds.

Identifying Hidden Dependencies - Liz Fong Jones

You don't need to write automation or deploy on Kubernetes to gain benefits from resilience engineering! Learn how Honeycomb improved the reliability of our Zookeeper, Kafka, and stateful storage systems through terminating nodes on purpose. We'll discuss the initial manual experiments we ran, the bugs in our automatic replacement tools we uncovered, and what steps we needed to progress towards continuously running the experiments. Today, no node at Honeycomb lives longer than 12 months, and we automatically recycle nodes every week.

Looking back on Chaos Conf 2020

It’s already been a week since we closed our third annual Chaos Conf! While we were forced to take the conference online, this meant that more of you could join us. Over 3,500 people signed up to help make this the world’s largest Chaos Engineering conference. That’s 5x more than 2019, and nearly 10x more than 2018! This is a testament to the growth of Chaos Engineering as a practice across many different industries and around the world.