Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Five Trends from SREcon Americas 2023

Last week, over five hundred SREs gathered in Santa Clara to share the latest research, tips, tricks, best practices, and more for site reliability engineering. They were joined by some of the biggest names in the reliability space. And, yes, Gremlin was there to answer any and all questions about chaos engineering and proactive reliability. After three days of great conversations and insightful talk, let’s take a look at some of the themes we heard weaving through SRECon.

What does CloudCheckr offer for Azure Cost Optimization and its Considerations?

CloudCheckr is a SaaS application that helps bring visibility and intelligence to help you lower cloud costs, maintain security and compliance, and optimize resources. The platform supports managing costs between cloud providers like AWS and Azure. This article explores the features and benefits of CloudCheckr in Azure.

What Is Observability? Examples of How It Can Help You

Observability is a powerful concept that can help you gain insight into the performance of your systems and applications. It refers to the ability to measure, monitor, analyze, and manage different aspects of an infrastructure or application—from hardware components to application code. With observability techniques such as distributed tracing, monitoring metrics, log analysis, and anomaly detection, organizations can ensure their applications run smoothly without downtime or disruption.

5 DevOps Skills Every Engineer Should Have In The Cloud Era

DevOps doesn’t necessarily look like it used to. Engineers used to build software designed for on-prem hardware; they had a specific methodology for efficient production and distribution schedules; and they didn’t interface very much with non-engineers, if at all. Today, all that has been flipped upside-down. Cloud-era DevOps engineers now must possess wildly different skill sets, and some previously non-negotiable skills have faded into the past.

What is System Hardening? Definition and Best practices

System hardening means locking down a system and reducing its attack surface: removing unnecessary software packages, securing default values to the tightest possible settings and configuring the system to only run what you explicitly require. Let’s take an example from daily life.

Lessons from hybrid working: Are businesses and networks coping?

Almost three years into the hybrid working experiment and for some, the unintended pilot has turned into an adopted model, while for others the IT complexities of dealing with a remote workforce remain a persistent headache. Although hybrid or remote working are not new concepts, there are several reasons it wasn’t a widely adopted model prior to the outbreak of the pandemic in 2020. Many of those reasons are cultural, but some are purely technical.

Azure Blob Storage Types and Cost Factors

Azure Blob storage is a popular service provided by Microsoft, offering scalable, cost-effective, and secure cloud storage solutions for various types of unstructured data. This article aims to provide a comprehensive analysis of Azure Blob storage types, their pricing models, and the key factors that impact the cost of using these services.

What an Internal Developer Platform (IDP) Really Is + Why You Should Care

The concept of the internal developer platform (IDP) isn’t new to anyone who’s been doing DevOps for the last couple decades. But the recent explosion of interest around platform engineering, particularly among business leaders, has led to a re-examination of the IDP as we know it.