Operations | Monitoring | ITSM | DevOps | Cloud

What SREs Can Learn from Facebook's Largest Outage

Facebook’s October 2021 outage was the type of event that gives SREs nightmares: A series of critical business apps crashed in minutes and remained unavailable for hours, disrupting more than 3.5 billion users around the world and costing about 60 million dollars. As incidents go, this was a pretty big one.

Announcing HAProxy Kubernetes Ingress Controller 1.7

We’re proud to announce the release of version 1.7 of the HAProxy Kubernetes Ingress Controller! In this release, we added support for custom resource definitions that cover most of the configuration settings. Definitions are available for the global, defaults and backend sections of the configuration. This promotes a cleaner separation of concerns between the different groups of settings and strengthens validation of those settings.

How Monitor Google Cloud Interconnect & Network Performance | Obkio

How to Monitor Google Cloud Interconnect and Network Performance Google Cloud Interconnect promises data transfers with low latency, and high availability - but how can you make sure that it’s actually performing as promised? Monitoring Google Cloud performance is the key to identifying slowdowns, high levels of packet loss, and other problems affecting Google Cloud. Measuring and monitoring is the first step to troubleshooting network problems.

Rollbar Pro Tips: Item searching and filtering

On the Items view, you can filter your Items by many different properties. Some properties are direct properties of the items themselves, while others are evaluated against the occurrences of the item. Many more search options are available via the text box. Rollbar is the leading continuous code improvement platform that proactively discovers, predicts, and remediates errors with real-time AI-assisted workflows. With Rollbar, developers continually improve their code and constantly innovate rather than spending time monitoring, investigating, and debugging.

10 years of cloud infrastructure with Eric Brewer

In this video, Google Cloud Developer Advocate, Stephanie Wong, speaks with Google Fellow, Eric Brewer, about his experience building infrastructure, including Kubernetes, over the last decade at Google. You’ll get a window into what it was like to help propel Kubernetes into one of the largest open source projects today.

GitLens for Visual Studio Code, and its Creator Eric Amodio, Join GitKraken

For those of you who don’t know me, I’m Eric Amodio, creator of GitLens. I’m an innovator, leader, architect, and seasoned full-stack developer. I started developing GitLens way back in 2016 when I fell in love with Visual Studio Code and wanted to play with what was then newly released extension support. It all started with a simple question: could I add Git insights via CodeLens (hence GitLens) to any document? Which of course was yes, and a whole lot more.

What Is Kubernetes Pod Disruption?

Kubernetes pods are the smallest deployable units in the Kubernetes platform. Each pod signals a single running process within the system and functions from a node or worker machine within Kubernetes, which may take on a virtual or physical form. Occasionally, Kubernetes pod disruptions may occur within a system, either from voluntary or involuntary causes.