
Outage in Egypt impacted AWS, GCP and Azure interregional connectivity

On Tuesday, June 7, internet users in numerous countries from East Africa to the Middle East to South Asia experienced an hours-long degradation in service due to an outage at one of the internet’s most critical chokepoints: Egypt. Beginning at approximately 12:25 UTC, multiple submarine cables connecting Europe and Asia experienced outages lasting over four hours. As I show below, the impacts were visible in several types of internet measurement data for the affected countries.

OpenObservability Talks Second Year at a Glance

I can’t believe that the OpenObservability Talks podcast is already celebrating its second anniversary. It feels like just yesterday I wrote the summary of the first year, sharing the hectic times of starting a podcast in the midst of the COVID-19 global pandemic. The pandemic has been with us most of this year too, but it didn’t stop us from bringing the latest on best-of-breed open source observability.

Recapping SLOconf 2022: SLOs are for everyone!

Did you get to attend the excellent SLOconf last month? With four different tracks and over 60 talks, covering everything from defining an SLO to the financial framing of error budgets, you, like us, may have missed a couple of things. In this handy recap, we take you through some of the juiciest sessions and point you to a few you may have overlooked. Luckily, SLOconf 2022 was designed for while-you’re-working participation and all the talks are still available.

GitKraken Client v8.6 - Faster Git LFS and beyond!

We know that everyone’s code story may be a little different, but speedier repos are something everyone can get behind. No matter where your developer adventures take you, it is important to keep all your code, configuration, and media assets together, and never leave a file behind. That is why we have been working on a lot of performance improvements for Git LFS users and have added Bitbucket Workspace support for Bitbucket Server users!

Here's why ITSM is a big deal in the manufacturing world

As a business leader in a manufacturing company, it is essential for you to manage quality service operations from field service to production and delivery. To maintain a successful manufacturing business unit, you must regularly inspect and maintain a variety of important equipment and machinery, which, no matter how high-quality, is still prone to failures and breakdowns.

Four key takeaways from our recent webinar: BigPanda picks up where Netcool left off

For years, Netcool has been omnipresent in many IT Operations organizations. That, combined with the sheer utility it once brought to the table, sometimes gave it a special sort of nostalgic reverence in IT Operations circles. But with all due respect to Netcool, there’s also little doubt the platform’s real-world utility has waned in the era of cloud and hybrid ops.

Puppet and Government: Maintaining compliance in complex hybrid cloud environments

This blog is the third in a four-part series about how Puppet can help government agencies meet compliance and security requirements. Read the second post here. Government agency IT departments know that migrating applications to the cloud can improve efficiency, increase visibility, and reduce costs. They also recognize the value in keeping some operation resources on-premises.

Netdata Agent release v1.35

The latest Netdata Agent release v1.35 introduces massive improvements for the machine learning-powered Anomaly Advisor, Metric Correlations, Kubernetes monitoring, and much more. Anomaly Advisor & on-device Machine Learning: This release features the launch of the flagship Anomaly Advisor for machine learning (ML) assisted troubleshooting. Unsupervised ML models are trained for every metric, at the edge, on your devices, enabling real-time anomaly detection across all your systems and applications.

Custom Resources with HAProxy Kubernetes Ingress Controller

HAProxy Kubernetes Ingress Controller provides custom resources named Backend, Defaults, and Global that let you manage ingress controller settings more efficiently. To start using them right away, check the documentation for steps and examples. In this blog post, you’ll learn why custom resources are such a powerful feature and see tips for getting the most out of them.

ServiceNow named worldwide AIOps market share leader by Gartner

I’m excited to announce Gartner has named ServiceNow the AIOps worldwide market share leader.1 We believe this confirms that ServiceNow® Predictive AIOps delivers on the promise of proactive IT. AIOps helps organizations drive resilience, efficiency, and proactive IT operations at scale. By distilling signal from noise, ServiceNow AIOps helps organizations better identify root causes and facilitate faster remediation.