Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

It's all Chaos! And it Makes for Resilience at Scale

Chaos engineering is a practice where engineers simulate failure to see how systems respond. This helps teams proactively identify and fix preventable issues. It also helps teams prepare responses to the types of issues they cannot prevent, such as sudden hardware failure. The goal of chaos engineering is to improve the reliability and resilience of a system. As such, it is an essential part of a mature SRE solution.

AIOps POCs no longer have to be long and resource-intensive

Gartner predicts that exclusive use of AIOps and digital experience monitoring tools by large enterprises to monitor applications and infrastructure will rise from 5% in 2018 to 30% in 2023. This prediction is fast becoming a reality. AIOps is showing promising business value because it affects measurable metrics such as mean time to detect (MTTD), mean time to acknowledge (MTTA), mean time to restore/resolve (MTTR), service availability, and the percentage of automated versus manual resolutions.

How to find duplicate BLOBs in your Azure Storage Accounts

Azure Storage is like an all-you-can-eat buffet, except the more you eat, the more you pay! This has provided organisations with an almost limitless supply of storage, and as we all know, the more that’s available, the more we’ll use. Azure Storage has changed the way many organisations operate both in terms of availability and service.

What's new in Grafana Enterprise Metrics for scaling Prometheus: enhanced access control and a compactor that supports 650 million active series and beyond

I’m a fresh starter here at Grafana Labs, leading one of our teams working on the Grafana Enterprise Stack. As a longtime user of Grafana, I couldn’t wait to see what’s new in versions 1.1 and 1.2 of Grafana Enterprise Metrics (GEM), our scalable, self-hosted Prometheus service. I tried out the shiny features and wanted to share some of the cool things I found.

Three Steps To Get Started With Database DevOps

Once you’ve committed to changing your culture in order to automate your database deployments, what’s next? You’ve already done the hard part: making the decision to shift the culture. What remains is just a lot of labor. There are three things you can do to begin your Database DevOps journey. Let’s discuss these in detail. It’s important to understand not just why these are your first three steps, but why they should occur in this precise order.

How To Avoid Complex Pricing And Lengthy Contracts With Your Global Internet Access

When it comes to your business’s global internet access, network speed, security and performance are paramount. Many enterprises use a premium internet service to meet the needs of their global operations. One option available to them, which we have covered extensively already on this blog, is to use dedicated internet access across a private MPLS network. The other is to use an IP transit service.

Running commands securely in containers with Amazon ECS Exec and Sysdig

Today, AWS announced the general availability of Amazon ECS Exec, a powerful feature to allow developers to run commands inside their ECS containers. Amazon Elastic Container Service (ECS) is a fully managed container orchestration service by Amazon Web Services. ECS allows you to organize and operate container resources on the AWS cloud, and allows you to mix Amazon EC2 and AWS Fargate workloads for high scalability.

What Are Shared Services? (Your Guide For 2021)

What are shared services? How does a shared service business model work? In this article, we will be breaking down everything you need to know about shared services. Are you trying to build a request-centric operations model that places an emphasis on simplified customer experience? We will be exploring the key benefits of adopting a shared services model. There has never been a better time to start exploring the consolidation of business operations.

The Mattermost server repo surpasses 20,000 stars on GitHub

In March 2021, the Mattermost server repository officially surpassed 20,000 stars on GitHub, and we couldn’t be more excited! Huge thanks to our community for their incredible support of the Mattermost open source project and their belief in the power of secure workplace messaging solutions built for developers by developers.

Tweaking Your Monitoring Strategy for a Seamless End-User Experience

Technology and end users have an almost dichotomous relationship; as technology gets more complex, end users expect a more seamless experience. At the same time, end users are also looking for maximized application availability and performance. How can a federal IT team meet these increasing demands? The answer is monitoring. Most monitoring instances have been implemented as an afterthought or as a way to solve a single, specific problem.