Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

A Journey through the Blameless Resource Library

From the very beginning of Blameless, we had two vital missions. First, to offer a solution to what we saw as a mounting crisis of reliability by offering a comprehensive, easy-to-use, reliability platform. Second, to educate the companies facing this crisis on the fundamentals of incident management, cutting-edge best practices, and the cultural values that sustain learning and growth.

Accelerated Remediations: How to Maximize AIOps Investments in Network Operations

So, you’ve spent some money and you’re the proud owner of a shiny new AIOps tool that helps improve your Network Operations. Network alarms are now usable, but with all the constant monitoring, supervision, and incident management, your Network Operations Center (NOC) is still overwhelmed. It’s time to pull out another stop.

Incident response and awareness acceleration: What we can learn from responders of Queenstown floods.

I was visiting Queenstown, New Zealand last week amidst the horrible floods which quickly escalated. As an incident responder myself, I was amazed at the operations and how fast responders on the ground acted in evacuating and clearing the grounds. Over 100 people were evacuated in the middle of the night with zero casualties. A commendable job. Here are some observations I made and what we can learn as incident responders ourselves..

Virtana Named in Prestigious Industry Research by Gartner

Virtana’s AI-powered platform is at the forefront of IT infrastructure management, offering a comprehensive suite of tools and services that empower IT leaders to make informed decisions on how to forecast demand and streamline operations. The rapid evolution of technology has ushered in an era of complexity and dynamism that IT leaders must navigate effectively.

Zenbleed vulnerability fix for Ubuntu

On 24 July 2023, security researchers from Google’s Information Security Engineering team disclosed a hardware vulnerability affecting AMD’s Zen 2 family of microprocessors. They dubbed this vulnerability “Zenbleed” (CVE-2023-20593), evoking memories of previous vulnerabilities like HeartBleed and hinting at its possible impact.

Configuring Python StatsD Client

Building and deploying highly scalable, distributed applications in the ever-changing landscape of software development is only half the journey. The other half is monitoring your application states and instances while recording accurate metrics. There are moments when you wish to check how many resources are being consumed, how many files are under access by the specialized process, etc. These metrics provide valuable insights into our tech stack execution and management.

What are Prometheus Functions?

Prometheus is a platform for real-time systems and event monitoring and alerting. The Prometheus project is free, open-source, and available on GitHub. Originally developed at SoundCloud, Prometheus became a project of the Cloud Native Computing Foundation in 2016, alongside other popular frameworks such as Kubernetes. The core of the project is the Prometheus server, which acts as the system’s “brain” by collecting various metrics and storing them in a time-series database.

The Best Cloud Infrastructure Automation Tools

The past decade has seen a drastic growth in the adoption of public cloud. One of the primary reasons for this is its cheaper infrastructure and ease of scale. With such rapid adoption of public cloud, the need for infrastructure automation also arises. This is because teams want to quickly provision infrastructure and automate tasks that previously took weeks in the case of traditional data centers, down to minutes in the public cloud.