Operations | Monitoring | ITSM | DevOps | Cloud

Our Check Overview Page Has a Fresh New Look

We are very excited to announce that we redesigned our monitoring results chart to make it easier for you to understand check performance over time and easily investigate any past anomaly. The redesign is a result of our UX research that showed that the old check overview chart made it challenging for users to find check results from the past. While we were redesigning our monitoring results charts, we wanted to achieve two things: And, we achieved this in three attempts. Let’s dive in.

The AIOPs and Automation Handshake: Managing the Modern IT Stack

To increase business agility, IT organizations are deploying dynamic, modern architectures enabled by virtualization technologies. That includes containers, elastic clouds, microservices, and virtual machines. If you are rethinking your IT stack, you must also reconsider its management. IT operational silos limit business velocity.

Which is Better for Monitoring: Datadog or AWS CloudWatch?

Observability is the process of understanding complex systems by analyzing their outcomes and enhancing those outcomes by monitoring events within the system. Today, observability is essential for IT services to achieve a better user experience and optimize software performance. With cloud platforms dominating the IT services landscape, organizations are inclined to deploy their software and hardware systems in the cloud to reduce operational costs and enhance flexibility.

How the Prometheus community is investing in OpenTelemetry

Goutham Veeramachaneni, a product manager at Grafana Labs, and Carrie Edwards, a senior software engineer at Grafana Labs, are both contributors to the Prometheus open source project. This post, which they wrote together, was originally published on the Prometheus.io blog in March 2024. The OpenTelemetry project is an observability framework and toolkit designed to create and manage telemetry data such as traces, metrics, and logs.

Beyond Microservices: Miniservices, Macroservices, and the in between

Containerized microservices have been the gold standard for cloud computing since they replaced the monolith architecture over a decade ago. The flexibility, scalability, and velocity they enable for teams make them an obvious choice. Yet, a strict interpretation of one service for one function doesn’t quite serve everyone, especially when architectures get large. We’ll discuss how flexibility in service architecture might be the way to go.

The Data Lake Dilemma: Why Businesses Need a New Approach

In today’s data-driven landscape, every organization knows the immense value their data holds, but with the explosion of data from diverse sources, traditional data storage and management solutions are proving inadequate. Organizations are urgently seeking new ways to handle their data effectively.

Calico VPP: Empowering High-Performance Kubernetes Networking with Userspace Packet Processing

This is a guest post authored by Nathan Skrzypczak, R&D Engineer at Cisco. Calico VPP, the latest addition to Calico’s suite of pluggable data planes, revolutionizes Kubernetes networking by enabling transparent user-space packet processing. With features such as service load balancing, encapsulation, policy enforcement, and encryption, Calico VPP brings the performance, flexibility, and observability of VPP to Kubernetes networking.

Synthetic monitoring for TFA-backed applications

Two-factor authentication (TFA, sometimes 2FA) is a crucial security measure that adds an extra layer of protection to your online account. It goes beyond the traditional password-based authentication by requiring a second form of verification. In TFA-backed applications, users are supposed to provide two forms of verification before gaining access to their accounts.

The Three Most Common Sales Mistakes MSPs Must Avoid

The theme I wish to explore in this year’s series of blog posts is how to be a better salesperson and how to build a better sales model. Each month I will explore a particular sales struggle that may be impacting your sales success and provide a few insights and actionable strategies that can be implemented to help you or members of your sales team overcome these challenges so that you can become more effective in your role.

An SRE's Most Important Skill? Communication

I wish someone had told me that I shouldn’t hop between frameworks. Just like learning four programming languages in your first year, in my experience spending time content switching as a beginner is wasted effort. If I’d spent a solid year learning how to deploy services on AWS, then when it was time to learn Azure, I’d see more similarities than differences and find it a lot easier to pick up a second public cloud.