Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Why You Don't Need to Hire Kubernetes Experts

History has a tendency to repeat itself. This is because bad habits and anti-patterns are hard to break. And this remains the case with the latest sought-after engineering unicorn––the “Kubernetes expert”. These days, there is a veritable gold rush to hire the best and brightest Kubernetes wizards. Like all forms of expertise––this gold is rare, and as a result––is also costly. But this isn’t a new phenomenon in the technology world.

Introducing Honeycomb for Frontend Observability: Get the Data You Need for Actionable Customer Experience Improvements

Today, we're announcing the early access program of Honeycomb for Frontend Observability. Honeycomb for Frontend Observability gives teams the ability to quickly identify opportunities for optimization within their web app. This starts with better OpenTelemetry instrumentation, available as an NPM package, that lets you instrument and collect attribution data on Core Web Vitals in under an hour.

MPLS vs SD-WAN: Optimizing Your WAN for the Cloud Era

Imagine your business as a bustling city with branch offices scattered across different districts. Efficient communication between these locations is vital for smooth operations. This is where Wide Area Networks (WANs) come in, acting as the high-speed highways that connect your city's various departments. But when it comes to choosing the right WAN technology, navigating the options can feel like getting lost in a maze.

Setting up your Grafana k6 performance testing suite: JavaScript tools, shared libraries, and more

Editor’s note: This blog post is the second in a series of posts about organizing your performance testing suite with Grafana k6. If you haven’t already, be sure to check out the first post in the series, which explores how to implement reusable test patterns and other best practices within your testing suite.

Live event recap: Humanizing the on-call experience

There’s no two ways about it: on-call is stressful. But with humans at the center, it’s especially important to find ways to make it as manageable and empathetic as possible. In this webinar with our friends at ELC, incident.io VP of Engineering, Noberto Lopes, and Intercom Staff Product Engineer, Andrej Blagojević, discuss their own experiences with on-call, and how the process can be better.

Making Data Storage More Secure with Progress Flowmon and Veeam Backup and Replication

The new partnership between Progress and Veeam represents a significant step forward in cybersecurity. It marks a considerable advancement in data protection by merging the Flowmon AI-powered threat detection capabilities with the robust backup solution of Veeam. This empowers organizations to more effectively defend their invaluable digital assets.

IRL to IAC: Your Environment to PagerDuty via Terraform

Figuring out how to represent your as-built environment in PagerDuty can be confusing for new users. There are a lot of components to PagerDuty that will help your team be successful managing incidents, integrating with other systems in your environment, running workflows, and using automation. Your organization might have a lot of these components – users, teams, services, integrations, orchestrations, etc.

How to build reliable services with unreliable dependencies

In an earlier blog, we looked at slow dependencies and how they can impact the reliability of other services. While we explored what happens when dependencies are degraded, what happens when dependencies outright fail? What can you do when your application or service sends a request to another service, and nothing comes back? We’ll answer this question by using Gremlin to proactively test a service with multiple dependencies.