Operations | Monitoring | ITSM | DevOps | Cloud

APM

The latest News and Information on Application Performance Monitoring and related technologies.

Diagnose slow page requests with the latest addition to RUM

Earlier in the year, we launched the request details page in Raygun Real User Monitoring. This update brought the instance-level insights into page performance to help you understand exactly what caused poor performance and how to improve it. To complement instance-level insights, this latest launch brings the Latest slowest requests module to Real User Monitoring. Now, when viewing performance data for a specific URI, you can easily jump into the instance-level diagnostics for a specific slow request.
Sponsored Post

Why high-performing teams consolidate monitoring tools with Raygun APM

Raygun helps thousands of software teams diagnose and resolve performance problems faster and more efficiently. We understand that in today's current climate, there is even more pressure on teams to ensure software quality across the whole stack, all while being cost-effective. It's no easy task! But we know that consolidating tools can help.

Datadog on Kubernetes

When 2 years ago Datadog decided to move its infrastructure platform to Kubernetes we didn’t expect to find so many roadblocks, but ingesting trillions of datapoints per day in a reliable fashion requires pushing the limits of cloud computing. Creating and managing dozens of clusters, with thousands of nodes each and operating in several clouds was a challenging but rewarding learning experience. In this episode Ara Pulido, Developer Advocate, will chat with Laurent Bernaille, Staff Engineer at Datadog and part of the team that created Datadog’s Kubernetes platform. We’ll cover the challenges we found creating and scaling Datadog’s Kubernetes platform and how we overcame them.

The $5B DevOps Stranglehold

Ten years ago NewRelic, DataDog, Splunk, Dynatrace and SolarWinds built tools we loved to use. They were easy to implement and solved problems quickly and efficiently. Each company was known primarily for a single, well-conceived product. NewRelic’s APM. Splunk’s log file analyzer. DataDog’s server monitor. SolarWinds’ network performance monitor. These companies were beloved by users during the 2000s. Fast forward to 2020 and the world is very different.

Datadog on Kafka

As a company, Datadog ingests trillions of data points per day. Kafka is the messaging persistence layer underlying many of our high-traffic services. Consequently, our Kafka usage is quite high: double-digit gigabytes per second bandwidth and the need for petabytes of high performance storage, even for relatively short retention windows. In this episode, we’ll speak with two engineers responsible for scaling the Kafka infrastructure within Datadog, Balthazar Rouberol and Jamie Alquiza. They'll share their strategy in scaling Kafka, how it’s been deployed on Kubernetes, and introduce kafka-kit; our open source toolkit for scaling Kafka clusters. You'll leave with lessons learned while scaling persistent storage on modern orchestrated infrastructure, and actionable insights you can apply at your organization

Our Plans with ThousandEyes: Expanding Monitoring Across the Public Internet for All Applications at the Heart of Today's Digital Businesses

Last week, Cisco announced its intent to acquire ThousandEyes, Inc., marking a major milestone as we build toward our vision to provide a Central Nervous System (CNS) for IT. Learn more about what that means for your business.

Citrix Admins can now track Citrix Connection Quality without Needing any Client Software

Citrix technologies are often used by remote employees or collaborators to access corporate applications and desktops. Citrix access is session oriented – a session is established at logon time and a connection between the client and the server/desktop being accessed is maintained for the duration of the session. User access to Citrix apps and desktops is highly interactive – mouse clicks, keystrokes, etc. all have to go from the client to the server/desktop to be processed.

Why Stack Trace APM Isn't Enough for Complete Web Application Monitoring

It’s probably true to say that if you asked an average user what makes a great web application, they’d probably say “speed.” But speed is the probably the least important aspect of an extensive rundown of elements. Factors like application development and rendering in the program are probably higher on that list. And what makes up a great performing application? And when something goes wrong, how do you know?

Introduction to Site Reliability Engineering

In this session, we start with the basics of SRE, including some common terminology and theory, then dive into practical examples—including lessons learned from our own journey here at Datadog. We discuss the relationship between SRE and DevOps, what success looks like (and how to measure it), and how to identify and nurture both internal and external talent in order to build a cross-functional team. SRE is a large, complex topic, so the session ends with a live Q&A and deep-dive into some great topics.