Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

The Hidden Costs of Website Outages and How Uptime.com Has Your Back

Businesses lose potential revenue, trust, and brand reputation every moment your website is down. Some of those things can never be earned back. Website outages sting whether you’re a blossoming startup or a seasoned enterprise. How often do they happen, and what’s the actual cost? That is exactly what we will explore together today!

Improve your API test coverage with Datadog Synthetic Monitoring

As your applications grow, your teams may be faced with managing a complex, expanding mesh of potentially thousands of loosely connected APIs—each one a new point of failure that can be difficult to track and patch. API sprawl comes naturally in rapidly expanding, distributed applications, and the difficulty of maintaining centralized knowledge and toolsets for your APIs creates friction when teams need to leverage APIs they don’t own.

Manage API performance, security, and ownership with Datadog API Catalog

Today’s modern applications are made up of thousands of loosely connected private and publicly exposed APIs, each serving a specific function. This dynamic API landscape, in combination with the decentralized nature of microservice development, can be overwhelmingly challenging to manage—let alone govern or secure adequately. API sprawl is often created as a result, leading to fragmented or nonexistent internal API documentation, knowledge bases, and toolsets.

Bugs in NASAs codebase : Bill Kennedy - The Reliability Podcast

The Reliability podcast aims to speak with engineers who have worked on large, complex systems and glean through their learnings. What best practices should one imbibe? What are non-negotiable learnings to become better at a craft? What’s ‘engineering’ going to be like with the advent of AI? We answer these and more tracing personal journeys of engineers who have built stellar careers around decoding the innumerable intricacies of software engineering.

AI is not intellignece: Bill Kennedy - The Reliability Podcast

The Reliability podcast aims to speak with engineers who have worked on large, complex systems and glean through their learnings. What best practices should one imbibe? What are non-negotiable learnings to become better at a craft? What’s ‘engineering’ going to be like with the advent of AI? We answer these and more tracing personal journeys of engineers who have built stellar careers around decoding the innumerable intricacies of software engineering.

How to Monitor SQL Server with OpenTelemetry

At observIQ, we've seen growing interest in observing the health of Windows systems and applications using OpenTelemetry. Requests on the SQL Server receiver continue to garner the most interest, so let's start there. Below are steps to get up and running quickly with the contrib distribution of the OpenTelemetry collector. We'll be collecting and shipping SQL Server metrics to a popular backend, Google Cloud.

How to Monitor Hybrid Networks for End-to-End Visibility: Hybrid Network Monitoring

Hybrid networks, which combine on-premises infrastructure with cloud-based services, have become the backbone of modern operations. While they offer numerous advantages, they also present unique challenges when it comes to network monitoring and management. Maintaining the health and security of a hybrid network requires a comprehensive understanding of its intricate architecture and real-time visibility into its performance.

Monitoring your infrastructure with StatsD and Graphite

Collecting metrics about your servers, applications, and traffic is a critical part of an application development project. There are many things that can go wrong in production systems, and collecting and organizing data can help you pinpoint bottlenecks and problems in your infrastructure. In this article, we will discuss Graphite and StatsD, and how they can help form the basis of monitoring infrastructure.