Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

AI-Based remediation for cloud incidents? We are on our way there

“Today, maintaining a high performance and continuous availability of cloud production environments are some of the most significant pain points for technology organizations. Production availability is commonly measured and managed by evaluating some of the following dimension: Does the product or service work fast enough? Does its performance meet the expected metrics? Is it able to deliver the optimal customer experience?

Monitor ClickHouse with Datadog

ClickHouse is an open source database management system, and was originally developed as a backend for Yandex’s Metrica analytics platform. ClickHouse is column oriented, meaning that it can quickly scan through ranges of values in a single column without touching irrelevant values in other columns. This makes ClickHouse well suited for online analytical processing (OLAP).

Docker container monitoring with Netdata

Properly monitoring the health and performance of Docker containers is an essential skill for solo developers and large teams alike. As your infrastructure grows in complexity, it’s important to streamline every facet of the performance of your apps/services. Plus, it’s essential that the tools you use to make those performance decisions work across teams, and allow for complex scaling architectures.

Key metrics for monitoring AWS Lambda

AWS Lambda is a compute service that enables you to build serverless applications without the need to provision or maintain infrastructure resources (e.g., server capacity, network, security patches). AWS Lambda is event driven, meaning it triggers in response to events from other services, such as API calls from Amazon API Gateway or changes to a DynamoDB table.

Tools for collecting AWS Lambda data

In Part 1 of this series, we discussed AWS Lambda functions and some key metrics for monitoring them. In this post, we’ll look at using Amazon’s native tooling to query those metrics. We’ll also show you how to collect logs and traces that provide further visibility into your Lambda functions. Amazon provides built-in monitoring functionality through CloudWatch and X-Ray.

Monitoring AWS Lambda with Datadog

In Part 2 of this series, we looked at how Amazon’s built-in monitoring services can help you get insights into all of your AWS Lambda functions. In this post, we’ll show you how to use Datadog to monitor all of the metrics emitted by Lambda, as well as function logs and performance data, to get a complete picture of your serverless applications. In this post, we will: Datadog integrates with AWS Lambda and other services such as Amazon API Gateway, S3, and DynamoDB.