Datadog

Datadog's AWS re:Invent 2018 guide

Nov 14, 2018 By Jason Yee In Datadog

Each November, AWS re:Invent draws thousands of AWS staff, partners, and users to Las Vegas for an intense week featuring all things AWS and AWS-related. As always, Datadog will be there and we’d love to meet you in person. Our engineers are excited to show off the new features they’ve been building and to answer your monitoring questions!

Read Post

Datadog

Read more about Datadog's AWS re:Invent 2018 guide

Monitoring Apache Spark applications running on Amazon EMR

Nov 5, 2018 By Priya Matpadi In Datadog

We recently implemented a Spark streaming application, which consumes data from from multiple Kafka topics. The data consumed from Kafka comprises different types of telemetry events generated by mobile devices. We decided to host the Spark cluster using the Amazon EMR service, which manages a fleet of EC2 instances to run our data-processing pipelines.

Read Post

Datadog

Read more about Monitoring Apache Spark applications running on Amazon EMR

Scaling Kafka at Datadog with Kubernetes and Kafka-Kit

Nov 1, 2018 By Datadog In Datadog

In this session, Balthazar Rouberol discusses how he is using Kafka-Kit and Kubernetes to manage and scale the Kafka clusters that power Datadog's metric pipelines. Gwen Shapria (Confluent.io) also presents on how to deploy Kafka on K8S.

View Video

Datadog

Read more about Scaling Kafka at Datadog with Kubernetes and Kafka-Kit

Kubernetes the very hard way at Datadog

Oct 24, 2018 By Datadog In Datadog

Rob Boll and Laurent Bernaille share their experiences in running and building the Kubernetes clusters that power Datadog.

View Video

Datadog

Read more about Kubernetes the very hard way at Datadog

Introducing the Datadog Cluster Agent

Oct 18, 2018 By Charly Fontaine In Datadog

As containers and orchestrators have surged in popularity, they have created highly dynamic environments with rapidly changing workloads—and the need for equally dynamic ways of monitoring them. After all, orchestration technologies like Kubernetes, DC/OS, and Swarm manage container workloads both at the node level and at the cluster level, which means that you need to gather insights from every layer to fully understand the state of your infrastructure.

Read Post

Datadog

Read more about Introducing the Datadog Cluster Agent

Track the status of your SLOs with the new monitor uptime widget

Oct 18, 2018 By Meghan Jordan In Datadog

Service level objectives are an important tool for maintaining application performance, ensuring a consistent customer experience, and setting expectations about service performance for both internal and external users. We are very pleased to announce the availability of a new monitor uptime widget that makes it simple to monitor the status of your SLOs and communicate that status to your teams, executives, or external customers.

Read Post

Datadog

Read more about Track the status of your SLOs with the new monitor uptime widget

Log Patterns: Automatically cluster your logs for faster investigation

Oct 18, 2018 By Renaud Boutet In Datadog

Sifting through all your logs to find what you need can be challenging—especially during an outage, when time is critical and you’re flooded with WARN and ERROR messages. To help you immediately surface useful information from large volumes of logs, we developed Log Patterns.

Read Post

Datadog

Read more about Log Patterns: Automatically cluster your logs for faster investigation

Pivotal Cloud Foundry Monitoring with Datadog

Oct 10, 2018 By Maxim Brown In Datadog

In part three of this series, we showed you a number of methods and tools for accessing key metrics and logs from a Pivotal Cloud Foundry deployment. Some of these tools help PCF operators monitor the health and performance of the cluster, whereas others allow developers to view metrics, logs, and performance data from their applications running on the cluster.

Read Post

Datadog

Read more about Pivotal Cloud Foundry Monitoring with Datadog

Collecting Pivotal Cloud Foundry logs and metrics

Oct 10, 2018 By Maxim Brown In Datadog

So far in this series we’ve explored Pivotal Cloud Foundry’s architecture and looked at some of the most important metrics for monitoring each PCF component. In this post, we’ll show you how you can view these metrics, as well as application and system logs, in order to monitor your PCF cluster and the applications running on it.

Read Post

Datadog

Read more about Collecting Pivotal Cloud Foundry logs and metrics

Key metrics for monitoring Pivotal Cloud Foundry

Oct 10, 2018 By Maxim Brown In Datadog

In the first part of this series, we outlined the different components of a Pivotal Cloud Foundry deployment and how they work together to host and run applications. In this article we will look at some of the most important metrics that PCF operators should monitor. These metrics provide information that can help you ensure that the deployment is running smoothly, that it has enough capacity to meet demand, and that the applications hosted on it are healthy.

Read Post

Datadog

Read more about Key metrics for monitoring Pivotal Cloud Foundry

Operations | Monitoring | ITSM | DevOps | Cloud

Datadog

Datadog's AWS re:Invent 2018 guide

Monitoring Apache Spark applications running on Amazon EMR

Scaling Kafka at Datadog with Kubernetes and Kafka-Kit

Kubernetes the very hard way at Datadog

Introducing the Datadog Cluster Agent

Track the status of your SLOs with the new monitor uptime widget

Log Patterns: Automatically cluster your logs for faster investigation

Pivotal Cloud Foundry Monitoring with Datadog

Collecting Pivotal Cloud Foundry logs and metrics

Key metrics for monitoring Pivotal Cloud Foundry

Monthly Archive

Follow Us