Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Monitor Alibaba Cloud with Datadog

Alibaba Cloud provides a comprehensive suite of cloud computing services to power businesses across the globe. We are excited to announce that our new integration with Alibaba Cloud is now in public beta. While the Datadog Agent has always been able to provide visibility into Alibaba Cloud instances, this new integration now enables you to also monitor the health and performance of Alibaba Cloud services (load balancers, managed databases, and more) in Datadog.

Prometheus metrics / OpenMetrics code instrumentation

In the following example-driven tutorial we will learn how to use Prometheus metrics / OpenMetrics to instrument your code whether you are using Golang, Java, Python or Javascript. We will cover the different metric types and provide readily executable code snippets. Prometheus is an open source time series database for monitoring that was originally developed at SoundCloud before being released as an open source project.

How to Monitor Amazon DynamoDB with CloudWatch

Amazon DynamoDB is a key-value and document database that allows you to easily scale to huge numbers of records with single digit millisecond performance. However, since it’s a managed service, you have less visibility with traditional monitoring tools. As such, it becomes even more important to take advantage of the available monitoring tools in AWS. In this post, we’ll explain how to use CloudWatch to monitor DynamoDB and what is important to watch.

NetFlow Basics: An Introduction to Monitoring Network Traffic

To fully understand what NetFlow is and why it’s used for network monitoring, we first need to know what a flow is. When computers need to talk to one another they establish communication channels, commonly referred to as connections. (Technically speaking, these communication channels can only be called connections when the TCP protocol is involved.) A flow refers to any connection or connection-like communication channel.

Quantifying the Digital Employee Experience

We’ve talked to a lot of people about their company’s digital employee experience the past few years – from C-suite executives and board members looking to make sure they’re doing what they can to make work lives better and retain staff, to the actual CIOs and IT managers tasked with changing and improving their employees’ workplace experience. We’ve even heard from employees on the front lines every day about what works and what doesn’t at their companies.

How to collect, standardize, and centralize Golang logs

Organizations that depend on distributed systems often write their applications in Go to take advantage of concurrency features like channels and goroutines (e.g., Heroku, Basecamp, Cockroach Labs, and Datadog). If you are responsible for building or supporting Go applications, a well-considered logging strategy can help you understand user behavior, localize errors, and monitor the performance of your applications.

Best Practices for Proactive Monitoring

If you could know information about your systems in advance, what would you choose to know? If there was a set of repeating behaviors that happened consistently before an outage, would you want to know what they were? This is the idea behind proactive monitoring – the switching of context from “reactive” monitoring to something that allows you to act before the problem arises. Here are some guidelines to help you get started with your customized solution.