Datadog

Key metrics for monitoring Cilium

Cilium is a Container Network Interface (CNI) plugin for securing and load-balancing network traffic in your Kubernetes environment. As a CNI provider, Cilium extends the orchestrator’s existing network capabilities, giving teams more control over how they build their applications and monitor traffic. For example, vanilla Kubernetes installations typically rely on traditional firewalls and Linux-based network utilities like iptables to filter pod-to-pod traffic by IP address or port.

Monitor CockroachDB performance metrics with Datadog

CockroachDB is a highly resilient distributed SQL database developed by Cockroach Labs. CockroachDB guarantees ACID semantics and aims to make it easy to scale horizontally by adding nodes instead of manually sharding the database. Built to be resilient (much like its namesake insect) and highly available as it scales, CockroachDB recovers from node failures by repairing and rebalancing automatically.

Save and share reusable dashboard widget groups with Powerpacks

Dashboards allow you to visualize and correlate monitoring data from across disparate data sources, technologies, and infrastructure components to understand what’s going on in your environment. In a growing organization, it’s paramount to standardize how teams build their dashboards to ensure their consistency and legibility.

Analyze VPC Flow Logs for AWS Transit Gateway in Datadog

AWS Transit Gateway is a service that makes it easy to connect multiple Amazon Virtual Private Clouds (VPCs), AWS accounts, AWS Regions, and on-premises networks through a central hub. For AWS customers operating at global scale with many accounts and VPCs, AWS Transit Gateway greatly simplifies AWS networking architecture by eliminating the need to manage complex peering relationships and massive route tables.

Monitor your T2A-powered GKE workloads with Datadog

Arm processors have become increasingly popular in recent years, providing energy-efficient, cost-effective processing power to both mobile and cloud computing ecosystems. As a part of this growth, more and more organizations are choosing to leverage the many benefits of Arm-based architectures for their containerized workloads. Today, Google Cloud announced its Arm-based Tau T2A virtual machines (VMs), which you can also use to run workloads in Google Kubernetes Engine (GKE).

Monitor custom serverless metrics with the Datadog Lambda extension

When building serverless applications on AWS Lambda, Amazon CloudWatch provides out-of-the-box metrics that measure the performance, errors, and duration of your functions. Although these standard Lambda metrics provide visibility into your serverless applications, it can also be invaluable to monitor custom metrics that are unique to your use case and application.

Proactively monitor service performance with SLO alerts

Service level objectives (SLOs) state your team’s goals for maintaining the reliability of your services. Adopting SLOs is an SRE best practice because it can help you ensure that your services perform well and consistently deliver value to users. But to gain the greatest benefit from your SLOs, you need ongoing visibility into how well your services are performing relative to your objectives.

Monitor Azure Functions with the Datadog extension for Azure App Service

Azure Functions is an on-demand serverless compute offering built on top of Azure App Service that enables you to deploy event-driven code without the need to provision and manage infrastructure. Because applications rely on Azure Functions to handle business-critical tasks such as processing orders or logging in users, it’s important to ensure that your functions respond quickly when they’re invoked.

Detect user pain points with Datadog Frustration Signals

Whether you run an ecommerce site, a digital publication, or any other customer-facing service, delivering an optimal user experience is key to the success of your business. Customers can grow frustrated and abandon your site when they run into hurdles such as JavaScript errors or confusing page designs, and that frustration negatively impacts your company’s bottom line.