September 2024

How to spot and fix memory leaks in Go

Sep 30, 2024 By George Kampitakis In Datadog

A memory leak is a faulty condition where a program fails to free up memory it no longer needs. If left unaddressed, memory leaks result in ever-increasing memory usage, which in turn can lead to degraded performance, system instability, and application crashes. Most modern programming languages include a built-in mechanism to protect against this problem, with garbage collection being the most common. Go has a garbage collector (GC) that does a very good job of managing memory.

Read Post

Datadog

Read more about How to spot and fix memory leaks in Go

How we used Datadog to save $17.5 million annually

Sep 27, 2024 By Bowen Chen In Datadog

Like most organizations, we are always trying to be as efficient as possible in our usage of our cloud resources. To help accomplish this, we encourage individual engineering teams at Datadog to look for opportunities to optimize. They can share their performance wins, big or small, in an internal Slack channel along with visualizations and, often, calculations of the resulting annual cost savings.

Read Post

Datadog

Read more about How we used Datadog to save $17.5 million annually

Optimize your AWS costs with Cloud Cost Recommendations

Sep 26, 2024 By Kayla Taylor In Datadog

Managing your AWS costs is both crucial and complex, and as your AWS environment grows, it becomes harder to know where you can optimize and how to execute the necessary changes. Datadog Cloud Cost Management provides invaluable visibility into your cloud spend that enables you to explore costs and investigate trends that impact your cloud bill.

Read Post

Datadog

Read more about Optimize your AWS costs with Cloud Cost Recommendations

Operator vs. Helm: Finding the best fit for your Kubernetes applications

Sep 26, 2024 By Nicholas Thomson In Datadog

Kubernetes operators and Helm charts are both tools used for deploying and managing applications within Kubernetes clusters, but they have different strengths, and it can be difficult to determine which one to use for your application. Helm simplifies the deployment and management of Kubernetes resources using templates and version-controlled packages. It excels in scenarios where repeatable deployments and easy upgrades or rollbacks are needed.

Read Post

Datadog

Read more about Operator vs. Helm: Finding the best fit for your Kubernetes applications

Integration roundup: Understanding email performance with Datadog

Sep 25, 2024 By Aaron Kaplan In Datadog

Visibility into email health and performance is indispensable to any organization seeking to reach its customers through their inboxes. As they work to curtail spam, internet service providers (ISPs) are redefining the standards of deliverability on an ongoing basis, and organizations often struggle to adapt.

Read Post

Datadog

Read more about Integration roundup: Understanding email performance with Datadog

Get insights into service-level Fastly costs with Datadog Cloud Cost Management

Sep 24, 2024 By Natasha Goel In Datadog

As your organization scales its applications across many different cloud and SaaS providers, it becomes more challenging to understand your costs. You likely receive your bill at the end of the month, meaning you don’t have real-time visibility into who’s spending what and which services or applications your teams are spending the most on. Changing service costs also makes it difficult to break down your costs and identify what is driving spend, leaving you unable to take action.

Read Post

Datadog

Read more about Get insights into service-level Fastly costs with Datadog Cloud Cost Management

Optimize Ruby garbage collection activity with Datadog's allocations profiler

Sep 24, 2024 By Ivo Anjo In Datadog

One Ruby feature that embodies the principle of “optimizing for programmer happiness” is how the language uses garbage collection (GC) to automatically manage application memory. But as Ruby apps grow, GC itself can become a big consumer of system resources, and this can lead to high CPU usage and performance issues such as increased latency or reduced throughput.

Read Post

Datadog

Read more about Optimize Ruby garbage collection activity with Datadog's allocations profiler

Best practices for monitoring and remediating connection churn

Sep 18, 2024 By Nicholas Thomson In Datadog

Elevated connection churn can be a sign of an unhealthy distributed system. Connection churn refers to the rate of TCP client connections and disconnections in a system. Opening a connection incurs a CPU cost on both the client and server side. Keeping those connections alive also has a memory cost. Both the memory and CPU overhead can starve your client and server processes of resources for more important work.

Read Post

Datadog

Read more about Best practices for monitoring and remediating connection churn

Anthropic Partners with Datadog to Bring Trusted AI to All

Sep 12, 2024 By Datadog In Datadog

At Datadog’s 2024 DASH conference, Anthropic President and Co-Founder, Daniela Amodei, announced the new Anthropic integration with Datadog’s LLM Observability. This new native integration offers joint customers robust monitoring capabilities and suite of evaluations that assess the quality and safety of LLM applications. Get real time insights into performance and usage, with full visibility into the end to end LLM trace. Enabling you to troubleshoot any issues, reduce downtime and get your Claude powered applications to market faster.

View Video

Datadog

Read more about Anthropic Partners with Datadog to Bring Trusted AI to All

Datadog for Financial Services

Sep 9, 2024 By Datadog In Datadog

Global financial services institutions monitor the health, security, and performance of their most business-critical systems with Datadog’s unified observability platform.

View Video

Datadog

Read more about Datadog for Financial Services

Key learnings from the State of Cloud Costs study

Sep 9, 2024 By Kayla Taylor In Datadog

We recently released our initial State of Cloud Costs report, which identified factors shaping the costs of hundreds of organizations that use Datadog Cloud Cost Management to monitor their AWS spend. The report reveals several widely applicable themes, including the ways in which resource utilization, adoption of emerging technologies, and participation in commitment-based discount programs all shape cloud environments and costs.

Read Post

Datadog

Read more about Key learnings from the State of Cloud Costs study

Datadog for Cloud Operational Excellence

Sep 6, 2024 By Datadog In Datadog

Datadog provides real-time visibility and actionable insights into hybrid and multi-cloud environments, helping complex organizations streamline incident management, reduce costs, and maximize uptime in a single, unified platform.

View Video

Datadog

Read more about Datadog for Cloud Operational Excellence

Monitor your Twilio resources with Datadog

Sep 5, 2024 By Brittany Coppola In Datadog

Twilio is a customer engagement platform that helps organizations build communication features to meaningfully interact with customers on the channels they prefer. Twilio consists of a set of APIs for integrating communication tools such as voice, SMS, chat, video, and email into applications. Datadog’s Twilio integration collects a wide variety of logs to allow you to analyze performance issues and detect security threats across all of your Twilio resources.

Read Post

Datadog

Read more about Monitor your Twilio resources with Datadog

Monitor Oracle Cloud Infrastructure with Datadog

Sep 5, 2024 By Bowen Chen In Datadog

Oracle Cloud Infrastructure (OCI) provides cloud infrastructure and platform services designed to support a broad spectrum of cloud strategies and workloads. OCI provides enterprise customers with scale-up resource scaling architectures, ultra-low-latency networks, and more to help them migrate legacy workloads to the cloud, while supporting cloud-native applications via an expansive network of cloud partners and services.

Read Post

Datadog

Read more about Monitor Oracle Cloud Infrastructure with Datadog

Burn rate is a better error rate

Sep 4, 2024 By James Frullo In Datadog

While building our Service Level Objectives (SLO) product, our team at Datadog often needs to consider how error budget and burn rate work in practice. Although error budgets and burn rates are discussed in foundational sources such as Google’s Site Reliability Workbook, for many these terms remain ambiguous. Is an error budget a static quantity or a varying percentage? Does burn rate indicate how fast I’m spending a fixed quantity, or is it just another way to express error rate?

Read Post

Datadog

Read more about Burn rate is a better error rate

Operations | Monitoring | ITSM | DevOps | Cloud

September 2024

How to spot and fix memory leaks in Go

How we used Datadog to save $17.5 million annually

Optimize your AWS costs with Cloud Cost Recommendations

Operator vs. Helm: Finding the best fit for your Kubernetes applications

Integration roundup: Understanding email performance with Datadog

Get insights into service-level Fastly costs with Datadog Cloud Cost Management

Optimize Ruby garbage collection activity with Datadog's allocations profiler

Best practices for monitoring and remediating connection churn

Anthropic Partners with Datadog to Bring Trusted AI to All

Datadog for Financial Services

Key learnings from the State of Cloud Costs study

Datadog for Cloud Operational Excellence

Monitor your Twilio resources with Datadog

Monitor Oracle Cloud Infrastructure with Datadog

Burn rate is a better error rate

Monthly Archive

Follow Us