Operations | Monitoring | ITSM | DevOps | Cloud

October 2024

Best practices for monitoring cloud costs with Datadog Scorecards

To ensure that your organization’s cloud spend is efficient, you need detailed and granular visibility to understand what comprises your costs, what causes them to change, and how the cloud services and resources you use are enabling your business goals. Extending your visibility and more closely monitoring your cloud costs can position you to successfully adopt FinOps, which provides a framework that can help you maximize the value you get from your cloud spend.

Detect issues, manage incidents, and streamline workflows with Datadog's Microsoft Teams integration

Microsoft Teams is deeply embedded in many organizations’ workflows, acting as a hub to both communicate and collect information about issues and ongoing projects. However, as with most communication platforms, it can be challenging to context-switch between conversations, tickets, and monitoring data when troubleshooting collaboratively.

This Month in Datadog: Google Gemini integration, Unified Error Tracking, and more

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. To learn more about Datadog and start a free 14-day trial, visit Cloud Monitoring as a Service | Datadog. This month, we put the Spotlight on Datadog LLM Observability’s native integration with Google Gemini.

How Appfolio uses Datadog LLM Observability to deliver exceptional GenAI experiences

Learn how Appfolio is delivering positive customer experiences in real estate with generative AI — supported and safeguarded by Datadog’s LLM Observability. See how you can use Datadog LLM Observability to monitor, troubleshoot, improve, and secure your LLM applications.

Transform and enrich your logs at query time with Calculated Fields

As the number of distinct sources generating logs across systems and applications grows, teams face the challenge of normalizing log data at scale. This challenge can manifest when you’re simply looking to leverage logs “off-the-shelf” for investigations, dashboards, or reports–especially when you don’t control the content and structure of certain logs (like those collected from third-party applications and platforms).

Datadog named a Leader in first ever 2024 Gartner Magic Quadrant for Digital Experience Monitoring

We are thrilled to announce that Datadog has been named a Leader in the first ever 2024 Gartner Magic Quadrant for Digital Experience Monitoring. Datadog was positioned the highest in its Ability to Execute. We believe this placement reflects our commitment to being an end-to-end observability platform that brings together all signals from across your tech stack into a unified ecosystem.

Trace your applications end to end with Datadog and OpenTelemetry

As teams adopt OpenTelemetry (OTel) to instrument their systems in a vendor-neutral way, they often face a challenge in effectively tracing activity throughout their entire stack, from frontend user interactions to backend services and databases. While OTel enables basic tracing, teams still need a way to access advanced capabilities like continuous profiling to adequately optimize performance and troubleshoot issues in their applications.

Flaky tests: their hidden costs and how to address flaky behavior

Flaky tests are bad—this is a fact implicitly understood by developers, platform and DevOps engineers, and SREs alike. When tests flake (i.e., generate conflicting results across test runs, without any changes to the code or test), they can arbitrarily fail builds, requiring developers to re-run the test or the full pipeline. This process can take hours—especially for large or monolithic repositories—and slow down the software delivery cycle.

Monitor your Azure OpenAI applications with Datadog LLM Observability

Azure OpenAI Service is Microsoft’s fully managed platform for deploying generative AI services powered by OpenAI. Azure OpenAI Service provides access to models including GPT-4o, GPT-4o mini, GPT-4 Turbo with Vision, DALLE-3, and the Embeddings model series, alongside the enterprise security, governance, and infrastructure capabilities of Azure.

Generate metrics from your high-volume logs with Datadog Observability Pipelines

Logs are a rich source of information, providing you with the minute details you need to troubleshoot a specific issue or perform extensive historical analysis. But with billions of logs being generated from your infrastructure every day, it isn’t practical to sift through them all to derive actionable insights. Firewall, CDN, network activity, and load balancer logs are especially high volume, requiring storage solutions that can be expensive and difficult to scale.

Hear how PayPal is accelerating their pace of innovation with Datadog

With over 426 million active users, comprised of consumers and merchants, Paypal processes approximately 25 billion transactions valued at around $1.53 trillion USD. Paypal is shaping the future of commerce for millions of customers globally, and to do that, they use Datadog to provide timely insights into their entire stack.

Datadog on OpenTelemetry

OpenTelemetry (OTel) is an open source, vendor-neutral observability framework that supplies APIs, SDKs, and tools to instrument, generate, collect, and export telemetry data (metrics, logs, traces and soon profiles). It has a vibrant ecosystem of components, integrations and vendors. In this episode, Juliano Costa will discuss OpenTelemetry with Felix Geisendörfer, Senior Staff Engineer on the Continuous Profiling team, and Pablo Baeyens, Software Engineer on the OpenTelemetry team.

Reduce your AWS Step Functions' error remediation time by redriving executions directly from Datadog

AWS enables customers to retry or redrive Step Functions executions to continue any failed executions of Standard Workflows from their points of failure while maintaining all inputs. For example, if you find broken downstream logic in your code or experience unexpected errors upon execution, you can remediate those errors by fully re-running an execution or use redrive to continue this execution.

Gain visibility into your Camunda 8 components with Bordant Technologies' Datadog integration

Camunda 8 is a process orchestration platform that automates and executes business processes at scale. Many organizations orchestrate their business processes using Camunda 8 Self-Managed because it can operate in their preferred public cloud provider, such as AWS, or in a private cloud, like a Kubernetes cluster. However, hosting Camunda 8 while maintaining its health and performance will require complete visibility into your environment, helping you properly allocate resources and minimize downtime.