
November 2024

Build Datadog workflows and apps in minutes with our AI assistant

Datadog is a central hub of information—enabling you to see logs, traces, and metrics from across your stack and providing a centralized source of notifications about potential issues. However, when Datadog notifies you of an issue, you often need to log in to other applications to fully assess and resolve it, which slows down mitigation.

Stream logs in the OCSF format to your preferred security vendors or data lakes with Observability Pipelines

Today, CISOs and security teams face a rapidly growing volume of logs from a variety of sources, all arriving in different formats. They write and maintain detection rules, build pipelines, and investigate threats across multiple environments and applications. Efficiently maintaining their security posture across multiple products and data formats has become increasingly challenging.

Optimize LLM application performance with Datadog's vLLM integration

vLLM is a high-performance serving framework for large language models (LLMs). It optimizes token generation and resource management to deliver low-latency, scalable performance for AI-driven applications such as chatbots, virtual assistants, and recommendation systems. By efficiently managing concurrent requests and overlapping tasks, vLLM enables organizations to deploy LLMs in demanding environments with speed and efficiency.
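
As a rough illustration of how vLLM is used, here is a minimal sketch of its offline inference API; the model name and sampling parameters are illustrative, and a GPU-backed host with the vllm package installed is assumed.

```python
# Minimal sketch of vLLM's offline inference API (model choice is illustrative).
from vllm import LLM, SamplingParams

# Batch several prompts so vLLM can schedule them concurrently.
prompts = [
    "Summarize the benefits of continuous batching.",
    "Explain what a KV cache is in one sentence.",
]
sampling = SamplingParams(temperature=0.7, max_tokens=128)

llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")  # hypothetical model choice
for output in llm.generate(prompts, sampling):
    print(output.outputs[0].text)
```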

Get deeper visibility into your AWS serverless apps with enhanced distributed tracing

Serverless or event-driven applications can comprise many different distributed components, including serverless compute services such as AWS Lambda and AWS Fargate for Amazon ECS, as well as managed data streams, data stores, workflow orchestration tools, queues, and more. Having full end-to-end visibility into requests as they propagate across all of these parts of your application is crucial to monitoring performance, locating affected up- or downstream services, and troubleshooting issues.
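
As a hedged sketch of what that instrumentation can look like in Python, the snippet below wraps a Lambda handler with Datadog's Lambda wrapper and adds a custom span; the span name and handler logic are illustrative, and in practice the datadog-lambda and ddtrace packages are usually added via Lambda layers or the Datadog extension.

```python
# Illustrative sketch: instrument a Python Lambda handler so its invocations
# join the distributed trace. Assumes the datadog-lambda and ddtrace packages.
from datadog_lambda.wrapper import datadog_lambda_wrapper
from ddtrace import tracer

@datadog_lambda_wrapper
def handler(event, context):
    # Custom span around downstream work, e.g., reading from a queue or data store.
    with tracer.trace("orders.process_batch"):
        records = event.get("Records", [])
        return {"processed": len(records)}
```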

Best practices for monitoring progressive web applications

Progressive web applications (PWAs) are a modern frontend architecture designed to provide a similar user experience to that of a native iOS, Android, or other platform-specific app. PWAs are built using common web platform technologies—such as HTML, CSS, and JavaScript—and are intended not only to run in a browser and be accessed from the web, but also to be installed on users’ devices and accessed offline.

Identify deprecated Lambda functions with Datadog

AWS Lambda supports nearly any programming language by enabling developers to run serverless functions with either supported or custom runtimes. Once a runtime is deprecated, however, AWS will set dates for when you can no longer create or update functions using that runtime. You will then need to decide what course of action to take to ensure your Lambda functions continue running as expected.
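
To make that concrete, here is a small illustrative script that lists Lambda functions running on runtimes you consider deprecated; the runtime names are examples only, and boto3 plus suitable IAM permissions are assumed.

```python
# Illustrative sketch: flag Lambda functions on runtimes you track as deprecated.
import boto3

DEPRECATED_RUNTIMES = {"python3.7", "nodejs14.x", "go1.x"}  # example list, not exhaustive

lambda_client = boto3.client("lambda")
paginator = lambda_client.get_paginator("list_functions")

for page in paginator.paginate():
    for fn in page["Functions"]:
        # Container-image functions have no Runtime field, hence .get().
        runtime = fn.get("Runtime", "")
        if runtime in DEPRECATED_RUNTIMES:
            print(f"{fn['FunctionName']}: {runtime}")
```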

Detect anomalies before they become incidents with Datadog AIOps

As your IT environment scales, a proactive approach to monitoring becomes increasingly critical. If your infrastructure environment contains multiple service dependencies, disparate systems, or a busy CI/CD application delivery pipeline, overlooked anomalies can result in a domino effect that leads to unplanned downtime and an adverse impact on users.

Datadog on Cloud Workload Identities

Datadog operates dozens of Kubernetes clusters, tens of thousands of hosts, and millions of containers across a multi-cloud environment, spanning AWS, Azure, and Google Cloud. With over 2,000 engineers, we needed to ensure that every developer and application could securely and efficiently access resources across these various cloud providers.

Detect and troubleshoot Windows Blue Screen errors with Datadog

Windows Blue Screen errors—also known as bug checks, STOP codes, kernel errors, or the Blue Screen of Death (BSOD)—are triggered when the operating system detects a critical issue that compromises system stability. To prevent further damage or data corruption, the OS determines that the safest course of action is to shut down immediately. The system then restarts and displays the well-known BSOD.
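
One way to see this on a host is to look for the bug check record Windows writes to the System event log after the restart; the sketch below reads it with pywin32 (the "BugCheck" source is the commonly documented marker, and the pywin32 package must be installed on the Windows host).

```python
# Illustrative sketch: scan the Windows System event log for bug check (BSOD) records.
import win32evtlog

log = win32evtlog.OpenEventLog(None, "System")
flags = win32evtlog.EVENTLOG_BACKWARDS_READ | win32evtlog.EVENTLOG_SEQUENTIAL_READ

events = win32evtlog.ReadEventLog(log, flags, 0)
while events:
    for event in events:
        # After a crash, Windows logs an event from the "BugCheck" source
        # containing the STOP code and the dump file location.
        if event.SourceName == "BugCheck":
            print(event.TimeGenerated, event.StringInserts)
    events = win32evtlog.ReadEventLog(log, flags, 0)

win32evtlog.CloseEventLog(log)
```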

Integrate usage data into your product analytics strategy

Web applications emit a wealth of metadata and user interaction information that’s critical to understanding user behavior. However, parsing this data to find what is most relevant to your product analytics project can be challenging—what one product analyst might find useful, another might consider unnecessary noise.

Kubernetes autoscaling guide: determine which solution is right for your use case

Kubernetes offers the ability to scale infrastructure to accommodate fluctuating demand, enabling organizations to maintain availability and high performance during surges in traffic and reduce costs during lulls. But scaling comes with tradeoffs and must be done carefully: organizations that overprovision their workloads or clusters often wind up paying for resources that go unused.

Get complete Kubernetes observability by monitoring your CRDs with Datadog Container Monitoring

Custom resources are critical components in Kubernetes production environments. They enable users to tailor Kubernetes resources to their specific applications or infrastructure needs, automate processes through operators, simplify the management of complex applications, and integrate with non-native applications such as Kafka and Elasticsearch.
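
As a quick, hedged example of taking stock of the custom resources in a cluster, the sketch below lists CRDs with the official Kubernetes Python client; it assumes the kubernetes package and a kubeconfig with read access.

```python
# Illustrative sketch: list the CustomResourceDefinitions registered in a cluster.
from kubernetes import client, config

config.load_kube_config()
api = client.ApiextensionsV1Api()

for crd in api.list_custom_resource_definition().items:
    spec = crd.spec
    versions = ", ".join(v.name for v in spec.versions)
    print(f"{crd.metadata.name} (group={spec.group}, versions={versions})")
```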

A guide on scaling out your Kubernetes pods with the Watermark Pod Autoscaler

While overprovisioning Kubernetes workloads can provide stability during the launch of new products, it’s often only sustainable because large companies have substantial budgets and favorable deals with cloud providers. As highlighted in Datadog’s State of Cloud Costs report, cloud spending continues to grow, but a significant portion of that cost is often due to inefficiencies like overprovisioning.

Monitor Azure AI Search with Datadog

Azure AI Search is Microsoft Azure’s managed search service. In addition to tackling traditional search use cases, Azure AI Search also includes AI-powered features to make it a fully capable document catalog, search engine, and vector database. AI Search is highly interoperable—it can use models created in Azure OpenAI Service, Azure AI Studio, or Azure ML.
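
As a hedged sketch of querying an Azure AI Search index with the Python SDK, the snippet below runs a simple full-text search; the endpoint, API key, index name, and field names are placeholders, and the azure-search-documents package is assumed.

```python
# Illustrative sketch: query an Azure AI Search index (placeholders throughout).
from azure.core.credentials import AzureKeyCredential
from azure.search.documents import SearchClient

client = SearchClient(
    endpoint="https://<your-service>.search.windows.net",
    index_name="product-docs",                      # hypothetical index
    credential=AzureKeyCredential("<api-key>"),
)

# Simple full-text query; the same client also supports vector and hybrid queries.
for result in client.search("error budget policy", top=5):
    print(result["@search.score"], result.get("title"))
```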

Use Datadog App Builder to peek, purge, or redrive AWS SQS

This video showcases how developers can self-serve from an application to simplify the management of their AWS cloud resources. Rather than switching between tools or reaching out to another team for help, developers can take action directly from their observability tool, enabling faster resolution of application issues. We demonstrate how to build a simple app that lets them minimize disruptions by quickly taking action on their SQS queues in AWS, using insights provided by Datadog.
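
For context, an app like this typically wraps SQS operations similar to the boto3 calls sketched below; the queue URL and DLQ ARN are placeholders, boto3 and appropriate IAM permissions are assumed, and note that purging is destructive.

```python
# Illustrative sketch of the underlying SQS operations: peek, purge, and redrive.
import boto3

sqs = boto3.client("sqs")
queue_url = "https://sqs.us-east-1.amazonaws.com/123456789012/orders-queue"  # placeholder
dlq_arn = "arn:aws:sqs:us-east-1:123456789012:orders-dlq"                     # placeholder

# "Peek": receive messages without deleting them (they reappear after the visibility timeout).
peeked = sqs.receive_message(QueueUrl=queue_url, MaxNumberOfMessages=5)
for msg in peeked.get("Messages", []):
    print(msg["MessageId"], msg["Body"][:80])

# Purge: delete every message currently in the queue.
sqs.purge_queue(QueueUrl=queue_url)

# Redrive: move messages from the dead-letter queue back to their source queue.
sqs.start_message_move_task(SourceArn=dlq_arn)
```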

Monitor the cost of your public sector applications with Datadog Cloud Cost Management

As federal, state, and local government agencies work to modernize their digital infrastructure and applications, managing costs effectively remains a constant challenge. Federal directives like Cloud Smart indicate the need for public sector IT organizations to track and optimize their cloud spend. However, as an organization’s IT environment grows in complexity, it becomes difficult to correlate cost data and extract useful insights.

Troubleshooting RAG-based LLM applications

LLMs like GPT-4, Claude, and Llama are behind popular tools like intelligent assistants, customer service chatbots, natural language query interfaces, and many more. These solutions are incredibly useful, but they are often constrained by the information they were trained on. This often means that LLM applications are limited to providing generic responses that lack proprietary or context-specific knowledge, reducing their usefulness in specialized settings.
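
Retrieval-augmented generation (RAG) addresses this by fetching relevant documents at query time and grounding the prompt in them. The sketch below is purely illustrative: it uses simple keyword overlap where a real pipeline would use embeddings and a vector store.

```python
# Illustrative sketch of the retrieval step in a RAG pipeline.
def retrieve(question: str, documents: list[str], k: int = 2) -> list[str]:
    # Rank documents by how many query terms they share (stand-in for embedding search).
    q_terms = set(question.lower().split())
    ranked = sorted(documents, key=lambda d: len(q_terms & set(d.lower().split())), reverse=True)
    return ranked[:k]

def build_prompt(question: str, documents: list[str]) -> str:
    # Ground the prompt in the retrieved context before sending it to the LLM.
    context = "\n".join(f"- {doc}" for doc in retrieve(question, documents))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

docs = [
    "Refunds are processed within 5 business days of approval.",
    "Support is available Monday through Friday, 9am-6pm ET.",
]
print(build_prompt("How long do refunds take?", docs))
```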

This Month in Datadog - October 2024

On the October episode of This Month in Datadog, Jeremy Garcia (VP of Technical Community and Open Source) covers unified Error Tracking, Security Operational Metrics, and a new Datadog Serverless feature for retrying or redriving failed AWS Step Functions executions directly from Datadog. Later in the episode, Shri Subramanian (Group Product Manager) spotlights Datadog LLM Observability’s native integration with Google Gemini. Also featured are our blog posts Operator vs.

Create ServiceNow tickets from Datadog alerts

ServiceNow is a popular IT service management platform for recording, tracking, and managing a company’s enterprise-level IT processes in a single location. In addition to helping you manage your ServiceNow CMDB, Datadog also integrates with ServiceNow IT Operations Management (ITOM) and IT Service Management (ITSM), enabling you to automatically create and manage ServiceNow incidents and events from the Datadog platform.
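
As a hedged sketch, one common pattern is to add a ServiceNow notification handle to a monitor's message so alerts open tickets automatically. Below, a monitor is created with the Datadog Python API client; the query, threshold, and the exact @servicenow handle name depend on your environment and integration configuration.

```python
# Illustrative sketch: create a Datadog monitor whose notifications route to ServiceNow.
# Assumes the datadog-api-client package and DD_API_KEY/DD_APP_KEY in the environment.
from datadog_api_client import ApiClient, Configuration
from datadog_api_client.v1.api.monitors_api import MonitorsApi
from datadog_api_client.v1.model.monitor import Monitor
from datadog_api_client.v1.model.monitor_type import MonitorType

monitor = Monitor(
    name="High error rate on checkout service",
    type=MonitorType.QUERY_ALERT,
    query="avg(last_5m):sum:trace.http.request.errors{service:checkout}.as_count() > 50",  # placeholder query
    message="Checkout error rate is elevated. @servicenow-itsm",  # handle name is hypothetical
)

with ApiClient(Configuration()) as api_client:
    created = MonitorsApi(api_client).create_monitor(body=monitor)
    print(created.id)
```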

How we use Scorecards to define and communicate best practices at scale

In modern, distributed applications, shared standards for performance and reliability are key to maintaining a healthy production environment and providing a dependable user experience. But establishing and maintaining these standards at scale can be a challenge: when you have hundreds or thousands of services overseen by a wide range of teams, there are no one-size-fits-all solutions. How do you determine effective best practices in such a complex environment?

Datadog on Building Reliable Distributed Applications Using Temporal

Temporal is an open source platform for building resilient and reliable distributed systems. Datadog started using Temporal in 2020 as the foundation for our internal software delivery platform. Since then, it has been widely adopted internally as a platform that any engineering team can use to build their systems. In this Datadog on episode, Ara Pulido chats with Loïc Minaudier, Senior Software Engineer on the Atlas team, which is responsible for providing a developer platform on top of Temporal, and Allen George, Engineering Manager on the Datadog Workflows team.
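
For a flavor of what building on Temporal looks like, here is a minimal, illustrative workflow and activity using the Temporal Python SDK; the names are made up, and a running Temporal server plus a worker registered for these definitions are assumed.

```python
# Illustrative sketch of a Temporal workflow and activity (temporalio SDK assumed).
from datetime import timedelta
from temporalio import activity, workflow

@activity.defn
async def charge_card(order_id: str) -> str:
    # Side-effecting work lives in activities, which Temporal retries on failure.
    return f"charged {order_id}"

@workflow.defn
class OrderWorkflow:
    @workflow.run
    async def run(self, order_id: str) -> str:
        # Temporal persists workflow progress, so this survives worker crashes
        # and transient failures of the activity.
        return await workflow.execute_activity(
            charge_card, order_id, start_to_close_timeout=timedelta(seconds=30)
        )
```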

Introducing the Datadog Architecture Center

To prevent visibility gaps in your cloud environment, you need to efficiently deploy observability solutions that integrate easily with key technologies in your stack and scale reliably with new applications and migrated workloads. But observability deployments can be complex, often requiring deep and specific knowledge that may not be available within your teams.

Track and troubleshoot MongoDB performance with Datadog Database Monitoring

Many modern applications rely on MongoDB and MongoDB Atlas to manage growing data volumes and to provide flexible schema and data structures. As organizations adopt these and other NoSQL databases, effective monitoring and optimization become critical, especially in distributed environments.

Ensure high service availability with Datadog Service Management

Adopting a cloud-based, distributed architecture may help your organization scale quickly, but it can also add complexity. Correlating telemetry, security signals, and alerts across services often proves difficult, resulting in slower issue remediation. Additionally, when something goes wrong, figuring out who to contact—for example, the on-call responder or the service owner—may become needlessly time-consuming.