Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Best practices for cloud-based network monitoring

When cloud adoption grew rapidly in the early 2010s, businesses started facing new challenges. Managing distributed systems, monitoring cloud-hosted applications, and ensuring network performance across global infrastructures became more complex. This shift in how businesses run IT operations creates a clear need for cloud-based network monitoring tools that can give you real-time insights into performance, security, and overall system health.

The Contradictions of VDI: Is Your Investment Delivering the Desired Performance?

The promise of Virtualized Desktop Infrastructure (VDI) is deeply compelling - by virtualizing desktops, businesses can simultaneously give employees more flexibility and improve their productivity, while also reducing costs. However, despite significant VDI investment, too many organizations are still unable to realize these promised benefits.

Build user trust and foster transparency with StatusIQ's 2024 enhancements

2024 has been a transformative year for StatusIQ, marked by continuous innovation to meet the growing needs of our users. This year’s enhancements aim to simplify incident communication, improve team efficiency, and, most importantly, build trust with your users. From providing deeper insights into status history to enabling seamless platform migrations, these updates reflect our unwavering commitment to empowering teams and stakeholders.

Comparing Azure NSG and VNet Flow Logs

Azure VNet flow logs significantly improve network observability in Azure. Compared to NSG flow logs, VNet flow logs provide broader traffic visibility, enhanced encryption status monitoring, and simplified logging at the virtual network level enabling advanced traffic analysis and a more comprehensive solution for modern cloud network management.

AWS re:Invent '24: Generative AI Observability, Platform Engineering, and 99.9995% Availability

I attended Amazon Web Services re:Invent conference. This is AWS's annual user conference, which takes over most of Las Vegas for a week. There’s a lot to do and take in—customer stories galore, new tech, learning different use cases, and all the walking. But you’re here to hear what I learned, so I’ve broken it down into sections. Enjoy!

Monitor your Pinecone vector databases with Datadog

Pinecone is a vector database that helps users build and deploy generative AI applications at scale. Whether using its serverless architecture or a hosted model, Pinecone allows users to store, search, and retrieve the most meaningful information from their company data with each query, sending only the necessary context to Large Language Models (LLMs). By providing the ability to search and retrieve contextual data, Pinecone enables you to reduce LLM hallucinations and enhance data security.

How to support a growing Kubernetes cluster with a small etcd

Etcd plays a critical role in your Kubernetes setup: it stores the ever-changing state of your cluster and its objects, and the API server uses this data to manage cluster resources. As your applications thrive and your Kubernetes clusters see more traffic, etcd handles an increasing amount of data. But etcd’s storage space is limited: the recommended maximum is 8 GiB, and a large and dynamic cluster can easily generate enough data to reach that limit.