Operations | Monitoring | ITSM | DevOps | Cloud

Blog

The four pillars of observability

When discussing the technical foundations of observability, several key components, often referred to as the “pillars,” emerge. While there is no universally agreed-upon number of pillars, this post will focus on four fundamental elements: metrics, logs, traces, and profiles. Due to the vast amount of data generated by metrics, logs, and traces, sampling is often employed to reduce data volume while maintaining representative information.

Ask the Expert: How Can I Better Respond to Emergency Medical Technicians' Calls for Help in Europe?

When an ambulance is dispatched to a call, the assumption is that it will be fully stocked with the right equipment and medicines for emergency medical technicians, EMTs for short, to stabilize patients or perform life-saving measures in the field. But with the increased number of calls for service in many parts of the world, it’s becoming more difficult for EMTs to maintain proper stock in their ambulances – and that has patient care implications.

The Meaning of Monitoring & Observability in The Financial Services Industry

Monitoring and Observability of messaging and middleware has and will continue to be a function of increasing importance and this is especially true for organizations in the Financial Services industry. In the financial services industry, observability refers to the ability to monitor, measure, and analyze the performance, health, and security of financial systems, applications, messaging and middleware which power long running processes in real-time.

Why Next-Generation AIOps is a Game Changer for Managing IT Complexity

There is immense pressure on IT. Now more than ever, IT teams bear the brunt of the seismic shift in how people live and work. Delivering service quality while driving innovation is imperative. Yet, IT teams are continually fighting outage fires, managing day-to-day events, updating legacy systems, and navigating IT complexity – while trying to innovate. AIOps and cloud computing sought to address these challenges.

Windows Automation: Comparing Methods & Tools for Automating Windows Infrastructure

Finding the right automation tool for Windows environments can be frustrating. Legacy systems, a GUI-centric design, and proprietary tooling are a few of the reasons automating Windows infrastructure can be challenging – especially in environments where Windows isn’t the only OS. Many organizations struggle to choose tools that will let them automate Windows infrastructure without contributing to tool sprawl.

Rightsizing & Handling Resource Allocation in Kubernetes

Handling resource allocation within Kubernetes clusters is of paramount importance. Proper resource allocation in Kubernetes ensures optimal performance and efficient utilization of the underlying infrastructure, safeguarding against capacity issues and application downtime. In contrast, improper resource allocation can lead to a plethora of challenges, from wasted resources to compromised application performance.

Splunk Named a Leader in the Gartner Magic Quadrant for Observability Platforms

"Transformative Solution" says a Director of IT in a $30B+ retailer. "Best Monitoring and Observability Tool > Splunk," is how a software engineer in a software company labels it. These are only a couple of the terms our customers use when describing the value they are getting from Splunk. With these descriptions in mind, we are elated that Splunk has been named a Leader in the 2024 Gartner Magic Quadrant for Observability Platforms for the second year in a row in this category.

Don't get caught in the dark: Lessons from a Lumen & AWS micro-outage

While major outages like the recent CrowdStrike incident dominate headlines, those of us in the trenches ensuring Internet Resilience know that most of our issues are not necessarily global but localized by geography, autonomous systems, or something else. Micro-outages – those elusive, localized incidents – can pose the most persistent threat to observability.

How DPM monitoring helps you manage your metrics volume

At Sumo Logic, we’re committed to helping you scale without breaking your budget. As you may have heard, we recently launched Flex Licensing, a first-of-its-kind economic model that offers free, unlimited log data ingest so different teams can capture and analyze critical data across their enterprise in one place. We’re also committed to tackling related challenges raised by other data sources — like metrics.

Understanding and Controlling AWS Transit Gateway Costs with Kentik

AWS Transit Gateway costs are multifaceted and can get out of control quickly. In this post, discover how Kentik can help you understand and control the network traffic driving AWS Transit Gateway costs. Learn how Kentik can help you understand traffic patterns, optimize data flows, and keep your Transit Gateway costs in check.