
Latest Posts

Network Speed Uncovered: What Is It Really & How to Measure It

When we talk about network performance, one metric that comes up constantly is "network speed." It reflects how fast your applications run, how swiftly you can transfer data, and, essentially, how responsive your digital world is. Speed in networking is like the speedometer in a car: it tells you how quickly things are moving. However, network speed isn't just about raw velocity.

How to Build an Effective Network Monitoring Dashboard

Whether you're a small startup or a large enterprise, the health and performance of your network infrastructure are critical to your success. This is where network monitoring comes into play. Network monitoring involves the continuous observation and analysis of network traffic, devices, and performance metrics to ensure smooth operations, detect anomalies, and troubleshoot issues promptly.

A guide to scaling Grafana Alloy deployments across multiple hosts

Last week we introduced Grafana Alloy, our distribution of the OpenTelemetry Collector with built-in Prometheus pipelines and support for metrics, logs, traces, and profiles. We’re excited to see the community embrace Alloy, and we want to help them use and scale it as easily as possible. Many developers who need to deploy and manage software across several hosts turn to Ansible for its ease of use and versatility.

ChatGPT & the Enterprise: Balancing Caution and Innovation in the Age of AI

OpenAI's groundbreaking AI tool ChatGPT was officially launched on November 30th, 2022. However, it wasn't until the early months of 2023 that its impact truly began to ripple through the global consciousness. This transition from a novel technological release to a sensation that captivated the world was both rapid and remarkable. The metrics speak volumes: According to Similarweb, ChatGPT garnered around 266 million visits in December 2022.

The role of psychological safety in incident response

Incidents impacting your customer- and user-facing services can be stressful, both for the responders on your team who are working on a resolution and for the other stakeholders in your business. For teams to solve incidents quickly and effectively, responders need to be able to trust each other, and stakeholders have to trust the responders. This level of trust is hard to cultivate if your organization doesn’t have a significant amount of psychological safety.

Maximize Kubernetes resource efficiency with Spot Ocean's Accelerated Scale Down

One significant challenge that every dynamic, fast-paced business in the cloud faces is efficiently managing their clusters’ workloads while minimizing unnecessary expenses. But it’s not just overages that these businesses must worry about. Especially common is the underutilization of resources, which can lead to unnecessary costs and inefficiencies.

Mattermost's cloud optimization journey: Pillars of success, future strategies & lessons learned

Mattermost has embarked on a transformative journey in cloud optimization. This journey is marked by strategic initiatives, innovative approaches, and valuable lessons, all aimed at enhancing efficiency and reducing costs. This blog post explores the successful strategies that have guided our cloud optimization efforts. It also highlights our future direction with an emphasis on ARM/Graviton workloads and shares insights from our experiences, particularly regarding spot instances.

Introduction to Observability

These days, systems and applications evolve at a rapid pace. This makes analyzing the internal performance of applications complex. Observability emerges as a path to efficient and effective operational insights. Imagine a team of doctors monitoring a patient’s vitals—heart rate, temperature, blood pressure. These readings, combined with observation of symptoms, paint a picture of the patient’s health. This allows doctors to diagnose issues and provide care.

What's New in Kubernetes 1.30?

Kubernetes 1.30 brings a plethora of enhancements, with 58 new and improved features in total. Of these, several are graduating to stable, including the highly anticipated Container Resource Based Pod Autoscaling, which refines the capabilities of the Horizontal Pod Autoscaler by focusing on individual container metrics. New alpha features are also making their debut, promising to revolutionize how resources are managed and allocated within clusters.