Operations | Monitoring | ITSM | DevOps | Cloud

Server Monitoring Explained: How to Outwit Downtime Before it Strikes

Server monitoring is the practice of continuously tracking server health, performance, and resource usage to catch issues before they cause downtime. When a server crashes, it can mean lost revenue, frustrated users, and a mad scramble to fix the problem. The right server monitoring tool helps your IT team stay ahead by providing real-time alerts and visibility into critical metrics. In this guide, we’ll break down how server monitoring works, why it matters, and what to look for in a solution.

COREDUMP #005: The Current Realities of Cellular IoT

Join the Founders of Memfault and special guest Fabian Kochem, Director of Product Strategy at 1NCE, as they break down the latest advancements in cellular IoT. This conversation covers key considerations for businesses adopting cellular, common pitfalls, and the best tools to ensure connectivity success.

Why IoT Security Can't be Left To Users

A web-connected building intercom system is leaving homes across the US and Canada vulnerable to remote attacks—all because of one major security flaw. François Baldassari shares how weak IoT security can put thousands at risk and what manufacturers must do to fix it. Watch to learn why secure by default should be the standard for all connected devices.

Tech Debt as Innovation? How Netflix Turns It Into Opportunity

At Civo Navigate San Francisco 2025, Lisa Smith, from Netflix shares a fresh perspective on how tech debt can drive innovation instead of slowing teams down. Learn how to staff legacy systems, handle tricky deprecations, and evaluate the “tech debtiness” of your infrastructure to unlock growth and efficiency. Discover how to turn tech debt into a strategic advantage for your engineering team.

New In Playwright 1.51 - Can AI Fix Failing Tests With The New Error Prompt?

In this episode, Stefan Judis, Playwright ambassador, explores the new 'Copy as prompt' feature in Playwright 1.51. This feature allows you to copy a pre-filled LLM prompt with all the context of a failing test case. Does this mean that AIs can take over and magically fix all the failing tests? Let's find out!

Building optimized LLM chatbots with Canonical and NVIDIA

The landscape of generative AI is rapidly evolving, and building robust, scalable large language model (LLM) applications is becoming a critical need for many organizations. Canonical, in collaboration with NVIDIA, is excited to introduce a reference architecture designed to streamline and optimize the creation of powerful LLM chatbots. This solution leverages the latest NVIDIA AI technology, offering a production-ready AI pipeline built on Kubernetes.

Unlocking Edge AI: a collaborative reference architecture with NVIDIA

The world of edge AI is rapidly transforming how devices and data centers work together. Imagine healthcare tools powered by AI, or self-driving vehicles making real-time decisions. These advancements rely on bringing AI directly to edge devices. However, building a robust architecture for diverse edge environments presents significant hurdles. This blog introduces our new reference architecture, designed to simplify edge AI deployment.

Using CircleCI to test and deploy Python serverless functions on Microsoft Azure

Serverless computing simplifies app development by abstracting away server management. Azure Functions provides a robust platform for event-driven, on-demand code execution. In this tutorial, we’ll create and deploy a Python-based Azure Function—one that parses incoming JSON—using CircleCI. For a more granular and enable programmatic access to Azure resources, we’ll use service principal for secure authentication and the Azure CLI orb to streamline our CI/CD pipeline.

Proactive Monitoring: How DinoCloud Uses CloudWatch to Save Clients Money

At MetricFire, we love talking with engineers about their tech stacks, SRE challenges, and how they approach infrastructure monitoring. Recently, we had a great chat with Yoimer Roman from DinoCloud, a Latin American company that helps clients make smarter business decisions by leveraging AWS CloudWatch monitoring. Yoimer wears many hats: mentoring his team on all things AWS, designing custom cloud environments, and bridging the gap between technical challenges and non-technical stakeholders.