
Latest News

Introducing Network Containers in Tidal LightMesh IPAM

As networks expand across on-premises and cloud environments, many organizations find themselves struggling with scattered, unstructured networks. Often, networks grow in silos, with subnets being created independently in AWS, Azure, and other environments over time, leading to a messy and hard-to-manage structure. This is especially true for enterprises undergoing mergers and acquisitions, where integrating entirely separate network systems can quickly turn into an administrative headache.

The Incident Maturity Model

I want to walk you through how incident management has evolved, drawing from real data and the experiences of some of the most sophisticated tech organizations out there. I'll also introduce you to a framework we’ve developed at incident.io: the Incident Maturity Model. This framework is the result of thousands of conversations with companies and provides a clear roadmap to help your organization improve its incident management practices—no matter where you're starting from.

The Comprehensive Guide to Understanding IT Incidents

In today’s world, where technology underpins nearly every aspect of business, IT systems play a critical role in ensuring smooth operations. But what happens when something goes wrong? When systems fail or services are disrupted, businesses face what’s commonly known as an incident. For someone who is not technical, the idea of an IT incident can seem scary. However, handling one is a simple and organized process when explained clearly.

Security vulnerability uncovered and patched in the golang.org/x/crypto/ssh package

Platform.sh teams always strive to ensure a safe space for all developers within our product. That consistent diligence led the Platform.sh Engineering team to discover a security vulnerability in the golang.org/x/crypto/ssh package on 5 September 2024. While investigating an unexpected "Panic: runtime error: invalid memory address or nil pointer dereference" message in our edge proxy, the engineers discovered a misimplementation of the PublicKeyCallback function.

How to design apps with Docker containers?

Do you want to streamline your app development process to make it efficient, scalable, and reliable? Building and deploying applications without the right tools quickly becomes complex and resource-intensive. Docker containers address these challenges by providing effective solutions. The 2023 Stack Overflow Developer Survey reveals that 42% of developers rely on Docker, demonstrating its pivotal role in modern workflows. Similarly, Datadog states that over 25% of organizations use Docker in production, which underscores its widespread adoption.

The 4 Golden Signals: All You Need to Know

As a team, we have spent many years troubleshooting performance problems in production systems. Applications have become so complex that you need a standard methodology to understand performance. Our approach to this problem is called the Golden Signals. By measuring these signals and paying very close attention to these four key metrics, providers can simplify even the most complex systems into an understandable corpus of services and systems.

The Art of On-Call Collaboration: 5 Strategies for Team Health Improvement

In a fast-paced work environment, effective on-call management is crucial for maintaining seamless operations. Whether you’re in IT or any other industry that requires constant availability, the on-call system ensures that teams can respond to critical incidents efficiently. However, achieving optimal on-call management isn’t just about being available; it’s also about collaboration, communication, and ensuring team health.

Top tips: Must-know holiday hacks for IT admins

Top tips is a weekly column where we highlight what’s trending in the tech world and list ways to explore these trends. This week, we explore ways in which IT admins can optimize the IT infrastructure during the holidays while leaving room for enjoyment. December is here, and the holiday spirit is in the air. While you prune your Christmas tree at the start of the holiday season, your IT infrastructure requires consistent pruning throughout the year by IT admins.

New in Microsoft Teams: Automatically Create Group Chats for Incident Communication

When we launched our fully-featured Microsoft Teams integration in May, our goal was clear: to provide enterprise teams with the robust and comprehensive toolset they need to manage incidents faster and more effectively – right where they work. It’s all part of our commitment to building the leading enterprise incident management solution. Today, we’ve enhanced our Teams integration by adding the ability to automatically create Microsoft Teams group chats directly from your Runbooks.

What is vulnerability management?

Today, computers are more sophisticated, interconnected and complex than ever. This makes them more capable than ever, but it also comes with a downside: their attack surface is larger than ever, leading to an elevated risk of cyber incidents. Therefore, your approach to managing vulnerabilities needs to be holistic and must prepare for all the ways in which vulnerabilities could cause damage to users, systems or your organization’s operations and reputation.