Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Tell data-driven stories with Collaborative Notebooks

Whether you’re deciding on follow-up items for a postmortem, submitting a request-for-comments for review, or creating an executive report, making sense of all the available data and collaborating with others to tell a clear, data-driven story can be difficult. When there are multiple stakeholders and teams working together, it can be hard to manage the back-and-forth process of feedback and revision to ensure you have up-to-date information.

4 Questions with Jeff Smith

This is the first in a series of interviews with experts about reliability, incident response, and related topics. Jeff Smith has been in the technology industry for over 20 years, oscillating between management and individual contributor. Jeff currently serves as the Director of Production Operations for Centro, an advertising software company headquartered in Chicago, Illinois. Previously he served as the Manager of Site Reliability Engineering at Grubhub.

From SysAdmin to SRE: How to evolve your skillset

Are you wondering what it takes to become an SRE from a SysAdmin background? Our latest blog, covers the growth areas and technical skills needed to successfully transition to an SRE role. The last decade has seen widespread adoption of SRE practices based on the best practices laid out by Google. Many SysAdmins have observed this trend and are now evaluating becoming SREs. Which gives rise to the question how much of a skills overlap is there between an SRE and a SysAdmin?

Digital Transformation Is Driving Operational Excellence in Customer Service Teams

Digital initiatives are on the rise, with spending on the public cloud expected to increase by 18% in 2021. These initiatives are also driving customer service teams towards operational excellence with a focus on customer advocacy and engagement. As more companies embrace the cloud and digital-first operating models, customer service organizations are finding they need to be more proactive in order to create great customer experiences.

December 2020 Update: Multiple teams, new user management and enterprise licensing

Our December update lets it rip once again: From now on, you can create and manage multiple teams in your SIGNL4 subscription. For this purpose, we have also added a new user role and enhanced our web app. This makes it much easier to centrally manage multiple departments in your organization. Creating, setting up and supporting these departments or teams thus becomes a breeze. All details can be found in this blog article.

3 Tips for MSPs to Handle Cyber Attacks

Managed service providers (MSPs) face ransomware, malware, and other cyber attacks — and these issues can affect both MSPs and their clients. To understand the full impact of an MSP cyber attack, let’s examine the topic in more detail. Businesses use MSPs to manage IT infrastructure and other resources. In doing so, businesses outsource the maintenance and care of applications, networks, security, and other aspects of their IT operations to a third-party.

Observability and AI: Better Together

There’s an AI-led developer and operations (DevOps) evolution afoot which is stoking SREs’ increasingly critical efforts to assure and improve the customer experience by automating the toil out of observability. This movement feeds on a supercharged process of turning telemetry into actionable insight by automatically drawing anomalies, changes and events out of the full-stack event and telemetry data, and analyzing it for correlation and causality.

Let's Talk: Full-Service Ownership

We recently surveyed 700 DevOps and IT practitioners around the world and found that more than 80% of organizations have experienced a significant increase in pressure on digital services since the start of the pandemic. Compared to 6 months ago, respondents reported a 47% increase in the number of daily incidents, and 62% of DevOps and IT responders work at least an extra 10 hours per week resolving incidents.