Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Why AIOps is Worth the Investment During an Economic Downturn

Recent talks of an economic softening have left IT leaders concerned about the future of their enterprises. That concern is understandable — tech layoffs create near-daily headlines at this point, with top companies rolling back their operations and rolling up their sleeves to focus on mission-critical expenses. And for many in ITOps, that means cutting tools.

Quick! Grab all the evidence: Capturing application state for post-incident forensics.

Everyone loves a good mystery thriller. Ok, not everyone – but Hollywood certainly does. Whether it’s Sherlock Holmes or Hercule Poirot, audiences clearly enjoy a page-turning plot of hunting down the culprit for some heinous crime.

5 Best practices for developing a culture of continuous improvement

How do you create a great engineering team? Exclusively hire brilliant, tenured computer science PhDs. There we solved it. You can skip the next 400 words. (I can hear my college professor in my head saying “Humor might not be your strong suit”) Building a great engineering team isn’t easy. Understatement of the year. It’s not even a problem to be solved per se. We need to think about it as preparation to solve an infinite set of constantly evolving problems.

3 examples of DevOps automation

Automating processes and the tools that enable them is vital for empowering highly productive teams. The right automation tools and workflows help DevOps and SRE teams minimize repetitive tasks, improve monitoring capabilities, enable continuous integration/continuous deployment (CI/CD), and work with massive volumes of data.

Your non-technical teams should be using incident management tools, too

For many businesses across the world, incident management is something that’s usually left to engineers. These teams are on the front lines, declaring, managing, and resolving all sorts of incidents across the org, regardless of where it originates or what form it takes. But there’s a glaring issue with this approach. Outside of technical teams, people across organizations aren’t accustomed or trained to use the word “incident” whenever an issue comes up.

Knightscope Relies on PagerDuty to Keep Their Robots Rolling

As security becomes more advanced and available, companies must look for ways to be more efficient with their resources in order to stay competitive. With challenges that limit the capabilities of companies, such as limited employee resources and low customer tolerance for delays in services, reliable and affordable solutions are necessary. In this case, it means disrupting the traditional security industry. Organizations are achieving their goals by relying on automation and technology.

Best Practices for Managing Incidents at Varying Severity Levels

A software incident is an event or unplanned interruption that causes the software to deviate from its intended behavior, affecting the quality of service. With the ever-changing nature of the software industry, incidents are inevitable, particularly in teams that practice iterative software development cycles with constant releases to production. This necessitates a robust incident management strategy.

Announcing our improved Schedules & On-Call Rotations

Hey folks! We are super excited to announce that our schedules feature has gone through a bit of an update. Well, more than a bit 🙂. We’ve gone through the feature with a fine-toothed comb and introduced a bunch of UI and functional improvements which we hope will help you achieve one thing: set up, edit and manage your on-call schedules at scale in a matter of minutes (Yes, that was three things but it was tough to condense it to ONE thing)