Operations | Monitoring | ITSM | DevOps | Cloud

Let's Talk AIOps: Part 1: What IS AIOps, Exactly?

This is the first in a two-part blog series deconstructing AIOps for ITOps leaders. If you gave me a dollar for every company that claims that they use “A.I.,” I’d be doing pretty well. But as a marketer, I can’t help but be a little skeptical about those claims. Let me explain.

Working with multiple on-call teams using Zabbix and iLert

This post outlines how to use Zabbix and iLert with multiple on-call teams, where each team is responsible for a set of host groups in Zabbix, and therefore, will only receive alerts for the services it is responsible for. But first, let’s start with the basic needs when being on-call.

Alerts vs Incidents vs ITSM

In order to effectively address production issues in your application, you need to have a strong incident response strategy. Incident response starts with an alert which leads to mobilization and response, and finally results in a record of all that happened and was learned from addressing issues. In this session of Dissecting DevOps, learn about the lifecycle of incidents from alert to post mortem and why incident response is as much a strategy as a process.

Retail Industry Trends 2020: All-In on Digital Since COVID-19

This is the first in a series of posts we’ll be publishing on trends we’re seeing in the retail industry and how IT organizations tasked with deploying and maintaining flawless digital customer experiences can take advantage of PagerDuty to ensure always-on reliability. It’s been a tough year for retail.

Fiserv Eliminates Ticket Overload with AIOps

Fiserv, the Fortune 500 payments and financial technology provider, needed to streamline and automate its IT incident management process to detect and fix issues earlier and more quickly. The incident management workflow was complex, primarily because mergers and acquisitions over the years had made Fiserv’s IT environment very heterogeneous. “The challenges we were facing were enormous,” IT Director Chris Kreps says.

Monitoring IoT devices using heartbeats and MQTT gateways

When working with IoT (internet of things) devices one of the key issues is to keep track of the health of all installations. Most of the time, especially with smaller devices, the applications (firmwares) are flashed for a single time during setup and stay untouched at their location of action for a long while.

AIOps - Done the Self-Service Way

Last week I went camping with some friends. One of them did the shopping for all of us, so I sent him my share using a payment app. It took me less than 2 minutes to complete the transaction. A few years ago, a similar transaction would have me going to the bank to complete the task, or at a minimum, calling a bank teller and having him do it. Try to imagine a bank asking its customers to do any of these things today. It would probably lose all its customers in no time.

How Enterprise Alert solves typical problems in network monitoring

A new article in the September issue of LANLine (“Automation creates productivity”) summarizes typical challenges and problems in network monitoring very well and is really worth reading. I would like to briefly discuss some of the issues raised and how our product Enterprise Alert® was developed to solve them. Of course, it is important that especially critical alarms are processed promptly. Even small problems can quickly lead to major failures.

University's IT Teams Struggle as Fall 2020 Semester Kicks Off

COVID-19 has left colleges and universities scrambling to come up with alternative plans for the 2020 fall semester. Some are opting for 100% online learning while others are taking more of a hybrid approach with a combination of in person and online classes. While schools face an abundance of change this academic year, it’s not just the faculty and students that need to adapt but also the entire IT infrastructure for which the school is run.

Your Burning Questions about AIOps and Observability Answered

A fireside chat to discuss use cases and deployment tips for AIOps with observability generated a stream of compelling questions from attendees, which the Moogsoft hosts answered with depth and expertise. Combining AIOps analysis with detailed observability data is key for DevOps and SRE teams to attain continuous service assurance, so Moogsoft just published a new ebook about this topic titled “Observability with AIOps For Dummies.”