The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Deliver CASB policy alerts via OnPage to ensure rapid response

Jul 20, 2020 By OnPage In OnPage

A simple, efficient way to deliver CASB policy alerts, so that SOC teams are notified of policy breaches immediately and can start incident triage and remediation using the OnPage incident alert management system. About OnPage: Organizations large and small are adopting OnPage's intelligent alerting solution, ensuring that encrypted, secure critical incident notifications are NEVER missed and are always delivered to the right person at the right time.

PagerDuty Paying Dividends for Form3's Digital Payment Platform

Jul 16, 2020 By Isabella Dossola In PagerDuty

Your payment systems have slowed to a crawl, customers are getting impatient and abandoning their shopping carts both online and in stores, and you’re losing money every minute this problem goes on. Behind the scenes, technical responders are scrambling to resolve the issue before it impacts more customers—and before even more money is lost.

Alarming and Incident Reaction on Azure - An architecture Guide for Enterprise Alert on Azure by Patrick Fontana

Jul 16, 2020 By Patrick Fontana In Derdack

More and more companies are moving business-critical communication tools into cloud-based environments, hosted either in a partner datacenter or in a public cloud. The main deciding factors between these two options are trust in the provider and the cost of the solution.

Communicate incidents in real-time with StatusIQ

Jul 15, 2020 By Site24x7 In Site24x7

Learn how you can use status pages to keep your users in the loop during a downtime and transparently communicate incident status.

Adam Frank Demos Moogsoft Express June 24, 2020

Jul 15, 2020 By Moogsoft In Moogsoft

As part of a live launch event, Adam Frank, Moogsoft's VP of Product and Design, demoed the latest AIOps & Observability solution for cloud-first companies: Moogsoft Express. Moogsoft Express helps DevOps and SREs detect app performance problems, keep software pipelines humming and honor customer SLAs — all while being extremely simple to use.

Building Automated Monitoring with Icinga and iLert

Jul 14, 2020 By iLert In iLert

How many servers can one system administrator manage? The question is hard to answer because it depends heavily on the tasks involved. It is clear, however, that the number of servers one engineer can manage has increased tremendously over time and is still growing. Public and private clouds, in combination with automation tools, enable us to automate many daily tasks. In a modern IT infrastructure, almost everything can, and should, be automated.

FYI: Email Alerting Isn't Enough

Jul 14, 2020 By Christopher Gonzalez In OnPage

Email alerting is an inefficient way to receive and address critical alerts. Email inboxes tend to get flooded with “clutter,” as irrelevant messages bury urgent incident notifications. Incident management procedures require incident management systems, ensuring that urgent issues are immediately addressed. Yet, some services are reluctant to say goodbye to email alerting and its inefficiencies. This is the case with Google Voice, which recently solidified its commitment to email alerting.

Event Chaos or Enrichment? BigPanda's CTOs Can Help You Decide

Jul 13, 2020 By Jason Walker and Scott Stradley In BigPanda

In our recent “IT Ops Demystified – Event Chaos or Enrichment?” webinar, our field CTOs discuss how enrichment can help reduce operational costs by an order of magnitude. Here is a quick overview of all the goodness that you’ll be watching.

What is SRE?

Jul 13, 2020 By Rich Burroughs In FireHydrant

Site Reliability Engineering (SRE) is a practice for managing the reliability of systems that began at Google in the early 2000s. Ben Treynor Sloss from Google started the first SRE team and coined the name.

Your Mac Is Fast. You Are Not.

Jul 12, 2020 By Brian Smith In Moogsoft

I can tell you the day I knew I would be a Systems Administrator (the term SRE hadn’t been invented yet). My Linux professor, a brilliant engineer at NASA, said: "The best system administrators are the laziest." He went on to qualify that statement, but I had stopped listening. My fate was sealed.
