Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Automation: The Key to Modern IT

Automation is everywhere in our day-to-day IT practices. Many of the processes that have been created for managing hardware and software components were designed, or at least initiated, in a time when managing only a few instances of an application was the norm. When we look at the work required to create, deploy, and maintain applications at a modern scale, the shortcomings of these processes become apparent.

What is IT Operations Management (and should you prioritize it)?

IT operations management (ITOM) involves the administration of technology applications and components across an enterprise. To effectively manage your IT operations, you must prioritize capacity management, security, availability, and cost-control of all IT infrastructure and assets. Yet, doing so can put a strain on your enterprise. At AlertOps, we offer a major incident management and response platform designed to help your enterprise manage its IT operations.

Is your online gaming platform "Chaos Monkey"-proof?

Try to imagine a bunch of monkeys running around your data center, pulling cables, trashing routers and wreaking havoc on your applications and infrastructure. Ever more crucial in these days of heated competition between online gaming operators, is player experience. Continuity of operations is “Uber-Alles” and avoiding churn, due to service disruption, is the organizational mantra.

Zen Your Life With IT Event Noise Reduction

IT incident responders have been inundated with alerts since the start of the COVID-19 pandemic. These engineers must dig through their messages to collect and respond to real alerts for real critical events. This process wastes time and prolongs incident response. The objective is to focus on IT event noise reduction to recognize and resolve real incidents promptly.

Incident Management in Mattermost: Creating an Incident Playbook

The idea behind Incident Management is to be ready. Not ready for anything, as that can be an unrealistic expectation, but ready to respond when the unexpected inevitably happens. DevOps teams often create incident playbooks in order to ensure they are as ready as possible to handle situations as they arise. Luckily, there is some amazing documentation on how to do just that from our friends at PagerDuty.

Escalating Prometheus alerts to SMS/Phone/Slack/Microsoft-Teams via AlertManager and Zenduty

Prometheus is by far, one of the most popular open-source monitoring tools used by millions of engineering teams globally with a robust community and continued adoption and evolution. We at Zenduty shipped our Prometheus integration integration a while back and we’re happy to report that the adoption of our Prometheus integration has been absolutely through the roof!

Improve Customer Satisfaction With Customer Service Incident Commanders

The global pandemic has drastically accelerated digital transformation initiatives and forced organizations to reimagine customer service by having them take on the incident commander role in managing and responding to customer issues and engaging with customers. In addition to prioritizing digital services, many businesses have migrated to the cloud to increase business agility, develop and deliver new features faster, and meet the growing demands of end users.

An end-to-end incident in Blameless and PagerDuty

PagerDuty is a leading on-call management platform that aggregates monitoring and alerting data, notifies on-call teams, and accelerates incident resolution. The platform is used by thousands of teams responsible for software experiences. It integrates incident triage with rapid responder mobilization, so teams can resolve incidents in real time.

Curb alert noise for better productivity : How-To's and Best Practices

On the quest to provide the best uptime, software platforms depend on complex interconnected microservices. This often leaves them vulnerable to cascading failures creating a massive deluge of alerts from monitoring tools when things go wrong. In this blog, we explore how Squadcast can be configured to curb alert noise for better productivity with the help of the most advanced deduplication features.