Operations | Monitoring | ITSM | DevOps | Cloud

Blog

Announcing PagerDuty's Solution For HybridOps

For years, traditional infrastructure provisioning and management followed a specific operating model that depended on Network Operations Centers (NOCs) to process operational events. As enterprise companies started to undergo digital transformation, the cloud created a different operating model: One that was much more agile and, some would argue, more efficient—and would replace all other operating models to create IT homogeneity.

Can AIOps Tools Solve Your Service Desk Problems

The state of IT Operations today is loud, noisy and fragmented. This means that your service desk teams, that may be using ServiceNow, JIRA or other tools, are suffering. Sounds all too familiar, right? We know that the noise and chaos exist, and we know that they cause major headaches for service desk teams. The real question is, can we control it with AIOps? To put it simply, yes, the RIGHT AIOps tools can help.

Context is Key: Additive vs. Subtractive Topology

Understanding the context of an IT incident can greatly reduce the MTTR and enhance the ability to determine the root cause. In an IT environment, ‘context’ is used to refer to the subset of information necessary to troubleshoot and diagnose an incident, or event. For some scenarios, the context may be the downstream dependencies after a high availability pair of firewalls goes offline, and in others, it may be the datastore in contention from multiple VMs.

Instrumenting Ruby on Rails with Prometheus

If you’re running a production application, you need metrics. There are great products out there that allow you to gain visibility into how your application is performing, give some nice graphs, and charge you for it. In the Rails community, this is commonly achieved by using NewRelic and Skylight. But for some of us, we achieve visibility by using Prometheus and Grafana that we build and host ourselves.

PyCon 2019 - Scout brings APM for Python

The 2019 edition of PyCon USA takes place over the next few days in Cleveland, Ohio. Scout is delighted to be there, sharing our APM tool with the Python community. Plus, we'll have great t-shirts and stickers for you, and we love to get geeky - one of our lead product engineers, plus two of our smart support engineers, are working the booth, ready to help you figure out your Python performance problems.

How Big Data and Log management work hand in hand

As Stephen Marsland once said, “if data had mass, the earth would be a black hole.” A vast part of the immense amount of structured and unstructured data that we call “Big Data” is nothing but machine-originated log data. Logs are generated for a lot of different purposes – from security to debugging and troubleshooting. They constitute a gold mine of useful information and actionable insights if properly stored, managed, and analyzed.

Infrastructure Ops in 2019 - How Legacy Technology Compounds Technical Debt (1/2)

In today’s software-driven economy, every organization faces an imperative to modernize the way they deliver software in order to adapt and enable the digital era — or perish. Digital transformation across industries is driving the need for IT to enable Cloud-Native applications. This has led enterprises to adopt Kubernetes as the most effective way to support cloud-native architectures and to modernize their applications and IT infrastructure.