Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Spiceworks research reveals storage, network, desktop, and server virtualization trends

There's no doubt that over the last decade, virtualization technology drastically altered server rooms around the world. Server virtualization has improved IT efficiency while allowing for greater flexibility and redundancy, quickly jumping from a niche technology to one that's used by the vast majority of businesses. With server virtualization now ubiquitous, many businesses are now turning to other forms of virtualization to reap similar IT benefits.

How to Create an Azure Monitor Alert

Azure Monitor gathers performance metrics from your various Azure resources and allows you to explore those metrics through visualizations. It also allows you to manually create alerts that will notify you when a metric crosses a predefined threshold. In this blog post, we’ll cover how to create an alert in Azure Monitor.

Metrics Documentation with the metrics2docs Tool

Metrictank exposes many metrics to aid with operating the software in production. As the metrictank team (the primary on-call team for metrictank at Grafana Labs) grows and onboards new people, and more customers deploy the software on their premises, we need to solve a few problems regarding the metrics exposed by metrictank.

Sentry Integration Platform: Optimizing Incident Management with Amixr

It’s hard (if not impossible) to imagine production infrastructure without incidents. And service reliability can be highly dependent on how quickly and efficiently engineers are able to tackle these incidents. Reliability engineers are often faced with four questions... Sometimes the answers to these questions are surprising.

Turbocharge QA with Pre-Production Monitoring

Traditionally, Quality Assurance (QA) has been a very manual process. Our QA teams do an amazing job running through test plans, finding critical bugs, and logging reports. But it can be a lot of work to run through the tests again and again, dig into the errors to provide the contextual information developers need to fix bugs quickly, and prepare the reports your developers need to find and fix errors in the codebase.

Understanding common library implementation

As Falco grows in popularity, many new users get exposed to it on a daily basis. As should be expected, most of these users are not aware of what the architecture underneath Falco is. What components play a role in powering it? How do these components relate to each other? I thought it would be fun to write a blog post that answers these questions. And I thought it would be fun to write it with an historical perspective.

Troubleshooting On Steroids with Logz.io Log Patterns

It’s 3 AM and your phone is ringing. Rubbing your eyes, you take a look at the alert you just got from PagerDuty. A critical service has just gone offline. Angry customers are calling support. Your boss is on the phone, demanding the issue be resolved ASAP. You open up your log management tool only to be faced by 5 million log messages. What now?

Why chat-style messaging is crucial for developer productivity

For most organizations, software development is team-driven. Good communication—messaging—is crucial to working together as a team and, increasingly, for working effectively with the tools used by the team. In recent years, instant messaging has taken over not only social networks, but also the workplace. In many ways, a collaboration tool based on instant messaging is key to collaboration, knowledge transfer, and solid teamwork.