Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Grubhub and JPMC Shift Reliability Testing Left at Chaos Conf 2020

Get started with Gremlin's Chaos Engineering tools to safely, securely, and simply inject failure into your systems to find weaknesses before they cause customer-facing issues. Gremlin’s Chaos Conf is always an exciting event, bringing together leaders at the forefront of Chaos Engineering practices. This year was no exception, moving beyond defining Chaos Engineering to more advanced adoption and best practices discussions.

Announcing HAProxy 2.3

HAProxy 2.3 adds exciting features such as forwarding, prioritizing,and translating of messages sent over the Syslog Protocol on both UDP and TCP, Stats Contexts, SSL/TLS enhancements, an improved cache, and changes in the connection layer that lay the foundation for support for HTTP/3 / QUIC. This release was truly a community effort and could not have been made possible without all of the hard work from everyone involved in active discussions on the mailing list and the HAProxy project GitHub.

How to Manage Ruby Memory Usage

Even the most prominent and reliable frameworks are notorious for burning out resources if not configured perfectly. In this post, we are about to take a look at how Ruby, one of the most prominent programming languages and an awesome web application alternative when combined with Rails, manages memory, and how you can make it perform even better. Ruby is a scripting language built for use in web applications and similar stuff.

Incident Management in Mattermost: Creating an Incident Playbook

The idea behind Incident Management is to be ready. Not ready for anything, as that can be an unrealistic expectation, but ready to respond when the unexpected inevitably happens. DevOps teams often create incident playbooks in order to ensure they are as ready as possible to handle situations as they arise. Luckily, there is some amazing documentation on how to do just that from our friends at PagerDuty.

Service Requests Go Mobile

The ability to deliver IT services in an effective and user-friendly manner is a key to success in the IT Service Management world. Alloy Navigator delivers a great experience to employees and customers by automating a broad range of standard service requests, including employee onboarding, password resets, provisioning remote access, and hardware requests. Now with the latest update for our mobile app, service requests can be managed using phones or tablets.

Level Up Your IT Asset Management Strategy

Whether you’re supporting remote teams or working in a hybrid environment, IT asset management (ITAM) is a critical practice to the business and your service management strategy. If you’ve been following our ITAM series and are delivering services to employees, you most likely have an understanding of the ITAM essentials and questions to keep in mind when developing a strategy. Now in this blog, I’ll guide you on the journey to take your strategy to the next level.

Location Matters when Monitoring Digital Experience

There’s a saying in real estate that the three most important things for a property are: “location, location, location.” At Catchpoint, we believe the same is true for digital experience monitoring. Location matters. That’s why we’ve built the largest, most diverse global network of monitoring points available, with more than 800 monitoring nodes in over 230 cities and 280 providers around the world.

Escalating Prometheus alerts to SMS/Phone/Slack/Microsoft-Teams via AlertManager and Zenduty

Prometheus is by far, one of the most popular open-source monitoring tools used by millions of engineering teams globally with a robust community and continued adoption and evolution. We at Zenduty shipped our Prometheus integration integration a while back and we’re happy to report that the adoption of our Prometheus integration has been absolutely through the roof!

Improve Customer Satisfaction With Customer Service Incident Commanders

The global pandemic has drastically accelerated digital transformation initiatives and forced organizations to reimagine customer service by having them take on the incident commander role in managing and responding to customer issues and engaging with customers. In addition to prioritizing digital services, many businesses have migrated to the cloud to increase business agility, develop and deliver new features faster, and meet the growing demands of end users.

New CloudZero CRO Steve Lewis Shares Why Cloud Cost Management Is Ripe for Disruption

This month, I made the decision to join CloudZero as their new chief revenue officer — a choice I’m incredibly excited about. In the past week, a number of colleagues have asked me to share why I joined the company. After answering the question a few times, I decided to take a minute to collect my thoughts and explain why this incredible company is the next step in my career.