Latest Posts

On-call compensation models

Aug 3, 2020 By Vishwa Krishnakumar In Zenduty

Providing customers with a world-class and seamless user experience is critical for the success of any business. It is therefore important that you have a robust on-call strategy that optimizes the availability of the right subject matter experts, on-call engineers, and support engineers to resolve critical, user-impacting incidents as soon as possible.

Read Post

Zenduty

Read more about On-call compensation models

It's a known issue - How Product Managers should deal with issue or feature related enquiries or feedback

Aug 2, 2020 By Vishwa Krishnakumar In Zenduty

I often hear folks in my network being triggered by interactions with product managers within their companies whenever they follow up on certain product-related issues. The triggering phrase invariably is “It’s a known issue”. And they often wonder, well if it’s a known issue, why on earth isn’t anything done about it?

Read Post

Zenduty

Read more about It's a known issue - How Product Managers should deal with issue or feature related enquiries or feedback

How to build a customer advisory board

Aug 1, 2020 By Vishwa Krishnakumar In Zenduty

Regardless of where you are in your product journey, it is impreative that you constitute a customer advisory board who can share perspectives into their business challenges so that you can gain insights on how to shape our road map, develop new features, formulate your vision and give you constant feedback on your product. So, how many customers should to include in a customer advisory board? Should you target higher level stakeholder or individual users?

Read Post

Zenduty

Read more about How to build a customer advisory board

Defining your Sev-1s

Jul 27, 2020 By Vishwa Krishnakumar In Zenduty

One of the primary things you need to figure out whenever your team is formulating your incident management process is describing in words what a Sev0(your highest incident priority) looks like. “Website doesn’t work” is certainly no enough. “Website is up but a key resource (ie CSS file) is missing, rendering the website unusable” is still not enough. “A single page on the website is 404’ing” is not a major but could be a minor incident.

Read Post

Zenduty

Read more about Defining your Sev-1s

Sending Nagios alerts to Microsoft Teams and rapid incident response with Zenduty

Jul 14, 2020 By Vishwa Krishnakumar In Zenduty

Nagios is one of the most widely used open-source network monitoring software used by thousands of NOC teams globally to monitor the health of a vast array of their hosts and services. Most teams rely on Emails as their primary Nagios alert notification channel, which may take a few minutes to respond to by your NOC team.

Read Post

Zenduty

Read more about Sending Nagios alerts to Microsoft Teams and rapid incident response with Zenduty

Product Metrics for Discovery Activities

Jul 12, 2020 By Ankur Rawal In Zenduty

Most companies today compile a set of metrics for their product teams to regularly report on to the company management. This includes a variety of product performance metrics(usage frequency, churn rate, NPS, etc.). But a lot of them struggle a bit with product discovery activities. So how do your track discovery?

Read Post

Zenduty

Read more about Product Metrics for Discovery Activities

Two tips to incorporate the voice of the customer in your story grooming/sprint planning

Jul 9, 2020 By Vishwa Krishnakumar In Zenduty

Constantly talking to your users about their business problems and incorporating those solutions is key to the success off your product and company. There are many ways to incorporate the voice of your users into your product planning. Formulate an experience brief that’s less than 2 pages, or a 5-minute clip of user interviews. The best is to have devs in the interviews and discovery activities with you as well.

Read Post

Zenduty

Read more about Two tips to incorporate the voice of the customer in your story grooming/sprint planning

Our favorite(top?) SRE talks

Jul 2, 2020 By Ankur Rawal In Zenduty

Over the years there have been a bunch of great talks on site reliability and incident response. Below are a few we thought stood out(in no specific order) and is defintely worth a peek.

Read Post

Zenduty

Read more about Our favorite(top?) SRE talks

Learning from Incidents - what to do after you write a postmortem?

Jun 29, 2020 By Vishwa Krishnakumar In Zenduty

For folks who’ve made post mortems more meaningful at your company, it is important that you spread that learning around. A lot of companies have teams that do postmortems really well and a lot of engineering managers(EMs) want to spread it organically, but writing and following postmortems is the kind of practice that a lot of devs really just don’t think about or care about and it can get extremely hard to force this practice, especially without support from upper management.

Read Post

Zenduty

Read more about Learning from Incidents - what to do after you write a postmortem?

Creating Histograms in Grafana from Prometheus buckets

Jun 7, 2020 By Ankur Rawal In Zenduty

In the following example, we will be creating a histogram in Grafana. Our datasource is Prometheus’s cumulative histogram. I have captured the metrics using micrometer’s distribution summary.

Read Post

Zenduty

Read more about Creating Histograms in Grafana from Prometheus buckets

Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

On-call compensation models

It's a known issue - How Product Managers should deal with issue or feature related enquiries or feedback

How to build a customer advisory board

Defining your Sev-1s

Sending Nagios alerts to Microsoft Teams and rapid incident response with Zenduty

Product Metrics for Discovery Activities

Two tips to incorporate the voice of the customer in your story grooming/sprint planning

Our favorite(top?) SRE talks

Learning from Incidents - what to do after you write a postmortem?

Creating Histograms in Grafana from Prometheus buckets

Monthly Archive

Follow Us