Operations | Monitoring | ITSM | DevOps | Cloud

Why 'owning Services' is critical for effective Incident Response

A famous quote goes: ‘For every minute spent organizing, an hour is earned.’ In the world of incident response, at least, nothing could be more apt. Digital infrastructure today is made up of many services, and an outage can result from one impacted service or several. So it's essential to have a catalog of all the services, along with the point of contact (service owner) responsible for maintaining each one.

Getting Started with Python and Geo-Temporal Analysis

This article was originally published in The New Stack and is reposted here with permission. Working with geo-temporal data can be difficult. In addition to the challenges often associated with time-series analysis, like large volumes of data that you want real-time access to, working with latitude and longitude often involves trigonometry because you have to account for the curvature of the Earth. That’s computationally expensive. It can drive costs up and slow down programs.
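To make the trigonometry concrete, here is a minimal sketch of the haversine formula, one common way to compute great-circle distance between two latitude/longitude points. It is not taken from the original article: the function name `haversine_km` and the constant `EARTH_RADIUS_KM` are illustrative, and the Earth is assumed to be a perfect sphere with a mean radius of 6371 km.

```python
import math

# Assumption: spherical Earth with a mean radius of 6371 km.
EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two points given in degrees."""
    # Convert latitudes and the coordinate differences to radians.
    phi1 = math.radians(lat1)
    phi2 = math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)

    # Haversine of the central angle between the two points.
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)

    # Central angle times radius gives the arc length.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))
```

Each call costs several trigonometric evaluations, which is why running it across millions of time-stamped points adds up. As a quick check, `haversine_km(0, 0, 0, 90)` covers a quarter of the equator, roughly 10,007.5 km under the spherical assumption.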

What can Elastic Synthetics tell us about Kibana Dashboards?

I like to leverage our technologies to ensure our products have a pleasant user experience. Elastic Synthetics works out of the box: you can configure it directly through your Elastic Cloud deployment without installing anything! It also runs from multiple locations across the globe that you can choose from. Ever wondered how fast your web service is when accessed from Japan, Germany, or the eastern U.S.? Now you can find out by simply clicking a checkbox.

S2E3: How to build life cycles and workflows with a visual builder in ServiceDeskPlus Cloud

Learn how organizations can use visual workflow builders to systematize various ITSM practices, including incident and problem management; service request fulfillment; and change, project, release, and asset management. Design graphical life cycles and workflows on an easy-to-use canvas to guide IT teams, minimize errors, and perform custom actions like notifying users, updating fields, executing custom functions and more.

On Building a Platform Team

It may surprise you to hear that Honeycomb doesn’t currently have a platform team. We have a platform org, and my title is Director of Platform Engineering. We have engineers doing platform work. And we even have an SRE team and a core services team. But a platform team? Nope. I’ve been thinking about what it might mean to build a platform team up from scratch (a situation some of you may also be in), and it led me to ask some crucial questions. What should such a team own?

What is Kafka?

Apache Kafka is a popular open source platform for streaming, storing, and processing high volumes of data. In this video, we break down how Kafka works and how it’s able to provide you with a reliable, scalable, and highly performant service for managing events. We also touch on some key resources for effectively monitoring your Kafka deployments via Datadog.

ITSM Statistics: The 2023 State of ITSM Process Adoption

Whether you know it or not, it’s quite possible that your organization has an IT Service Management approach. After all, every business wants to have the right capabilities to deliver value to its customers. But how likely is it, really? What’s the state of ITSM process adoption? That’s what we’re about to find out! In H1 2022, AXELOS created an IT Service Management (ITSM) Benchmarking Report for its myAXELOS subscribers.

Confidential computing in public clouds: isolation and remote attestation explained

In the first part of this blog series, we discussed the run-time (in)security challenge, which can leave your code and data vulnerable to attacks by both the privileged system software of the public cloud infrastructure and its administrators. We also introduced the concept of trusted execution environments and confidential computing (CC) as a paradigm to address this challenge.