5 ways to create a memorable customer experience

Mayank A, senior principal inbound product manager for Customer Workflows at ServiceNow, co-authored this blog. The way customers want to engage with organizations is radically changing. The pandemic and multiple shutdowns brought these changes to the forefront, forcing organizations to rethink their customer experience, both in person and online, and accelerate digital transformation projects to try to better align the customer journey with expectations.

SRE Principles: The 7 Fundamental Rules

In one of our previous articles, we discussed what an SRE is, what they do, and some of the common responsibilities that a typical SRE may have, like supporting operations, dealing with trouble tickets and incident response, and general system monitoring and observability. In this article, we will take a deeper dive into the various SRE principles and guidelines that a site reliability engineer practices in their role.

DevOps State of Mind Podcast Episode 1: Trust, tooling, and a no-blame culture with LogDNA

Tucker Callaway is the CEO of LogDNA. He has more than 20 years of experience in enterprise software with an emphasis on developer and DevOps tools. Tucker fosters a DevOps culture at LogDNA by tying technical projects to business outcomes, practicing extreme transparency, and empowering every person in the company to contribute.

3 Ways To Prevent Cyber Security Threats When Marketing Online

No matter what type of business you operate, cyberattacks can be destructive to your company. You may assume that your Information Technology (IT) team should handle every cybersecurity issue, but it doesn't have to be that way. Every department should take a proactive role in safeguarding the privacy of your business.

Generate span-based metrics to track historical trends in application performance

Tracing has become essential for monitoring today’s increasingly distributed architectures. But complex production applications produce an extremely high volume of traces, which are prohibitively expensive to store and nearly impossible to sift through in time-sensitive situations. Most traditional tracing solutions address these operational challenges by making sampling decisions before a request even begins its path through your system (i.e., head-based sampling).
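To make the idea of head-based sampling concrete, here is a minimal sketch in Python. It is not taken from any particular tracing product: the function names, the service names, and the hash-based bucketing are all assumptions chosen for illustration. The point it shows is that the keep-or-drop decision is made once, from the trace ID alone, before any span has been produced.

```python
import hashlib


def head_sample(trace_id: str, rate: float) -> bool:
    """Decide up front whether to keep a trace (hypothetical helper).

    The decision depends only on the trace ID, so every service that
    sees the same ID reaches the same answer without coordination.
    """
    digest = hashlib.sha256(trace_id.encode("utf-8")).digest()
    value = int.from_bytes(digest[:8], "big")  # integer in [0, 2**64)
    threshold = int(rate * 2**64)              # keep roughly `rate` of all IDs
    return value < threshold


def handle_request(trace_id: str, rate: float = 0.1) -> list:
    """Simulate one request passing through three made-up services."""
    sampled = head_sample(trace_id, rate)  # decided before any work happens
    spans = []
    for service in ("frontend", "checkout", "payments"):
        if sampled:  # unsampled requests record nothing at all
            spans.append(f"{service}:{trace_id}")
    return spans
```

Because the decision is made before the request runs, a trace that later turns out to be slow or to contain an error may already have been discarded, which is the trade-off the excerpt alludes to.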

Embedding Artificial Intelligence At Work: From Efficiency Gains To Leadership Expertise

With the increasing adoption of artificial intelligence (AI) applications in the workplace, the debate about the future of work, workers, and the workplace has intensified. The debate is polarised: job losses versus new-technology job creation, performance efficiency versus performance effectiveness, and liberating humans from drudgery versus being controlled by machines. While several other polarities are evident in this debate, the truth always lies somewhere in between.

Willy Tarreau on HAProxy at Its 20-Year Anniversary

Willy Tarreau, the founder of the HAProxy load balancer, still guides the project 20 years after its initial open-source release, often submitting code patches and writing long, meticulous replies on the community forum. Over the years, he has been joined by a cast of regular contributors as well as newcomers. This collaboration has kept the project evolving over time. In this interview, Willy describes his views on the success of the project and how it grew over the years.

IT Failures are Inevitable

As infrastructure stacks grow increasingly complex and involve an ever-growing number of services, system failures are becoming more and more common. Systems fail for a variety of reasons: software bugs, misconfiguration, interactions between services that cause unexpected behavior, network outages, and, of course, those rare occasions when natural events render data centers inoperative.

Get planet-scale monitoring with Managed Service for Prometheus

Prometheus, the de facto standard for Kubernetes monitoring, works well for many basic deployments, but managing Prometheus infrastructure can become challenging at scale. As Kubernetes deployments continue to play a bigger role in enterprise IT, scaling Prometheus for a large number of metrics across a global footprint has become a pressing need for many organizations.