Reproducing and testing distributed system failures with xk6-disruptor

Distributed systems, such as modern microservices-based applications, are highly scalable, but also highly complex. Dependencies and unexpected interactions between services are a common cause of incidents, and these incidents are also notoriously hard to test for. xk6-disruptor — an extension that adds fault injection capabilities to Grafana k6, the open source reliability and load testing tool — can help overcome these challenges.

Adding automation to monitoring: Azure troubleshooting simplified

The transition from traditional on-premises IT infrastructure to the public cloud has brought substantial relief to IT decision-makers and sysadmins. Since many organizations use Microsoft Windows as their preferred operating system, Microsoft Azure has naturally become their public cloud provider of choice, owing to its familiar GUI and Active Directory sync.

Canonical announces supported solution for Apache Spark on Kubernetes

Today, Canonical announced the release of Charmed Spark – an advanced solution for Apache Spark® that provides everything users need to run Apache Spark on Kubernetes. Apache Spark is suitable for use in diverse data processing applications including predictive analytics, data warehousing, machine learning data preparation and extract-transform-load (ETL).

What Is the True Cost of Downtime for Businesses?

The financial and operational ramifications of downtime have become increasingly pronounced over the past seven years. In 2014, Gartner estimated that downtime costs organizations an average of $300,000 per hour. Recent statistics, however, stand in sharp contrast to this six-figure estimate: 44% of organizations now put their hourly downtime costs at over $1 million, exclusive of any ensuing penalties or legal fees.

Gremlin for DORA compliance: how financial services firms build digital resilience and prove it

The Digital Operational Resilience Act (DORA) is set to significantly impact the financial sector. Coming into full effect in 2025, this EU regulation will set new standards for information and communications technology (ICT) risk management. In this landscape, how can financial firms ensure they’re not only compliant, but also operationally resilient?

4 ways innovative companies can navigate digital transformation

From the tight global labor market to social and political volatility, macroeconomic headwinds continue to hamper business growth. Navigating such a complex environment is anything but smooth. Your organization may be feeling pressure to deliver on initiatives to support talent, products and services, and business operations. Finding success in these areas requires digital transformation, and innovative companies are best positioned to thrive.

Service Blueprinting and Orchestration for Elevated Customer Experiences

Chances are, you’re familiar with the strategy of adding another “9” to service level agreements (SLAs), for example moving from 99.9% to 99.99% availability, to boost the experiences your organization provides. Of the many ways to do so, one stands out: service blueprinting. Banking executive Lynn Shostack first described the service blueprint in a 1984 Harvard Business Review article.