Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Does Step Function's new TestState API make end-to-end tests obsolete?

Step Function added support for testing individual states . Which lets you execute individual states with the following: And returns the following: With the TestState API, you can thoroughly test every state and achieve close to 100% coverage of a state machine. So, does this eliminate the need for Step Functions Local ? Can we do away with end-to-end tests as well? If not, where should this new API fit into your workflow, and how should you use it?

Marking deployments and more in Redgate Monitor

SQL Monitor is an essential tool for DBA teams worldwide, providing real-time monitoring of SQL Server and PostgreSQL performance. With SQL Monitor, you can easily track deployments, errors, and other events on the timeline. This feature, called annotations, allows you to quickly identify the root cause of performance issues and take corrective action. SQL Monitor’s timeline is a powerful tool that helps you stay on top of your database performance and keep your systems running smoothly.

Mastering IT Alerting: A Short Guide for DevOps Engineers

$575 million was the cost of a huge IT incident that hit Equifax, one of the largest credit reporting agencies in the U.S. In September 2017, Equifax announced a data breach that impacted approximately 147 million consumers. The breach occurred due to a vulnerability in the Apache Struts web application framework, which Equifax failed to patch in time. This vulnerability allowed hackers to access the company's systems and exfiltrate sensitive data. ‍

Debugging Go compiler performance in a large codebase

As we’ve talked about before, our app is a monolith: all our backend code lives together and gets compiled into a single binary. One of the reasons I prefer monolithic architectures is that they make it much easier to focus on shipping features without having to spend much time thinking about where code should live and how to get all the data you need together quickly. However, I’m not going to claim there aren’t disadvantages too. One of those is compile times.

Observability and Telecommunications Network Management [Part 1]

The border between the management of telecommunications networks and the services that they support and the management of IT infrastructures and the applications that they support has always been a porous one. One might say that they are like two dialects of the same language rather than different languages. Nonetheless, these areas, whether characterised by technology or practice, are different and have, for the most part, been served by different vendors and products.

Managing software in complex network environments: the Snap Store Proxy

As enterprises grapple with the evolving landscape of security threats, the need to safeguard internal networks from the broader internet is increasingly important. In environments with restricted internet access, it can be difficult to manage software updates in an easy, reliable way. When managing devices in the field, change management and compliance policies can introduce even more complexity to the update process. You can solve these challenges using snaps and the Snap Store Proxy.

What is Infrastructure as Code (IaC)?

Infrastructure as code (IaC) is the act of writing infrastructure configurations as code so they can be understood, repeated, and enforced with less manual effort. IaC is also a powerful way to convert tribal knowledge into technical knowledge. It’s a far-reaching and essential part of managing infrastructure at scale, with benefits that have expanded to platform engineering, security and compliance, network administration, and so much more.

5 Ways CloudZero Found Savings Using Its Own Platform

Naturally, our own platform forms the backbone of our cloud cost savings strategy. And before releasing any new feature, we try it out ourselves to see how well it helps us manage our own SaaS costs. If it works well for us, that means there’s already one satisfied user in the world — and there will likely be more!

Enhancing Data Center Efficiency and Sustainability with Power Capacity Effectiveness

PCE is a performance metric that evaluates the effective utilization of power in data centers. It measures the ratio of IT equipment power to the total power consumed, encompassing all aspects of the facility’s operations, including cooling and lighting. Unlike traditional metrics, PCE provides a comprehensive assessment of how power is used within data centers, aiming to optimize the actual power capacity available.