Latest News

Product Release Notes July 2024

For years, CloudZero has been a leader in the cloud cost allocation space, a statement supported by the fact that we remain the only cloud cost management platform to have received SOC1 Type 2 certification for our extreme accuracy when it comes to allocation and financial reporting capabilities. To quote our co-founder in the 2022 blog post listed above, “Quite simply, we organize cloud spend better than anyone else in the world.” Even so, we’re always looking to iterate and improve.

The Leading Network Device Monitoring Tools

Ensuring the security of your network infrastructure is critical for all organizations, and this requires going beyond traditional network monitoring to monitor individual network devices such as routers and switches. Whilst network monitoring includes the monitoring of devices, dedicated network device monitoring is a more thorough process for guaranteeing the health and performance of your organization's network devices.

Prometheus vs Grafana - A Comparative Guide to Key Differences

Prometheus and Grafana are both great observability solutions. Although they share some overlapping features, they have different priorities. Prometheus focuses on data acquisition, allowing users to select and aggregate time series data in real time. Grafana, on the other hand, specializes in data visualization. Together, they form a powerful monitoring system. But how well do these tools perform individually?

HetrixTools and ilert: Augment your Uptime and Blacklist Monitoring with Powerful Incident Management

ilert users can now seamlessly connect ilert with HetrixTools' monitoring capabilities. This streamlined integration ensures smooth IT operations with minimal downtime and faster issue resolution.

8 Key Insights for My Clients from the OpsRamp State of Observability Report

The OpsRamp State of Observability 2024 report not only presents fascinating data from a strong sample of IT leaders, but also offers many highly actionable findings. As an independent analyst and advisor, I appreciate how this report lays out a powerful action plan for any CIO, CTO, or other IT leader who has not yet adopted or achieved success with observability.

Charmed Kubeflow 1.9 enters general availability with new support for GenAI

Kubeflow is a fully open source MLOps platform, part of the Cloud Native Computing Foundation (CNCF) as an incubation project. Canonical is one of the project’s official distributors, providing security maintenance of the container images, tight integration with the wider AI/ML ecosystem, and enterprise support and managed services. The latest release offers new features related to GenAI, enabling organisations to run any new initiative in production.

Unlocking Business Insights with Telemetry Pipelines

Imagine running a large company where data-driven decisions give you a competitive edge. You use a lot of business intelligence tools that tap into vast amounts of data, such as sales figures, inventories, and expenses. This analysis tells you how your company is performing. However, it does not reveal how your "company infrastructure" is performing. This crucial information comes from your systems in the form of telemetry data, such as logs and events.

GitOps Secrets with Argo CD, HashiCorp Vault and the External Secrets Operator

Teams adopting GitOps often ask how to use secrets with Argo CD. The official Argo CD page about secrets is unopinionated by design and simply lists a set of projects that can help you with secrets. We’ve seen several approaches to secret management. These include sealed secrets, the Argo CD Vault plugin, and the External Secrets Operator. In this post, we showcase the External Secrets Operator and HashiCorp Vault and focus on two important aspects.

Transforming IT Operations at a Large Public Sector Bank with HEAL

In today’s digital age, IT organizations face numerous challenges that can hinder their ability to provide seamless services. Common pain points include frequent outages, unexplained end-user experiences, negative brand impact, unmet business demands, and complex application environments. These issues are exacerbated by technology silos, an overload of alerts, inaccurate and prolonged root cause analyses, and inadequate SRE/DevOps tools.

Kubernetes Security: The Essential Role of Guardrails

In today’s rapidly evolving technology landscape, Kubernetes has become essential for deploying and managing containerized applications. As organizations increasingly rely on Kubernetes to scale their operations, the need for robust guardrails becomes paramount. In this context, guardrails refer to the policies and mechanisms that ensure the safe and efficient operation of Kubernetes environments.