
DevOps

The latest news and information on DevOps, CI/CD, automation, and related technologies.

How Cloud-Native Artifact Management Can Facilitate the Move to Platform Engineering

In this session, we will explore why prioritizing artifact management is essential to advancing toward self-service internal developer platforms (IDPs), and how adopting Cloudsmith addresses common challenges such as high infrastructure costs, managing security vulnerabilities, and maintaining productivity. Attendees will gain insights into best practices for implementing an artifact repository at the center of their CI/CD pipeline.

Developer Self-Service: Overview & Best Practices

According to the 2024 State of Production Readiness report, 54% of engineering leaders said poor production readiness standards contributed to a decrease in developer productivity. But how? If software falls out of alignment with best practices, including those designed to maintain the health, observability, and security of software, developers wind up spending more time finding information and fixing issues than building new value.

A data lake on your cloud with Spark, Kubernetes and OpenStack

A data lake is a very large-scale data processing paradigm that disrupts the conventional data warehousing model. Data lakes can offer greater flexibility whilst retaining the benefits and efficiency of centralised data governance. With the Canonical OpenStack private cloud platform, Kubernetes and Charmed Spark solutions, your data lake architecture can also benefit from extended flexibility and scalability whilst remaining cost-effective to operate.

3 Key Strategies for End-to-End DevOps Automation

DevOps automation is essential for speeding up delivery, minimizing errors, and boosting team collaboration. But selecting the right approach can make or break your organization’s agility and scalability. Let's break down three key approaches—DIY with Infrastructure-as-Code (IaC), Platform-as-a-Service (PaaS), and DevOps Automation Platforms—so you can identify the best strategy for your needs.

Network Management: Definition, Best Practices, and Tools

The efficiency of your IT infrastructure can make or break your business, especially in the IT world. At the core of this infrastructure lies Network Management (NM), a critical process that ensures your network operates smoothly, securely, and efficiently. Whether you're managing a small office network or a sprawling enterprise system, understanding NM is key to maintaining optimal performance and security. But what exactly does Network Management involve? And why is it so important?

Building On-call: Our observability strategy

At incident.io, we run an on-call product. Our customers need to be sure that when their systems go wrong, we’ll tell them about it—high availability is a core requirement for us. To achieve the level of reliability that’s essential to our customers, excellent observability (o11y) is one of the most important tools in our belt. When done right, observability improves your product experience from two angles.

How to monitor your Kubernetes metrics server

In this article, we will examine the Kubernetes metrics server and its uses. We will also learn how to set one up and use it to monitor Kubernetes metrics. Finally, we will explore using Hosted Graphite by MetricFire to monitor Kubernetes metrics. To easily get started with monitoring Kubernetes clusters, check out our tutorial on using the Telegraf agent as a DaemonSet to forward node/pod metrics to a data source and use that data to create custom dashboards and alerts.