Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

5 Tools for Managing a Network from a Remote Location

If you’re responsible for overseeing a network infrastructure, but you’re not always on-site to complete tasks and tackle issues in person, you need the right tools to empower you in your admin efforts. There’s a diverse array of resources out there which will enhance your network management capabilities, even when you’re working remotely. Here are just a few examples of must-have apps for you and your team in this context.

5 Essential Things Every FinOps Team Needs

Every time your company onboards a new client or releases a new product, your cloud bill will grow. In fact, it doesn’t take a large event at all to see a spike. Whenever your company changes direction even slightly, it can affect the bottom line. Add to that factors such as economic inflation and increased demand for high-speed, high-power cloud services, and it may seem like each month’s cloud bill is higher than it was before. If that’s the case for you, you’re not alone.

Platform Engineering 101: Origins, Goals, DevOps vs SRE & Best Practices

Platform engineering is the practice of automating infrastructure operations and enabling self-service infrastructure capabilities within collaborative Dev, Ops and QA teams. It involves designing and building platforms, technologies and workflows that enable self-service capabilities to automatically manage, provision and operate complex modern software architecture environments.

Harnessing the Power of AZCopy with Azure Storage

In today’s data-driven world, the ability to efficiently and effectively manage vast amounts of data is crucial. As businesses increasingly rely on cloud services to store and manage their data, tools that can streamline data transfer processes become indispensable. AZCopy is one such powerful tool that, when combined with Azure Storage, can greatly simplify data management tasks while maintaining optimal performance.

Making Kubernetes Dev-Friendly with Komodor & Okteto

Kubernetes has become the software world’s infrastructure, leading to significant changes in application architecture and packaging. Despite the introduction of new technologies and practices, they have not kept pace with the rapid growth of the K8s ecosystem. As a result, developers who once solely focused on coding are now spending hours on operations, leading to a longer feedback loop during development. They’re expected to have an understanding of Kubernetes in order to do their jobs, causing a significant drop in productivity and leading to a poor dev experience.

Anomaly Detection Using OSquery and Grafana

Detecting unauthorized usage and malicious applications in an instance involves analyzing OS and application logs. Doing this manually is a herculean effort because of the number of logs and the patterns one has to look for. Having a tool that can provide an aggregated view of your instance and the ability to analyze them easily can greatly reduce manual effort.

The Guide to SRE Principles

Site reliability engineering (SRE) is a discipline in which automated software systems are built to manage the development operations (DevOps) of a product or service. In other words, SRE automates the functions of an operations team via software systems. The main purpose of SRE is to encourage the deployment and proper maintenance of large-scale systems.

What do you see in the clouds?

Remember being a carefree kid? Laying down in a field on a warm spring day, gazing up at the sky, and imagining shapes in the clouds. Maybe you saw a lion or an elephant dancing across the sky. You felt grounded and safe, enjoying the breeze while listening to cheers from the football game in the distance. But could you see the rain forming on the horizon threatening to distort your animal shapes and send the football team running for cover?