Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Server Monitoring 101 Guider Tools Best Practices

As IT landscape is changing from in-house systems and servers to cloud ones, server monitoring has become an essential task. Without server monitoring, it’s not possible to detect issues in memory, operating systems, storage drives or applications. Server monitoring is vital to know what went wrong and what was the cause behind the problem.

Demystifying DevOps and SRE

How different are DevOps and SRE? Are they related to each other? In this blog, James Samuel sheds light on the similarities & differences between SRE & DevOps followed by the possible ways to structure an SRE team in your organization. One of the terms that people often find confusing is SRE and DevOps. People often ask, should I hire a DevOps Engineer or a Site Reliability Engineer? What is the difference between SRE and DevOps and which one do I need? In this post, I attempt to shed some light.

Getting Started with Docker: A Tutorial

Docker containers have taken the software industry by storm. Ever since its launch in 2013, Docker’s usage and popularity have grown at a rapid pace. Docker has saved organizations from the challenges of managing dependency and version conflicts across multiple environments by providing a portable, secure, and (most importantly) reliable container technology for shipping applications.

Integrating Speedscale with Jenkins

Minimizing and automating the path from development and production is necessary in order to stay competitive and keep customers happy. As engineering teams strive to solve this by quickly and efficiently rolling out new features, updates, and bug fixes, continuous integration and deployment (CI/CD) has come to be regarded as an industry best practice. One of the most popular CI/CD solutions is Jenkins, an open-source job execution system.

Automate EKS Node Rotation for AMI Releases

In the daily life of a Site Reliability Engineer, the main goal is to reduce all the work we call toil. But what is toil? Toil is the kind of work tied to running a production service that tends to be manual, repetitive, automatable, tactical, devoid of enduring value, and scales linearly as a service grows. This blog post describes our journey to automate our nodes rotation process when we have a new AMI release and the open source tools we built on this.