Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Practical guidance for getting started as a site reliability engineer

At the beginning of May, I joined incident.io as the first site reliability engineer (SRE), a very exciting but slightly daunting move. With only some high-level knowledge of what the company and its systems looked like prior to this point, it’s fair to say that I didn’t have much certainty in what exactly I’d be working on or how I’d deliver it.

Argo Rollouts at CircleCI: Progressive deployment for agile and efficient releases

At Circle, our traditional approach to Kubernetes (k8s) deployments likely looks familiar to many of you: Run the workflow, create the image, build the Helm chart and deliver it to k8s. At that point, k8s takes over with its rolling update. This method gets the job done, but we knew it wasn’t ideal. Limited support for canary releases and the need for time-consuming error monitoring and manual rollbacks added friction and risk to our release processes.

Using cloud unit metric costs to right size your AWS bill and improve productivity

Uncertainty continues to plague global economies, prompting many CFOs to optimize their bottom lines by re-examining large line-item costs, payrolls, and unit economics. Many IT leaders have been directed by their C-Suite to reduce or halt cloud spending, which represents roughly 30% of IT budgets.

How telcos are building carrier-grade infrastructure using open source

Service providers need cloud infrastructure everywhere, from modern 5G and 6G network functions running in the network core to sophisticated AI/ML jobs running on the edge. Given the sensitivity of those workloads to any interruptions, outages or performance degradations, the cloud infrastructure used by telecommunication companies needs to be fast, robust and ultra stable.

Using Tailscale for Authentication of Internal Tools

JWT is a popular way for authentication and authorization, especially for service to service communications. When it comes to internal tools, distribution and renewal of JWT can become a challenge. Our internal support systems use JWT to authenticate and authorize access and they are written in a few different languages and run on different hosting options.

Azure SQL database cost optimization to maximize savings

Azure SQL is a versatile and powerful database service, and it is an increasingly popular choice for storing and managing application data due to its scalability, high availability, security, and simplicity of integration. A common demand for cloud workloads is cost optimization. To maximize cloud savings, this article discusses Azure SQL Database Cost Optimization.

How to Use PowerShell to Automate Office 365 Installations

In the fast-paced and demanding world of IT, every tool that saves time and simplifies tasks is worth its weight in gold. Today, we're going to explore how PowerShell scripts can be utilized to automate the installation of Office 365, a critical operation that can save you countless hours in the long run. In fact, with a well-written script, you can manage installations across an entire network from your desk.

Cycle's New Interface, Part II: The Engineering Behind Cycle's New Portal

In our last installment, we covered the myriad of new UI changes added to Cycle’s portal. In this part, we walk through five of the tough engineering choices made when developing the new interface, discussing the alternatives that were considered, and shining a light on some of the technology our engineering team utilizes today.