DevOps

The latest news and information on DevOps, CI/CD, automation, and related technologies.

How SREs Can Embrace Resilience During Crises

Blameless recently had the privilege of hosting SRE leaders Liz Fong-Jones, Dave Rensin, and Alex Hidalgo to discuss how SREs can embrace resilience during the pandemic, and how the principles of SRE intersect with global trends. The transcript below has been lightly edited. If you’re interested in watching the full panel, you can do so here.

Fleet Management for Kubernetes is Here

Today I’m excited to announce Fleet, a new open source project from the team at Rancher focused on managing fleets of Kubernetes clusters. Ever since Rancher 1.0 shipped in 2016, Rancher has provided a central control plane for managing multiple clusters. As pioneers of Kubernetes multi-cluster management, we have seen firsthand how users have consistently increased the number of clusters under management.

Announcing Hosted Rancher with Rancher 2.4

We’ve heard from many of our customers and prospects that they love Rancher but just don’t have the staff and expertise to operate the platform. Figuring out the compute, storage, and networking architecture can be a challenge. Performing upgrades, backups, and troubleshooting can also be time-consuming. Monitoring the environment and knowing when to scale up or down, horizontally or vertically, is yet another thing to worry about.

Reserved Instances and Savings Plans on Your Terms with New Configurable Eco Strategies

Because a “one size fits all” approach to AWS reservations doesn’t work for most AWS customers, Eco by Spot provides intelligent lifecycle automation for reserved instances and savings plans, with a focus on creating and executing customized reserved capacity portfolios. These well-balanced portfolios diversify customer commitments across various reserved instances and savings plans, ensuring the best blend of reserved pricing coverage, savings, and optimal term length (e.g.

HAProxyConf 2019 - Building a Service Mesh at Criteo with Consul and HAProxy by Pierre Souchay

At Criteo, we have been working on building a tight integration between HashiCorp Consul and HAProxy. In this talk, we will explain how we provision our HAProxy instances dynamically using Consul Connect, a new service mesh technology that allows HAProxy to talk to its peers from machine-to-machine without a traditional load balancer. We will detail how we are able to create a service with DNS, add load balancing, and configure SSL certificates in mere seconds. Since Criteo is working actively on enabling HAProxy with Consul Connect, we will explain the challenges of scaling Service Mesh architectures for large infrastructures.

Episode 10: Installing Redis from Ansible Galaxy

A pre-built playbook from Ansible Galaxy lets us easily install Redis. (Even we don't re-invent the wheel every time.) The Request Metrics application will use Redis as its main data store. We need to install Redis on our servers to find out if this is a good plan. Ansible provides a repeatable way of doing this configuration work.

The art of shipping and monitoring software with speed and confidence

Software teams are under increasing pressure to ship code faster than ever before, but without the right workflow and tools in place, this can introduce unnecessary risk and headaches. We wanted to share how to configure deployments, identify issues, and track performance gains using tools and processes that get the best results and enable you to ship software with speed and confidence. The tools we will be using in today’s example include Jenkins, Octopus, and Raygun.