Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Monitor Citrix Hypervisor performance with Datadog

Citrix Hypervisor, formerly known as Citrix XenServer, is a type 1 hypervisor that enables organizations to run and manage an entire virtual infrastructure—including VMs, virtual desktops, and virtual applications. Organizations can also use Citrix Hypervisor to optionally host these virtual workloads with higher availability and flexibility by implementing managed server groups called resource pools.

Sponsored Post

Classifying Severity Levels for Your Organization

Major outages are bound to occur in even the most well-maintained infrastructure and systems. Being able to quickly classify the severity level also allows your on-call team to respond more effectively. Imagine a scenario where your on-call team is getting critical alerts every 15 minutes, user complaints are piling up on social media, and since your platform is inoperative revenue losses are mounting every minute. How do you go about getting your application back on track? This is where understanding incident severity and priority can be invaluable. In this blog we look at severity levels and how they can improve your incident response process.

Qovery x Spayr - Managing Multiple Environments Running on Kubernetes Clusters

Qovery makes it easy to deploy on-demand environments on AWS. More than 20,000 DevOps and developers use Qovery to deploy their production, staging, and development environments on AWS in a few seconds. Join Albane (Product Marketing at Qovery) and Pierre (CTO and Co-founder at Spayr) to talk about how Spayr is managing multiple environments running on Kubernetes clusters on Qovery, and is empowering his team, from junior to senior developers, to create a new environment and test new ideas independently.

Site Reliability Engineering (SRE) explained

Google has introduced so many innovations that it’d be impossible to list them all. And we’re not just talking about the obvious things like search engine algorithms or nearly-ubiquitous programs and apps (Google Maps, Docs, Gmail) — not even self-driving cars. Today, we’re going to talk about one such innovation: Site Reliability Engineering. In a nutshell, SRE it’s a practical framework for software development that improves on even giants like DevOps. Wait, what?

Multipass 1.10 brings new instance modification capabilities

Developers rejoice! The Multipass team has been listening to your feedback, and we are excited to announce that the latest update to Multipass contains one of our most requested features – instance modification. For those who are just discovering Multipass, it’s software designed to make working with virtual machines as painless as possible. It has an intuitive command line interface, and abstracts away the hard work of configuring, launching, modifying and destroying VMs.

Managing the Looker ecosystem at scale with SRE and DevOps practices

Many organizations struggle to create data-driven cultures where each employee is empowered to make decisions based on data. This is especially true for enterprises with a variety of systems and tools in use across different teams. If you are a leader, manager, or executive focused on how your team can leverage Google's SRE practices or wider DevOps practices, definitely you are in the right place!

How to Change the Puppetdb Port in Puppet Enterprise

Occasionally in Puppet Enterprise, you may need to change the port PuppetDB consumes, for instance, if another service requires port 8081. While we in Puppet Support recommend that you change the port for the other service. If you can’t do that, Follow this Video and the attached Knowledge base article for a guide on changing the port.

Setting up Runbooks in Squadcast | SRE Best Practices | Squadcast

A Runbook is a compilation of routine procedures and operations that are documented for reference while working on a critical incident. Sometimes, it can also be referred to as a Playbook. From this video, learn to create, attach, reference and mark progress for incident resolution using Runbooks.