The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Reliability Best Practices: How Gremlin Uses Gremlin

Ensuring software availability is essential for any SaaS company—including Gremlin. To do that, our teams need to identify the reliability risks hiding in our systems. That’s why our development, platform, and SRE teams use Gremlin regularly to perform Chaos Engineering experiments, run reliability tests, and track the reliability of our systems against our standards. Along the way they’ve picked up a thing or two about how to find and fix reliability risks with Gremlin.

Behind the Scenes: Mattermost OpenOps AI Mindmeld | July 27, 2023

Tune in for a behind-the-scenes discussion on the advancement of Mattermost's AI tools and how they're being integrated into the team's current projects. The main topics covered include using AI to create tweets, the potential of using the tool to auto-generate text that resembles a user's tone, how to improve public awareness and involvement in OpenOps, and more.

How to monitor connector's API Connections in Logic Apps?

Consider a scenario where a Logic App communicates with SharePoint through API connections, known as connectors. Once configured, the connector communicates with Azure AD, retrieving a username and password and continuously refreshing the authentication token. When the Logic App calls the connector, the connector performs operations such as uploading files to SharePoint.

SMS Alerts for GitHub Actions - Civo Navigate NA 2023

Rishab Kumar, a Developer Evangelist at Twilio, shares their insights on implementing SMS alerts for GitHub Actions. Their talk focuses on using GitHub Actions for build and CI/CD tasks because of its efficient cost structure. However, Rishab points out a feature gap: the platform has no built-in support for SMS alerts or phone calls. To address this, they demonstrate how to configure SMS alerts similar to those in enterprise tooling such as OpsGenie or PagerDuty.

Mastering Kubernetes Pod Restarts with kubectl

Managing containerized applications efficiently in Kubernetes is essential for smooth deployments and optimal performance. Kubernetes provides powerful orchestration capabilities that enable seamless scaling and deployment of applications. However, real-world situations sometimes require restarting Pods, whether to apply configuration changes, recover from failures, or address misbehaving applications.

Monitoring Redis Clusters with Prometheus

This article will outline what Redis database monitoring is and how to set up a Redis database monitoring system with MetricFire. Then we’ll show what the final graphs and dashboards look like when displayed on Grafana. We will be using Prometheus and Grafana to power the monitoring, and we'll use a simulated Redis DB to generate the data for the Grafana dashboards.

SQL Server Terms Translated into PostgreSQL

The rise in popularity of open-source RDBMSs has encouraged many organizations to adopt PostgreSQL, but as a DBA or developer, exploring a new database platform can be challenging, no matter how experienced you are. SQL Server has many similarities to PostgreSQL, but there are several big differences too.