Operations | Monitoring | ITSM | DevOps | Cloud

DevOps

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

De-Risk Application Re-Factoring

While cloud adoption in general has been on the rise, the migration of business-critical legacy application workloads to the cloud has been relatively cautious. Apart from financial risk, the primary reason for this precaution is the inherent risk to business operations. Customers who venture into application refactoring have broadly two options...

Part II: Artifactory as a Caching Mechanism for Package Managers

In our previous blog post we discussed the challenges with relying on external servers for downloading pre-build tools such as Curl, CLI, wget, Maven, Gradle, npm and others. We discussed how they can sometimes cause stability issues, also called “Environmental Issues”, that will break the build.

Monitor email workflows with Datadog Browser Tests

Monitoring your application from end to end is important for ensuring that core functionalities work as designed. Datadog’s browser tests help you verify that key user workflows—such as signing up for a new account—are consistent across devices and locations. Within these workflows, email often plays a key role in onboarding users and providing customers with important information about their accounts and application activity, such as profile changes and order confirmations.

Kubernetes Master Class: How to Run Databases in Production on Kubernetes

Databases are business-critical entities and data loss leads to major operational risk scenarios in any organization. A single operational or architectural failure can lead to significant loss of time and resources. This class will provide a real-world view into the challenges of maintaining state and running databases in production and show solutions managed by Rancher.

Introducing Git Blame Support for GitHub Integration

At Rollbar, we care about reducing the time it takes developers to find and fix errors. This is why we’re making our integration with GitHub even stronger to provide more context around errors and reduce the mean time it takes to resolve them MTTR. Last year, we launched Code Context to show additional lines of code within each frame of the stack trace, reducing the back and forth between GitHub and Rollbar.

Monitor SNMP with Datadog

As your on-premise network infrastructure grows in size and complexity, monitoring thousands of devices becomes a challenge. Whether you’re monitoring firewalls in a branch office or the routing and switching fabric in your datacenter over which all customer transactions are performed, visibility into all points of your infrastructure is critical for network maintenance.

Monitor Vault metrics and logs

Hashicorp Vault is a tool for managing secrets—sensitive data such as passwords, certificates, and API keys. Vault allows you to encrypt your secrets, control access to them, and audit activity to see who has requested data from your Vault. Datadog already monitors the status of your Vault servers—for example, you can configure the Vault integration to automatically notify you if a Vault server is unexpectedly sealed, or if there is a leader change in your Vault cluster.

How We're (Ab)using Hashicorp's Consul at Grafana Labs

Hashicorp’s Consul service is a distributed, highly available system that provides a service mesh solution, including service discovery, configuration, and segmentation functionality. Cortex uses Consul’s KV store to share information that’s necessary for distributing data to its components. While writing to Consul has been useful at Grafana Labs, we’ve found that as we expanded the operations, problems started arising.