Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

How Detected Risks helps you find reliability risks in minutes-without running any tests

This video showcases Gremlin's Detected Risks feature. Detected risks are high-priority reliability concerns that Gremlin automatically identifies in an environment. These include misconfigurations, bad default values, and reliability anti-patterns. Gremlin prioritizes these risks based on severity and impact, giving instantaneous feedback on risks and action items to improve the reliability and stability of each service.

Four Pillars of a Best-in-Class Reliability Program

Reliability impacts every organization, whether you plan for it or not. Leading companies take matters into their own hands and get ahead of incidents by building reliability programs. But since many of these programs are still nascent, how do you know what good looks like? Of course, the right tools and technology that can enable your team to uncover reliability risks before they impact users play an important role. But improving reliability goes beyond technology.

Enhancing Your Heroku Postgres Performance: A Guide to Effective Monitoring with Hosted Graphite

When it comes to managing your database, monitoring is crucial for maintaining data integrity, optimizing performance, and ensuring efficient resource allocation. In today's fast-paced technological landscape, having real-time insights into your database's health is more important than ever. This is where Heroku Postgres and Hosted Graphite come into the picture.

Hosted Graphite and Printers: Boosting Efficiency and Performance

Printers play a crucial role in various industries, helping businesses efficiently manage their document workflows. However, ensuring optimal printer performance and minimizing downtime can be a challenge. This is where hosted graphite comes into the picture. Hosted graphite is a powerful monitoring tool that allows businesses to graph metrics and gain valuable insights into their printer systems.

Ditch Nagios Errors for A Streamlined Alternative: "Return code of x is out of bounds"

In the realm of IT infrastructure monitoring, Nagios has long been a popular choice due to its robust feature set and flexibility. However, even reliable systems can encounter issues, and one recurring problem that Nagios users might encounter is the "Return code of x is out of bounds" error. In this blog post, we'll dive into the details of this error, what causes it, and how it can impact your monitoring efforts.

Progressive delivery on Kubernetes with CircleCI and Argo Rollouts

Containers and microservices have revolutionized the way applications are deployed on the cloud. Since its launch in 2014, Kubernetes has become a de-facto standard as a container orchestration tool. With traditional approaches of deploying applications in production, developers often release updates or new features all at once, which can lead to issues if there are bugs or other issues that weren’t caught during testing.

What is a Colocation Data Center?

The colocation data center is a type of service where organizations can rent out space in a data center facility to house their IT infrastructure. These facilities provide the power, cooling, and network connectivity that companies require to operate their servers, storage devices, and applications. Colocation services allow companies to reduce costs and avoid the hassle of building and maintaining their data center facilities.

Data at the edge: Meet modern data processing demands with edge computing

We’ve all experienced latency in some form. It’s unfortunately something we’re all too familiar with. We’ve even gone so far as to accept it as a regular albeit undesirable part of the user experience. Yet despite various steps taken over the years, it still exists and is as disruptive as ever.