Latest News

Five worthy reads: Agile, the perfect ingredient for your organization's operations management

Five worthy reads is a regular column on five noteworthy items we’ve discovered while researching trending and timeless topics. This week, we explore how the agile philosophy can help organizations manage their operations.

Container Orchestration in 2019

How are you deploying your applications in 2019? Are you using containers yet? According to recent research, over 80% of you are. If you are in this group, were you initially sold on the idea of containers, only to find that in practice the complexity involved makes the trade-off difficult to justify? The community is aware of this and has come up with a remedy to ease the pain: container orchestration.

Monitor CoreDNS with Datadog

CoreDNS is a DNS server that can also provide service discovery for microservice-based applications. It’s the default DNS server in Kubernetes, providing name resolution and service discovery for the services operating in the cluster. CoreDNS is easily customizable, so you can define how it should act on each request beyond simply executing a DNS lookup.

Grafana Labs at KubeCon: Awesome Query Performance with Cortex

At KubeCon + CloudNativeCon in Barcelona last week, Weaveworks’ Bryan Boreham and I did a deep-dive session on Cortex, an OSS Apache-licensed CNCF Sandbox project. A horizontally scalable, highly available, long-term storage for Prometheus, Cortex powers Grafana Cloud’s hosted Prometheus. During our talk, we focused on the steps that we’ve taken to make Cortex’s query performance awesome.

OpsQ Observed Mode: Building a Culture of Trust for Modern Operational Intelligence

IDC’s Worldwide CIO Agenda 2019 Predictions expects that, by 2021, 70% of CIOs will invest in machine learning and data science techniques for greater agility and innovation in IT operations management. In an AI-enabled future, enterprises will increasingly rely on advanced analytics to address a variety of IT operations use cases, including problem recognition, impact analysis, anomaly detection, root cause analysis, and incident resolution.

Grafana Labs at KubeCon: Foolproof Kubernetes Dashboards for Sleep-Deprived On Calls

We’ve all been in the situation where suddenly you are the lone developer on call while everyone is out of pocket. Or, in the case of Grafana Labs Director of UX David Kaltschmidt, his then business partner, Grafana Labs VP of Product Tom Wilkie, was checking out for a weekend music fest. “Tom and I founded a company a couple of years ago, and I’m more of a frontend person. Tom did all the backend and devops stuff,” explained Kaltschmidt.

How I decimated Postgres response times for my SaaS

Last week I rolled out a simple patch that decimated the response time of a Postgres query crucial to Checkly. It quite literally went from an average of ~100ms, with peaks of up to 1 second, to a steady 1ms to 10ms. However, that patch was just the last step of a longer journey. This post details those steps and all the stuff I learned along the way. We'll look at how I analyzed performance issues, how I tested fixes, and how simple Postgres optimizations can have spectacular results.