Blog

Why the CEO Cares About Splunk

Evolving business themes come at us in waves. So far this millennium we have had The New Economy, The Cloud, and now Digital Transformation. Underpinning these themes are real, economically significant dynamics that drown out the bleating voices of pundits and cynics alike. The Internet is not a bubble, the cloud is not a fad, and digital transformation is, well, transformative. Let’s take a look at what that last one means for Financial Services.

Global Restart: CIOs Need to Simplify in the Face of Complexity

We have to get everyone back to work. The global restart of economies derailed by the coronavirus pandemic is challenging organizations across the board. And from one industry to the next, IT must be a central player in establishing a new normal. Organizations that had to entirely shut down facilities — retail stores, manufacturing plants, restaurants, theme parks — may particularly struggle to reestablish operations with new approaches that protect worker and customer health.

How to categorize logs for more effective monitoring

Logs provide a wealth of information that is invaluable for use cases like root cause analysis and audits. However, you typically don’t need to view the granular details of every log, particularly in dynamic environments that generate large volumes of them. Instead, it’s generally more useful to perform analytics on your logs in aggregate.
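As a rough illustration of analyzing logs in aggregate, the sketch below collapses similar messages into a shared pattern and counts how often each pattern occurs. The helper names (`to_pattern`, `count_patterns`) and the sample log lines are hypothetical, and masking only numbers is a deliberately simple stand-in for whatever pattern detection a real log platform performs.

```python
import re
from collections import Counter

def to_pattern(message: str) -> str:
    """Mask numeric tokens so messages that differ only in values share one pattern."""
    return re.sub(r"\b\d+(\.\d+)*\b", "<num>", message)

def count_patterns(lines):
    """Count how many log lines fall under each masked pattern."""
    return Counter(to_pattern(line) for line in lines)

# Hypothetical sample data, for illustration only.
logs = [
    "user 42 logged in",
    "user 7 logged in",
    "request took 120 ms",
    "user 13 logged in",
]

counts = count_patterns(logs)
for pattern, total in counts.most_common():
    print(total, pattern)
```

Grouping this way lets you watch the volume of each category over time rather than reading individual entries.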

Monitoring AWS Fargate with Prometheus and Sysdig

In this article, we will show how easy it is to monitor AWS Fargate with Sysdig Monitor. By leveraging existing Prometheus ingestion in Sysdig, you will be able to monitor serverless services with a single-pane-of-glass approach, giving you confidence in running these services in production.

Identify, Reconcile & Rock the Multi-Source CMDB: From Nathan Foreman, Solutions Architect - Cookdown

Nathan Foreman, our solutions superstar here at Cookdown, shares his knowledge on how to successfully blend data sources to create a beautifully populated CMDB, as well as tips on how to customise and maintain data quality. Nathan talked us through the process of using ServiceNow’s out-of-the-box Identification and Reconciliation Engine (IRE) to ensure a smooth integration of a multi-source CMDB.

Learn Grafana: How to automatically repeat rows and panels in dynamic dashboards

Running your software on dynamic infrastructure means that your monitoring platform needs to change dynamically. Variables let you reuse a single dashboard for all your services. Select the service you want to inspect from a drop-down menu, and watch panels update to only show you metrics from that service. Grafana lets you create dynamic dashboards using template variables. Any variables in your queries interpolate the current value of the variable before the query is sent to the database.
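To make the interpolation step concrete, here is a minimal sketch of substituting a variable's current value into a query string before it is sent. It is not Grafana's implementation: the `interpolate` function, the example query, and the `checkout` service name are assumptions for illustration, and only the `$name` and `${name}` forms are handled.

```python
import re

def interpolate(query: str, variables: dict) -> str:
    """Replace $name and ${name} with the variable's current value.

    Unknown variables are left untouched so the gap stays visible.
    """
    def replace(match):
        name = match.group(1) or match.group(2)
        return str(variables.get(name, match.group(0)))

    return re.sub(r"\$\{(\w+)\}|\$(\w+)", replace, query)

# Hypothetical query and selection, for illustration only.
query = 'rate(http_requests_total{service="$service"}[5m])'
selected = {"service": "checkout"}

print(interpolate(query, selected))
```

Changing the selected value and interpolating again yields a new query from the same panel definition, which is what lets one dashboard serve every service.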

Purposeful Power Monitoring for IT

Have you ever lost power to a server? Did it ever reboot on its own? Wouldn’t it be nice to prevent power outages on IT devices? If this is something you’ve experienced in the past, there are ways to simplify power monitoring and avoid some of the outages that can be caused by power issues. This article will focus on using power consumption data from a rack power distribution unit (rPDU) and how to simplify the process.

A Journey Through Blameless from Incident to Success

Here at Blameless, every aspect of our product has SLOs (Service Level Objectives) and error budgets in order to help us understand and improve customer experience. Sometimes, these error budgets are at risk, triggering an incident. While incidents are often painful, we treat them as unplanned investments, striving to learn as much as we can from them. We empower all of our engineers to handle an on-call rotation, no matter how difficult the issue.
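For readers new to error budgets, the arithmetic can be sketched as follows. This is a generic availability-style calculation, not Blameless's own method; the function names, the 99.9% target, the 30-day window, and the 25% risk threshold are all assumptions chosen for illustration.

```python
def error_budget_minutes(slo_target: float, window_days: int) -> float:
    """Minutes of allowed unavailability for a given SLO target and window."""
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes

def budget_remaining(slo_target: float, window_days: int, downtime_minutes: float) -> float:
    """Fraction of the error budget still unspent (negative once overspent)."""
    budget = error_budget_minutes(slo_target, window_days)
    return (budget - downtime_minutes) / budget

# Assumed example: a 99.9% target over 30 days allows roughly 43.2 minutes.
budget = error_budget_minutes(0.999, 30)
remaining = budget_remaining(0.999, 30, downtime_minutes=35.0)

# Hypothetical policy: treat the budget as at risk below 25% remaining.
at_risk = remaining < 0.25
print(f"budget={budget:.1f} min, remaining={remaining:.0%}, at_risk={at_risk}")
```

With 35 minutes of downtime already spent, less than a quarter of the budget remains, which is the kind of condition that could trigger an incident.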