Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

How to calculate the difference of a value over time with InfluxDB and Grafana

Learning about the past helps us understand the present, and even predict the future. So, whether you are monitoring CPU usage or how long your IoT device was powered on and then off, at some point, you might want to know the difference of a value over time. InfluxDB is an open source database for storing and retrieving time series data. Thanks to its own query languages — flux and InfluxQL — it provides different and powerful ways to analyze data.

Cloud Observer: Subsea Cable Maintenance Impacts Cloud Connectivity

In this edition of the Cloud Observer, we dig into the impacts of recent submarine cable maintenance on intercontinental cloud connectivity and call for the greater transparency from the submarine cable industry about incidents which cause critical cables to be taken out of service.

Build Operational Resilience with Generative AI and Automation

For modern enterprises aiming to innovate faster, gain efficiency, and mitigate the risk of failure, operational resilience has become a key competitive differentiator. But growing complexity, noisy systems, and siloed infrastructure have created fragility in today’s IT operations, making the task of building resilient operations increasingly challenging.

Introducing Cortex Eng Intelligence

Engineering teams rely on certain metrics to assess their ability to deliver quality products, on time. This is a useful exercise, but execution has been lacking—with metric collation often handled via spreadsheet, or stand-alone tool. Neither approach is ideal for two reasons: 1) How—or more specifically where—metrics are collected silos them away from business context.

Ensuring Robust Security in Office 365

In an era where digital threats are evolving rapidly, securing your Office 365 environment has never been more crucial. Office 365, a suite known for its robust productivity tools, also demands a proactive approach to security. This blog post delves into essential practices and strategies to fortify your Office 365 setup against various cyber threats.

Automate insights-rich incident summaries with generative AI

Does this sound familiar? The incident has just been resolved and management is putting on a lot of pressure. They want to understand what happened and why. Now. They want to make sure customers and internal stakeholders get updated about what happened and how it was resolved. ASAP. But putting together all the needed information about the why, how, when, and who, can take weeks. Still, people are calling and writing. Nonstop.

9 Cheap Cloud Storage Solutions to Keep Your Wallet Happy

Many businesses and individuals now store most of their data on the cloud, and for good reason. Cloud data storage is accessible, convenient, and even more secure compared to local storage options – after all, many cloud data stored services leverage advanced cyber security and data monitoring teams.

What is PagerDuty - and how does it work with BigPanda?

PagerDuty is an IT operations management platform and cloud computing company launched in 2009. They provide a suite of tools designed to help IT and DevOps teams detect and respond to infrastructure problems, streamline workflows, and improve operational reliability. The PagerDuty platform bridges different systems and the teams that maintain them, centralizing the detection and reporting of incidents. It allows organizations to minimize downtime and resolve issues efficiently.

Govern your infrastructure resources with the Datadog Resource Catalog

As an administrator of an expanding, highly distributed infrastructure, you may be responsible for overseeing thousands of on-premise and cloud resources from multiple providers—governed under dozens of accounts by a complex nest of RBAC rules. To query all these resources for purposes such as compliance audits and access management, you may be required to write custom scripts and painstakingly sift through data across disparate tools.