
Latest Posts

Improving your on-call schedule with runbooks

Incidents are a stressful time for your team: your service isn't working the way you expect and your customers/stakeholders want to know what's going on. The last thing you want to do is let your team improvise everything when it comes to responding to incidents. Google's own SRE book has great overall tips for incident management, part of which involves "develop(ing) and document(ing) your incident management procedures in advance", which this article dives into.

Evolving Corporate Sustainability Solutions: An Interview with Sebastien Duprez & Nina Zellweger

As the world continues to feel the pressure of climate change, more and more actors in the private sector are implementing solutions to reduce their carbon emissions and slow down global warming. For many organizations, technology is a major focus of their carbon reduction strategy. Most IT-related emissions are linked to digital workplace equipment; in fact, the workplace represents 70% of overall IT-related emissions.

Secure External Document Sharing in SharePoint

SharePoint, a product of Microsoft’s suite of office tools, has revolutionized the way organizations collaborate and manage documents. At its core, SharePoint is designed to facilitate the seamless sharing of information, both within an organization and with external partners. The ability to share documents externally is particularly valuable in today’s global business environment, where collaboration with vendors, clients, and contractors across geographical boundaries is commonplace.

Schedule Cron Jobs in Node.js with Node-Cron

Cron jobs are tasks scheduled to run automatically at set times or intervals. They automate repetitive work such as backing up data, sending emails, and updating systems. In a Node.js application, scheduling tasks this way can improve efficiency and reliability, and several libraries and tools, including node-cron, support it.
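To make "certain times or intervals" concrete, here is a minimal, dependency-free sketch of how a five-field cron expression (minute, hour, day of month, month, day of week) can be checked against a point in time. It is not node-cron's implementation: the names `fieldMatches`, `matchesCron`, and `nightlyBackup` are hypothetical, and only `*`, `*/n`, and plain numbers are handled.

```javascript
// Illustrative sketch only: supports "*", "*/n", and single numbers.
// Real cron libraries also handle ranges, lists, names, and special
// day-of-month / day-of-week rules, all of which are ignored here.
function fieldMatches(field, value) {
  if (field === "*") return true;
  if (field.startsWith("*/")) {
    const step = Number(field.slice(2));
    return Number.isInteger(step) && step > 0 && value % step === 0;
  }
  return Number(field) === value;
}

function matchesCron(expression, date) {
  const fields = expression.trim().split(/\s+/);
  if (fields.length !== 5) {
    throw new Error("expected five space-separated fields");
  }
  const values = [
    date.getMinutes(),   // minute: 0-59
    date.getHours(),     // hour: 0-23
    date.getDate(),      // day of month: 1-31
    date.getMonth() + 1, // month: 1-12 (JavaScript months start at 0)
    date.getDay(),       // day of week: 0 = Sunday
  ];
  // Simplification: every field must match (fields are ANDed together).
  return fields.every((field, i) => fieldMatches(field, values[i]));
}

// Hypothetical schedule: a backup every night at 02:30 local time.
const nightlyBackup = "30 2 * * *";
console.log(matchesCron(nightlyBackup, new Date(2024, 0, 15, 2, 30)));
```

With a scheduling library, the same expression would typically be passed together with a callback, and the library would take care of the timing.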

Release Roundup March 2024: More ways to discover and test your services

2024 is off to a fast start here at Gremlin. Since our last release roundup, we’ve released new experiment types, new features to improve integration with cloud platforms, and improvements to our auto-detection processes. Now you can push processes to their limits, find dependencies even more easily, limit when tests can be run, and much more. We also introduced a slew of platform changes that boost efficiency, performance, and user experience in the Gremlin web application.

OpenTelemetry and Elastic: Working together to establish continuous profiling for the community

Profiling is emerging as a core pillar of observability, aptly dubbed the fourth pillar, with the OpenTelemetry (OTel) project leading this essential development. This blog post dives into the recent advancements in profiling within OTel and how Elastic® is actively contributing toward it. At Elastic, we’re big believers in and contributors to the OpenTelemetry project.

Instrumenting Lumigo for Python using OpenTelemetry

Standardized frameworks play a fundamental role in leveling the playing field and setting the standard within the tech industry, ensuring that everyone has access to the same tools and practices. These frameworks promote best practices and foster innovation and collaboration across different sectors. One example of such a framework is OpenTelemetry, a project that has rapidly gained traction and continues to flourish as an open-source initiative under the Cloud Native Computing Foundation (CNCF).

Part 3: Infrastructure Monitoring Tools

From networking and servers to databases and applications, the infrastructure is the backbone of an organization's operations. With the rise of digitalization, the need for reliable and efficient infrastructure has become more important than ever. Whether it be transportation systems, communication networks, or energy grids, infrastructure plays a vital role in keeping our society functioning smoothly.

APM Metrics: The Ultimate Guide

How your software applications perform is an extremely important factor in determining end-user satisfaction. APM metrics are the key indicators that help business-critical applications achieve peak performance. This article explains what APM metrics are, why they matter, and which core metrics modern software teams use to measure and optimize application performance.