Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Getting Started with Sending StatsD Metrics to Telegraf & InfluxDB

This tutorial will walk you through sending StatsD metrics to Telegraf. StatsD is a simple protocol for sending application metrics via UDP. These metrics can be sent to a Telegraf instance, where they are aggregated and periodically flushed to InfluxDB or other output sinks that you have configured. At the time of writing, we have 37 different output plugins supported.

Splunk Recognized as Value Leader in EMA Radar Report: AIOps, A Guide for Investing in Innovation

Splunk has been recognized as a Value Leader in EMA’s latest research, Radar Report: AIOps, A Guide for Investing in Innovation. Vendors were evaluated across three categories, with Splunk listed as a Value Leader in both ‘Incident, Performance, and Availability Management’ and ‘Business Impact and IT-To-Business Alignment’, as well as Strong Value in ‘Change Impact and Capacity Optimization’.

NiCE MongoDB Management Pack for Microsoft SCOM

Cloud application requirements have pushed beyond the limitations of relational database management systems. This rising demand for managing unstructured data, along with the growing need for data analytics, has caused significant market growth in the NoSQL database sector. MongoDB, as one of the classical NoSQL databases, is a powerful tool helping companies align with new cloud-based business strategies. With great power comes great responsibility.

Update Statuspage using the new Uptrends integration

Atlassian, the maker of Opsgenie (We have an integration for Opsgenie too!), also offers Statuspage. Statuspage is a communication tool that allows users to view the current and recent past status of a website, API, or web application. In addition to a hosted status notification page, Statuspage also sends out messages to users to keep them informed about your site’s availability. Now Uptrends can automatically send updates to Statuspage using the new integration.

How to maximize the value of SCOM - Monitoring, Alerts, Incidents & Visualization

In SCOM you can see the monitoring that generates your alerts (the contents of Health Explorer). While SCOM doesn’t always make it easy to get at the valuable context that this monitoring data provides, it is there and can help answer the "why" questions that often come up when looking at an alert in isolation.

Node.js Resiliency Concepts: Recovery and Self-Healing

In an ideal world where we reached 100% test coverage, our error handling was flawless, and all our failures were handled gracefully — in a world where all our systems reached perfection, we wouldn’t be having this discussion. Yet, here we are. Earth, 2020. By the time you read this sentence, somebody’s server failed in production. A moment of silence for the processes we lost.

Java Logging: Best Practices for Success with your Java Application

Java is used by at least 7.6 Million developers worldwide. Java logging has been a staple of the Java platform since day one, boasting extensive, resourceful documentation and rich API’s. The cornerstone of monitoring your application is efficient and widespread logging. At Coralogix, we know that logs have become one of the most important components of a modern monitoring function.

All the non-technical advantages of Loki: reduce costs, streamline operations, build better teams

Hi, I’m Owen, one of the Loki maintainers, and I’m putting proverbial pen to paper to convince you why Loki is important. And this isn’t because it scales (it does) or because I work at Grafana Labs (I do). It’s because of the oft-overlooked and underrepresented organizational benefits. Organizational benefits?! What is this, some sort of cult? Why are you avoiding the technicals? Whoa, whoa, whoa. Now, hold on. The technicals are still valid.