Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

20 KPIs Your MSP Should be Tracking (Webinar)

As an MSP owner or manager, you want your teams to be continually improving and increasing efficiency. To make that happen, you know you need to be tracking MSP KPIs—after all, what gets measured gets improved. But which numbers do you track, how many do you track, and how do you actually use them to get better?

Ready. Set. WAIT-New Report Shows Why Your Device is Slow to Start

Every workday you open your laptop or start your desktop, and you wait. For some, that wait is a mere blip in the day, a few seconds, for others that wait can seem interminable. A few months ago, our engineering team set out with the task of exploring what variables really impact a slow device performance. During the course of their research, the team uncovered answers to very specific questions like: What is the average startup time for a work device? (Hint: it’s less than five minutes).

Announcing Early Access to Variable Retention on LogDNA

The massive proliferation of log data forces teams to manage the costs to process, route, and store it. Teams need access to this data to gain critical insights into their services, but for many organizations this presents a challenge for their budget. Logging can get expensive, fast, which often results in teams making difficult tradeoffs between aggregating enough logging information to be useful and controlling the cost of storing all those logs.

Debug iOS crashes efficiently with Datadog RUM

Unsurprisingly, application crashes due to fatal errors can be a major pain point for iOS users. Recent research shows that roughly 20 percent of mobile application uninstalls were due to crashes or other code errors. As a developer, it’s paramount to manage this potential churn by capturing comprehensive crash data in order to track, triage, and debug recurring issues in your iOS apps.

Sponsored Post

Troubleshooting Office 365 Issues Made Simple

Do you often ask yourself the question - Is there an Office 365 problem today? While you try to find the answer, your customers (end-users) complain because they can't access their business applications. Apart from all this, your boss needs an immediate status update. Trust me. It doesn't feel great to be in that situation. And we know it. Despite Microsoft claiming to provide 99.9% SLA, issues will occur with the Office 365 applications such as Teams, Outlook, OneDrive, Exchange Online, SharePoint, Yammer, etc. Often, the issues aren't even Microsoft's problems but an ISPs or internal network change. There can be lot of reasons (Network, OS, browser, personal device, upgrade errors, Internet, and much more), but which one is it?

Metrics Dashboard, Scale testing upto 500K events/sec - Signal 05

A month and thousands of code lines later, we're here with our monthly product update - Signal #05. We squashed bugs, shipped custom metric dashboard along with improvisations in our frontend. We also got featured by one of the top online analytics magazines as one of the leading Data Observability platforms. 🥳 Let's dive in to see what humans at SigNoz have been up to!

How Do You Monitor Cassandra Performance: Key Metrics to Measure

Apache Cassandra is a distributed database known for its high availability, fault tolerance, and near-linear scaling. It was initially developed by Facebook, but it is a widely used open-source system used by the largest tech companies in the world. There are numerous reasons behind its popularity, including no single point of failure, exceptional horizontal scaling with a data layout designed as a perfect fit for time-series data.

7 Best Log and Syslog Viewers

Many devices—such as switches, routers, firewalls, servers, and printers—support syslog protocol. This standard for sending log messages within a network offers critical information about your system. Consequently, monitoring your network and its syslog messages should be a top priority. Many IT professionals use log and syslog monitors or viewers to gather logs and syslog messages from across their network in a centralized location.

Incident Review For the Facebook Outage: When Social Networks Go Anti-social

The following is an analysis of the Facebook incident on 10/4/2021. Marking a highly unusual state of events, Facebook, Instagram, WhatsApp, Messenger, and Oculus VR were down simultaneously around the world for an extended period of time Monday. The social network and some of its key apps started to display error messages before 16:00 UTC. They were down until 21:05 UTC, when things began to gradually return to normality.