Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Announcing Early Access to Variable Retention on LogDNA

The massive proliferation of log data forces teams to manage the costs to process, route, and store it. Teams need access to this data to gain critical insights into their services, but for many organizations this presents a challenge for their budget. Logging can get expensive, fast, which often results in teams making difficult tradeoffs between aggregating enough logging information to be useful and controlling the cost of storing all those logs.

Debug iOS crashes efficiently with Datadog RUM

Unsurprisingly, application crashes due to fatal errors can be a major pain point for iOS users. Recent research shows that roughly 20 percent of mobile application uninstalls were due to crashes or other code errors. As a developer, it’s paramount to manage this potential churn by capturing comprehensive crash data in order to track, triage, and debug recurring issues in your iOS apps.

Sponsored Post

Troubleshooting Office 365 Issues Made Simple

Do you often ask yourself, "Is there an Office 365 problem today?" While you try to find the answer, your customers (end users) complain because they can't access their business applications. On top of that, your boss needs an immediate status update. Trust me, it doesn't feel great to be in that situation, and we know it. Although Microsoft claims to provide a 99.9% SLA, issues will occur with Office 365 applications such as Teams, Outlook, OneDrive, Exchange Online, SharePoint, and Yammer. Often, the issue isn't even Microsoft's problem but an ISP's or the result of an internal network change. There can be a lot of reasons (network, OS, browser, personal device, upgrade errors, Internet, and much more), but which one is it?

Metrics Dashboard, Scale Testing up to 500K events/sec - Signal 05

A month and thousands of lines of code later, we're here with our monthly product update - Signal #05. We squashed bugs and shipped a custom metrics dashboard along with improvements to our frontend. We also got featured by one of the top online analytics magazines as one of the leading Data Observability platforms. 🥳 Let's dive in to see what the humans at SigNoz have been up to!

How Do You Monitor Cassandra Performance: Key Metrics to Measure

Apache Cassandra is a distributed database known for its high availability, fault tolerance, and near-linear scaling. It was initially developed by Facebook, but it is now a widely used open-source system adopted by some of the largest tech companies in the world. There are numerous reasons behind its popularity, including no single point of failure, exceptional horizontal scaling, and a data layout that is well suited to time-series data.

7 Best Log and Syslog Viewers

Many devices, such as switches, routers, firewalls, servers, and printers, support the syslog protocol. This standard for sending log messages within a network offers critical information about your system. Consequently, monitoring your network and its syslog messages should be a top priority. Many IT professionals use log and syslog monitors or viewers to gather logs and syslog messages from across their network in a centralized location.

Incident Review For the Facebook Outage: When Social Networks Go Anti-social

The following is an analysis of the Facebook incident on October 4, 2021. In a highly unusual turn of events, Facebook, Instagram, WhatsApp, Messenger, and Oculus VR were down simultaneously around the world for an extended period on Monday. The social network and some of its key apps started to display error messages before 16:00 UTC. They were down until 21:05 UTC, when things began to gradually return to normal.