The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.
Every workday you open your laptop or start your desktop, and you wait. For some, that wait is a mere blip in the day, a few seconds, for others that wait can seem interminable. A few months ago, our engineering team set out with the task of exploring what variables really impact a slow device performance. During the course of their research, the team uncovered answers to very specific questions like: What is the average startup time for a work device? (Hint: it’s less than five minutes).
The massive proliferation of log data forces teams to manage the costs to process, route, and store it. Teams need access to this data to gain critical insights into their services, but for many organizations this presents a challenge for their budget. Logging can get expensive, fast, which often results in teams making difficult tradeoffs between aggregating enough logging information to be useful and controlling the cost of storing all those logs.
Unsurprisingly, application crashes due to fatal errors can be a major pain point for iOS users. Recent research shows that roughly 20 percent of mobile application uninstalls were due to crashes or other code errors. As a developer, it’s paramount to manage this potential churn by capturing comprehensive crash data in order to track, triage, and debug recurring issues in your iOS apps.
Do you often ask yourself the question - Is there an Office 365 problem today? While you try to find the answer, your customers (end-users) complain because they can't access their business applications. Apart from all this, your boss needs an immediate status update. Trust me. It doesn't feel great to be in that situation. And we know it. Despite Microsoft claiming to provide 99.9% SLA, issues will occur with the Office 365 applications such as Teams, Outlook, OneDrive, Exchange Online, SharePoint, Yammer, etc. Often, the issues aren't even Microsoft's problems but an ISPs or internal network change. There can be lot of reasons (Network, OS, browser, personal device, upgrade errors, Internet, and much more), but which one is it?
Apache Cassandra is a distributed database known for its high availability, fault tolerance, and near-linear scaling. It was initially developed by Facebook, but it is a widely used open-source system used by the largest tech companies in the world. There are numerous reasons behind its popularity, including no single point of failure, exceptional horizontal scaling with a data layout designed as a perfect fit for time-series data.
The following is an analysis of the Facebook incident on 10/4/2021. Marking a highly unusual state of events, Facebook, Instagram, WhatsApp, Messenger, and Oculus VR were down simultaneously around the world for an extended period of time Monday. The social network and some of its key apps started to display error messages before 16:00 UTC. They were down until 21:05 UTC, when things began to gradually return to normality.