Operations | Monitoring | ITSM | DevOps | Cloud

Latest Blogs

Announcing variable substitution in Stackdriver alerting notifications

When an outage occurs in your cloud application, having fast insight into what’s going on is crucial to resolving the issue quickly. If you use Google Stackdriver, you probably rely on alerting policies to detect these issues and notify you with relevant information. To improve the organization and readability of the information contained in these alerts, we’ve added some new features to make our alerting notifications more descriptive, useful and actionable.

Introducing Dashboard Widgets

This morning we launched a new update for the RapidSpike App. This update includes a totally new Dashboard experience for our users. Our old dashboard – or “Home” screen – featured a number of fairly static graphs and charts showing your account usage. We felt that this screen was badly in need of an update to show off the cool and exciting facts and figures RapidSpike can generate.

7 Common Web Application Performance Problems (and How to Solve Them)

One of the cornerstones of a successful business in today’s digital environment is ensuring that web application performance is user-friendly and runs smoothly. A well-oiled website and its applications represent the face of a company, and in an ideal scenario, they serve as a mark of reliability, innovation, and progress.

Retrace Log Management: Logs, Errors and Code Level Performance

Log management is traditionally described as a way to collect all of your log data in one place so you can use it for a wide variety of uses. Retrace APM with log management aims to create the perfect product and user experience for developers with specific needs for managing logs..

How Much Downtime is Acceptable?

Downtime occurs. It's an unfortunate fact of online life. No website is able to provide 100% uptime - even tech giants like Google suffer downtime, albeit very occasionally. So, some amount of downtime is inevitable, but how much is acceptable? This question is obviously subjective - downtime that's acceptable for one person may be intolerable for another. Therefore, we undertook a little research...

Skylight Agent 2.0 Released

Today, we released version 2.0 of the Skylight Agent. 2.0 doesn't introduce any new APIs, but we did rewrite the SQL Lexer to support more varieties of queries. We also spent a lot of time on internal refactoring and improved our error logging. Since we follow semantic versioning, we also took the opportunity to drop support for some older dependencies and environments. Read on for more information about upgrading as well as some technical details on our internal changes.