Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

5 Things You Need in a Digital Operations Management Platform

It’s pretty well known that we live in a connected, always-on world where seconds matter when it comes to customer happiness. There are smaller incident management solution providers that offer what looks to be competitive pricing—but it’s important to consider the bigger picture outside basic alerting and incident response.

How to connect Stackdriver to external monitoring

Google Stackdriver lets you track your cloud-powered applications with monitoring, logging and diagnostics. Using Stackdriver to monitor Google Cloud Platform (GCP) or Amazon Web Services (AWS) projects has many advantages—you get detailed performance data and can set up tailored alerts. However, we know from our customers that many businesses are bridging cloud and on-premises environments.

Four nines and beyond: A guide to high availability infrastructure

We’ve talked about the increasingly-interconnected nature of cloud tools and the domino-goes-crashing-down effect thatcan happen when just one critical service has downtime. Web uptime is more important than ever, and it’s critical that these services we all rely on are up and running as often as possible.

Sysdig Monitor summer 2018 release.

It’s the first day of summer and the perfect opportunity for our summer Sysdig Monitor release round up. For those of you following our progress, we use these blogs to showcase the work we’ve done to add increased functionality, scale, and usability with Sysdig Monitor. What follows are quick descriptions of all the good stuff we’ve made available over the past few months.

How to Get Java Code-Level Visibility

The toughest IT performance problems to solve today are the ones where a user complains that their application access is slow. An IT administrator must then figure out the cause of the problem: is it the browser, the network, the server, the storage, the cloud infrastructure on which the application is hosted, or the application code?

Use New Range Markers to Show The Duration of a State Change

In our world of distributed systems, state changes to your infrastructure often take some time to propagate. With a few exceptions (for example, feature flags), single point in time changes are rare. Deploys, outages, database migrations, failovers, stress tests; none of these things are instantaneous – all have some duration during which the system is changing.