

How to convince your boss you need a status page

Every company depends on tools that help its people do their jobs: HR tools, CRMs, chat and collaboration tools, business intelligence tools, marketing automation tools… the list goes on. Bringing another tool into the mix involves approval processes, buy-in from execs, and the occasional nudge to the boss for her credit card.

The Fastest Path to Modernizing Incident Management

Long gone are the days of manually monitoring an inbox and deciphering which alerts require attention or action. However, when adopting or migrating to a new tool, it can seem like a daunting process to set up all of your teams, integrations, and notification settings. OpsGenie is here to help. We offer dedicated Pre-Sale Engineers and Customer Success Engineers who will help you identify your bottlenecks and precise needs within OpsGenie.

EventSentry v3.5 Released: Windows Process Monitoring to the Max, Registry Tracking, Tags & More

EventSentry v3.5 continues to increase visibility into networks with additional vantage points, making it easier for EventSentry users to reduce their attack surface as well as discover anomalies.

The Monitor - Andy Tuba, Senior Software Developer at Reddit

For the sixth edition of The Monitor we spoke to Andy Tuba, a Senior Software Engineer at Reddit. Reddit is a site that needs no introduction, but we’re gonna write one anyway because otherwise this section would just be blank. They bill themselves as the front page of the internet, and considering they’re the 8th most popular website in the world, that isn’t just marketing pablum.

Monitoring Django apps on Heroku

I don't know of an easier way to deploy a Django app than letting Heroku do the work. That said, how do you stay on top of your app's performance, errors, and stability post-launch? Running an app on Heroku is a blissful experience, but it presents some monitoring challenges that aren't present when you control the hardware. In this post, I'll walk through a free-to-start, low-effort approach that gives you great visibility into the health of your Django app on Heroku.

Accelerating Incident Response With Real-Time Business Data at Wayfair

Like any good e-commerce company, Wayfair collects a significant amount of data to use for business intelligence. Until recently, the majority of this data was crunched off-hours in preparation for business use the next day. We also create a great deal of data about our applications and infrastructure in real time.

Volunteers, Not Conscripts: Fixing Out-Of-Hours On-Call at Intercom

Uptime matters. At Intercom, we believe that keeping our product online and working well at all times is critical to the success of our business. Out-of-hours on-call is inherently disruptive to your life as an engineer. You need to be ready to respond quickly and competently to an alert about something being broken.

The top 5 VM metrics that every Azure admin should monitor

The age-old approach of storing data in self-hosted data centers is rapidly becoming obsolete, with most organizations shifting towards cloud solutions like Microsoft Azure. Azure delivers more than 600 services, including on-demand computing, storage, data management, and networking. Virtual machines (VMs) are an important component of the Azure cloud platform.