Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Cloudways - A Managed Cloud Hosting Platform that Facilitates Choice, Simplicity, and Performance

A reliable web host is a friend like no other when you’re closely monitoring your website. You should be able to spread your wings and expand your horizons without all the fuss. In our review of many web-hosting providers, we found one name that is powerful enough to scale your website effectively: Cloudways.

Change is in the air. BigPanda can help you embrace it.

It’s time for change – my first insight from GartnerIO, 2018. During my flight to Vegas I remembered that this was probably my 15th visit to a conference in Vegas (but who can really count…after a few, it all starts to blur together). What I do remember, though, is that at each of these conferences there was one very shiny buzzword. The reason I remember this so well is that, while everyone agreed on what the buzzword was, there was no consensus on what that word meant!

The Tool Sprawl Problem in Monitoring

One of the biggest KPIs in the DevOps space is monitoring. Many tools can help an organization complete its monitoring picture, but no single tool does everything, so most organizations combine several of them. Mashing tools together often creates a problem of its own: the tool sprawl problem.

How to Monitor Kubernetes Without an Agent on Every Node

LogicMonitor is an agentless monitoring solution. What we really mean by “agentless” is that we don’t require an agent on every monitored server (physical or virtual). One LogicMonitor Collector - a lightweight application that takes just seconds to install - can monitor hundreds or even thousands of devices, including servers, virtual machines, network switches, storage systems, cloud resources, containers, and more.

Site Reliability Engineering Meets Traditional Operations

Google has effectively made the discipline of site reliability engineering (SRE) a DevOps best practice by publishing two decades’ worth of lessons in keeping alive the most scalable apps on the planet. As more organizations make the shift (or “transformation,” as it were) to becoming IT organizations, the demand for reliability increases substantially for customer-facing services.

Virtual Offsites: A Collaboration Approach For Distributed Teams

Once a year, PagerDuty’s SREs get together for a three-day, in-person offsite. With the team spread across three time zones in the U.S. and Canada, encompassing two offices and three remote members, face time is rare and valuable. We use our offsites for thoughtful discussions on team health, long-term project roadmap planning, refining and updating our team’s mission, and simply spending time together as a team.