Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Dogfooding Chronicles: Tracing the path from "It's Slow" to "What's Slow"

Dogfooding is the practice of sampling your own product before the public does. For dog food executives, chowing down on their own kibble is a literal gut-check. For Sentry, we’ve been using Performance in advance of its upcoming release as a way to think through our own issues, all so we can give you better visibility into yours. Context is critical for software teams. It bridges the gap between a problem to solve and the right person to solve it.

Spot Achieves Red Hat OpenShift Operator Certification for its Serverless Container Engine Ocean

Optimization and validation bring hands-free infrastructure for Kubernetes in public and hybrid clouds to Red Hat OpenShift customers 21 May, 2020 — Spot, a leading provider of software for modern CloudOps, today announced that Ocean by Spot has achieved Red Hat Operator certification for OpenShift.

Building confidence and gaining experience with good open source projects

This year, I got a unique opportunity to call in at Mattercon 2020 and give a talk about my experience working on Mattermost and open source software (OSS) in general. I talked about how OSS helped me grow as a self-taught developer and how working on issues from Mattermost’s repos helped me gain experience and confidence in software development. In this article, I will highlight some of the things I talked about and also throw in a few pointers related to working on OSS.

CI/CD In Confidence: How Pipelines Keeps Your Secrets

A friend that can’t keep a secret isn’t one you’ll rely on. The same is true for your mission critical CI/CD tool that you have to entrust with credentials for each integrated component. Keeping your secrets safe can be a challenge for CI/CD tools, since they need to connect to such a variety of other services. Each one needs its own password or token that must be kept hidden from prying eyes.

Working From Home: The Good, the Bad, and Everything in Between

I’ve always been an advocate for working from home. While some would say working from home is rarely a good idea, decades of personal experience have proven otherwise. For one of those decades, my husband and I ran a successful marketing consulting company. Our work was done solely from our home office, all while raising five children. We started our days early before the kids woke up, and when they left for school, we worked feverishly until their return.

Real-time alerts from Zabbix and escalation with Zenduty

Recently, one of our customers, a 20-member NOC team of a large B2C company, had set up Zabbix to monitor a network of over 1000+ servers, routers, and switches. The NOC team wanted to set up alerting, on-call scheduling, and an escalation matrix whenever a critical network component encountered any downtime. The NOC team used Slack as the primary communication channel and Zoom for real-time communication. For NOC teams like these running a very large operation, setting up alerting can be very tricky.

When Incidents are not investigated, Problems await

Incident and Problem Management are two very different issues in IT service management that are unfortunately often used interchangeably. On the surface, it might just seem like a matter of terminology. But, what if you get to know that one is a small hiccup and the other could dent your entire quarterly or annual results?

Service and process monitoring: At a glance

With Site24x7 Server Monitoring, you can track the availability and system-level metrics of your servers, including CPU, memory, disk usage, and more. But did you know that you can also monitor the performance of each and every service and process running on your servers? Don't fall behind Almost all applications rely on a large number of services (for Windows) and processes (for Linux) to run smoothly and effectively.

Kafka monitoring: Metrics that matter

Kafka is a distributed streaming platform that acts as a publish-subscribe messaging queue by receiving data from various source systems and making it available to various systems and applications in real time. Key advantages for utilizing Kafka are that it provides durable storage, meaning the data stored within it cannot be easily tampered with, and it is highly scalable, so it can handle a large increase in users, workloads, and transactions when necessary.

Speed up ticket resolution in your ServiceDesk Plus help desk with automation

Helping businesses deliver a seamless customer experience and ensure zero downtime has always been a key aspect of ManageEngine ServiceDesk Plus. One of this service desk solution’s powerful integrations is with Site24x7, wherein tickets are logged for specific Site24x7 alerts like Trouble, Critical, and Down. Once the incidents are resolved in Site24x7, their associated tickets are automatically closed in ServiceDesk Plus.