Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

No More False Alerts at Night

Do you know this situation? You are on-call and in the middle of the night you get a phone call. Loud enough to wake you up. Loud enough to wake your wife up, as well. You stand up and check your emails to see what the problem is. OK, you got it. Then you log on to the console of your monitoring tool and – green. Green? False alert? Why did you get the call then? After double-checking, still a bit sleepy, you recognize that the problem has been recovered automatically.

Use proxy to process complex or aggregated data

Imagine you need a monitor to react to a derivative of several performance values. For example, you could need to only trigger alert if CPU load and free memory have both crossed certain thresholds. If those monitors are related to the same host, you can always use generic monitor type, such as Script or Program, Python script etc. and do whatever math is required. What should you do if the performance values can only be taken from different hosts? There are several solutions.

Kubernetes as a New Standard for Infrastructure Management

For IT teams inside large organizations used to managing any number of operating environments, Kubernetes is a breath of fresh, standardizing air. Forget its origins, forget any excitement over containers or microservices, and forget the sprawling ecosystem of related projects. What has some folks charged with managing Kubernetes deployments really excited is the prospect of managing all application infrastructure essentially the same way.

How to ensure network compliance in this era of remote work: The ITOM Podcast [Episode 5]

The ITOM Podcast returns with yet another episode to help you eliminate all the remote work woes in your IT environment. In the last episode, we discussed in detail about the key areas enterprises need to monitor to ensure endpoint security, measures to adopt to ensure cybersecurity while rolling out BYOD policies, and the crucial factors that IT leaders need to know about in the post-pandemic era with respect to IT security.

Building and deploying a Docker image to a Kubernetes cluster

Deploying Docker images to Kubernetes is a great way to run your application in an easily scalable way. Getting started with your first Kubernetes deployment can be a little daunting if you are new to Docker and Kubernetes, but with a little bit of preparation, your application will be running in no time. In this blog post, we will cover the basic steps needed to build Docker images and deploy them to a Kubernetes cluster.

Essential Observability Techniques for Continuous Delivery

Observability is an indispensable concept in continuous delivery, but it can be a little bewildering. Luckily for us, there are a number of tools and techniques to make our job easier! One way to aid in improving observability in a continuous delivery environment is by monitoring and analyzing key metrics from builds and deploys. With tools such as Prometheus and their integrations into CI/CD pipelines, gathering and analysis of metrics is simple. Tracking these things early on is essential.

Is your ITSM platform scaling with your operations? This Case Study will help you find the answer.

The client in consideration is one of the largest engineering, procurement, and construction (EPC) companies in India. With projects covering over 14 states in the country, its portfolio of services and offerings is widely diversified. It primarily engages in three forms of businesses – civil construction, road & highway development, and manufacturing.

Full Observability with Your Node.js App

Javascript is a pretty prolific programming language, used daily by people visiting any number of websites and web applications. NodeJS, it’s server-side version, is also used all over the place. You’ll find it deployed as full application stacks to functions in things like AWS Lambda, or even as IoT processes with things like Johnny Five. So when we think about Observability in the context of a nodejs stack, how do we set it up and get the information flowing?

Complete Guide to Script Errors: What Are They and How to Fix Them

Do you use an error tracking tool for your website? Have you ever tried to debug a problem that a user reported, and when you looked at your error tracking tool the reported error was simply: “Script Error”? Such errors are impossible to further debug because there is no additional information apart from that message.