Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

The essentials of monitoring AWS Elastic Load Balancing

AWS Elastic Load Balancing (ELB) dynamically distributes incoming application traffic across multiple EC2 instances and scales resources to meet traffic requirements. Elastic Load Balancing helps optimize the performance of various web and mobile applications by identifying failing EC2 instances before they affect the end-user experience.

ISO/IEC 20000 certification: What it is, why your organization needs it, and how to get it

One of the most important things that customers consider while purchasing a product or service is its credibility. A label that states the product has been tested, analyzed, and certified by an international regulatory body reassures a customer’s purchase decision. This is why organizations today strive to get themselves bench marked, differentiated, and validated. For this, they seek out regulatory bodies that develop and publish international standards.

GrafanaCONline Day 9 recap: Prometheus rate queries explained, and inside one company's adoption of a central telemetry platform

We’re into the third and last week of GrafanaCONline! We hope you’re able to check out all of our great online sessions. If you didn’t get a chance to watch yesterday’s sessions (or want to see them again), here’s a recap of day 9 of the conference.

The UX changes we made for Grafana 7.0 -- and what you can learn from them

Behind every part of Grafana, there are the ideas, creativity and commitment of the people who made it. While that includes code, it is not limited to it. Since August 2019, Grafana Labs has had a dedicated UX team, and we have been involved in countless recent features and improvements. We want to show you how we do our work, why you users are at the heart of everything we do – and most importantly, how design changes can make software better.

Kubernetes disaster prevention and recovery

Yeah, Kubernetes is great at making sure your workloads run as needed. But another of its amazing benefits is its ability to recover from failure all by itself. On an everyday basis, Kubernetes takes care of the complicated task of container orchestration. However, as with any complicated system, there is always the chance that you’ll experience failures and downtime.

Detect reverse shell with Falco and Sysdig Secure

Reverse shell is a way that attackers gain access to a victim’s system. In this article, you’ll learn how this attack works and how you can detect it using Falco, a CNCF project, as well as Sysdig Secure. Sometimes, an application vulnerability can be exploited in a way that allows an attacker to establish a reverse shell connection, which grants them interactive access to the system.

Azure dashboards: Azure Portal vs. SquaredUp (Part 1 - VMInsights)

Be it be on-premise or in Azure, monitoring your resources is extremely important. While the popular monitoring tool to use on premise remains to be SCOM, many organizations with a presence in the cloud have started to explore Azure Monitor to fulfil their monitoring needs in Azure. Recently, we discussed Azure Monitor and its capabilities extensively in our Azure Monitor Learning Path blog series.

Top Industry Performers in Unplanned Server Downtime | Q1 The Uptime Report

Can you be incompetent and still stay in business? Not as far as your web infrastructure is concerned. All the studies show that when a website is unavailable, or even just slow to load, customers go elsewhere—and often they don’t come back. After all, if you can’t keep a website up and running, why should people trust you to deliver any other product or service? So it’s worth asking: how reliable is your website relative to the top brands in your industry?

Tools for debugging apps on Google Kubernetes Engine

Editor’s note: This is a follow up to a recent post on how to use Cloud Logging with containerized applications running in Google Kubernetes Engine. In this post, we’ll focus on how DevOps teams can use Cloud Monitoring and Logging to find issues quickly. Running containerized apps on Google Kubernetes Engine (GKE) is a way for a DevOps team to focus on developing apps, rather than on the operational tasks required to run a secure, scalable and highly available Kubernetes cluster.

Understanding and building Kubernetes Custom Resource Definitions (CRDs)

So, let's say you had a service or application that was built on an orchestration platform such as Kubernetes. In doing so, you must also address an overflowing array of architectural issues, including security, multi-tenancy, API gateways, CLI, configuration management, and logging. Wouldn't you like to save some manpower and development time and focus on creating something unique to your problem? Well, it just so happens that your solution lies in what's called a Custom Resource Definition, or CRD.