Key Kubernetes Metrics and Resources to Monitor for Peak Cluster Performance

Monitoring is not easy. Period. In our guide to Kubernetes monitoring we explained why monitoring Kubernetes requires a different approach than monitoring traditional VMs. In this blog post, we’ll go into more detail about the key Kubernetes metrics you have access to and how to make sense of them. Kubernetes is the most popular container orchestrator currently available. It’s available as a service across all major cloud providers. Kubernetes is now a household name.

A Guide to AWS Certifications

If you’re interested in cloud computing, AWS certifications are one of the most rewarding paths to a dynamic career. As a worldwide leader in cloud infrastructure services, Amazon prepares certified experts who are highly sought after by IT organizations around the world. Did you know that 94% of organizations use a cloud service and 30% of their IT budgets are allocated to cloud computing?

How to monitor Microsoft SQL Server with Prometheus

In this article, you will learn how to monitor SQL Server with Prometheus. SQL Server is a popular database that is very straightforward to monitor with a simple Prometheus exporter. Like all databases, SQL Server has many points of failure, such as delays in transactions or too many connections to the database. We are basing this guide on Golden Signals, a reduced set of metrics that offer a wide view of a service from a user or consumer perspective.

7 Ways SRE Is Changing IT Ops And How To Prepare For Those Changes

SRE best practices are disrupting and catalyzing change in the ways organizations approach IT Operations. In this blog we look at 7 ways SRE is driving this transition. Site Reliability Engineering is a new practice that has been growing in popularity among many businesses. Also known as SRE, this practice puts a premium on monitoring, tracking bugs, and creating systems and automations that solve problems in the long term.

What's new in Grafana Enterprise Metrics 1.3, our scalable, self-hosted Prometheus service

We built Grafana Enterprise Metrics (GEM) to empower centralized observability teams to provide a multi-tenanted, horizontally scalable Prometheus-as-a-Service experience for their end users. The GEM plugin for Grafana is a key piece of realizing this vision. It provides a point-and-click way for teams operating GEM to understand the state of their cluster and manage settings for each of the tenants within it.

Dashboard Server: Working with the SQL tile

In my previous blogs in the Dashboard Server Learning Path, we looked at working with the Web API tile and the PowerShell tile. In this instalment, let’s try the SQL tile. This tile will let you connect to any SQL database and run a SQL query straight from SquaredUp. This tile is also available in both the SquaredUp for SCOM and Azure products, so I have some familiarity with it already.

Logz.io and the AWS Distro for OpenTelemetry

Amazon Web Services has announced enhanced support for the open-source distribution of the OpenTelemetry project for its users. AWS Distro for OpenTelemetry (ADOT) now includes support for AWS Lambda layers for the most popular languages and additional partners integrated into the ADOT collector. And one of those partners is Logz.io! Logz.io is happy to announce that our exporter is now included in the AWS Distro for OpenTelemetry.

Extinguishing our performance fires and rebuilding for the future

I stepped into the role of Head of Engineering for Bitbucket Cloud in late 2020, having served as one of the team's senior engineering managers for several years. It is an honor and a privilege to lead this team, and I couldn't be prouder of the hard work we've done and continue to do each day to make Bitbucket a world-class product empowering teams to build, test, and deploy software to millions of people around the world. It has been an eventful journey, and the past few weeks are no exception.

Search for files and document contents in Mattermost

Finding the right information in Mattermost is critical to working smarter and being more productive. Searching in Mattermost now finds both relevant messages and files in your team’s conversation history. Search will return results for attachments that match the file name or contain matching text content within supported document types. File search is available today in Mattermost Cloud and in Mattermost Self-Managed v5.35 (available May 16), with mobile support coming soon.