Operations | Monitoring | ITSM | DevOps | Cloud

%term

How to Calculate TPS in Performance Testing: A Kubernetes Guide

Transactions-per-Second (TPS) is a valuable metric for evaluating system performance and is particularly relevant for engineers overseeing Kubernetes environments.TPS, alongside average response time, provides critical insights into system performance during load testing. This post covers two approaches to calculating TPS; a manual approach applicable in all environments, and an automatic Kubernetes-specific solution using production traffic replication.

Stop Using TCP Health Checks for Kubernetes Applications

As developers, one of the most important things we can consider when designing and building applications is the ability to know if our application is running in an ideal operating condition, or said another way: the ability to know whether or not your application is healthy. This is particularly important when deploying your application to Kubernetes. Kubernetes has the concept of container probes that, when used, can help ensure the health and availability of your application.

Cloud Control Ep #29 Open Source Revolution How CNCF is Reshaping Cloud Infrastructure

In this episode, our host, Shon Harris, introduces Chris Aniszczyk, CTO of the Cloud Native Computing Foundation (CNCF) and a seasoned professional in the cloud industry, who explores his journey and experiences. With a wealth of knowledge and expertise, Chris sheds light on various aspects of cloud computing, from the evolution of cloud technology to the latest trends and innovations. This episode offers a comprehensive overview of the industry and provides valuable insights for both aspiring and seasoned professionals in the field.

All about Explore Logs for Grafana Loki (Loki Community Call October 2024)

In this Community Call, Senior Software Engineer Trevor Whitney talks to us all about Explore Logs for Grafana Loki, an open-source app for visualizing logs from Loki in Grafana without needing to learn and write LogQL queries. He is joined by Senior Developer Advocates Nicole van der Hoeven and Jay Clifford. Community Calls are monthly meetings that are open to everyone interested in the development of Loki. They are an opportunity for software engineers working on Loki to discuss new features as well as for open-source users of Loki to ask questions.

Transforming Compliance and Operational Efficiency: A Success Story with Motadata AIOps APIs

In the fast-paced world of equity broking, compliance with regulatory requirements and operational efficiency are paramount. Broking platforms must ensure that their systems are not only continuously monitored, but their data is accurately reported to regulatory bodies. This is a story of how a leading equity broker from India leveraged our AIOps APIs to meet their regulatory compliance requirements while achieving operational excellence.

8 Most Common Latency Issues & How to Troubleshoot Them

Whether you’re a business running cloud-based applications, an educational institution facilitating virtual learning or a remote worker, latency issues can be a major roadblock. At a time when businesses and remote workers depend heavily on cloud services, real-time communication tools like Zoom, and collaboration platforms such as Microsoft Teams, even a slight delay in network performance can disrupt workflows, cause frustration, and hinder overall efficiency.

Moving SharePoint Online Documents to Azure Blob Storage

Managing large volumes of documents in SharePoint Online can become challenging as storage needs grow. With rising storage costs and limits, organizations often look for efficient ways to offload files. One highly scalable and cost-effective solution is moving documents from SharePoint Online to Azure Blob Storage.

Five Playwright CLI features you should know

Thanks to Microsoft's Playwright, running end-to-end tests with real browsers is quickly done. Initialize a new Playwright project, install all the dependencies, and off you go! Then, any new headless browser test run is only one npx playwright test away. But have you checked all the test command's CLI options? playwright test includes a few real gems to help you create better tests faster. Let me share a mixed bag of my favorite CLI tricks in this post.

Comprehensive Observability: Key Availability and Reliability Metrics to Monitor in Cloud Environments

Strong observability in cloud environments is essential for monitoring the health of interconnected systems. Unlike traditional monitoring, which is limited to specific cloud stacks or devices, observability provides comprehensive visibility across the entire hybrid IT infrastructure including applications, IT systems and services.

Webinar Recap | Next Gen Log Management: Maximize Log Value with Telemetry Pipelines

During our webinar, Next Gen Log Management: Maximize Log Value with Telemetry Pipelines, we discussed how you can take your log management strategy to the next level with telemetry pipelines and unlock the full potential of your data. Bill explained that the rapid growth of log data is driving up storage and management costs. He emphasized the need for an intelligent, adaptable log management system to efficiently handle this situation.