Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Anodot Cloud Cost Update: Forecasting CostGPT and AWS Recommendations

Our Cloud Cost platform just got some practical upgrades to help you manage cloud costs better and boost your operational efficiency. Curious about the new features? Let’s jump right in! Forecast in ChatGPT Interacting with cloud cost data just got easier and smarter. Ask any cost-related question using natural language, and let CostGPT do the rest. It instantly delivers insightful visualizations and forecasts of your cloud costs.

As Technologies Continue to Evolve, How Do You Know What DX UIM Monitors?

DX UIM is designed to add monitoring for new technologies as IT operators adopt cutting-edge devices and services to improve their competitiveness, develop new services for their end users or customers, or increase cost efficiency. DX UIM currently supports monitoring and metric collection for more than 140 different technologies. Its architecture easily allows new technologies to be added to the list. How do you know if your technologies are covered? It’s Easy: In this 3-Minute Video Our DX UIM expert explains how to navigate to the Tech Docs section on our Support website.

Comparing Performance and Resource Usage: Grafana Agent vs. Prometheus Agent Mode vs. VictoriaMetrics vmagent

Monitoring and observability are critical components of modern IT infrastructures, enabling organizations to gain insights into the performance, health, and security of their systems. Agents play a crucial role in gathering and forwarding telemetry from various sources to observability platforms.

Track Errors in FastAPI for Python with AppSignal

When you first try a new library or framework, you are excited about it. However, as soon as you run something on production, things are less than ideal — an error here, an exception there - bugs everywhere! You start reading your logs, but you often lack context, like how often an error happens, in what line, etc. Fortunately, tools such as AppSignal can help. AppSignal helps you track your errors and gives you a lot of valuable insights.

Considerations for Active Monitoring from an SD-WAN Site

As companies adopt SD-WAN technologies, they increasingly rely on network services outside their control. The new reality is that network operations need end-to-end visibility on the network performance whether or not they own the infrastructure. In a 2023 EMA survey, 63% of companies report using the Internet as their primary WAN connectivity.

AI Explainer: Feature Extraction

In a previous blog post, which was a glossary of terms related to artificial intelligence, I included this brief definition of "feature extraction": Let’s go a bit deeper on that. In the ever-expanding landscape of machine learning, feature extraction stands out as a crucial technique for enhancing the performance of models and uncovering valuable insights from complex datasets.

Completing the Kubernetes Monitoring Puzzle

Kubernetes has changed the way many organizations approach the deployment of their applications. But despite its benefits, the additional layers of abstraction and reams of data can cause complexity around Kubernetes monitoring. We’ve seen so much of these challenges borne out in the results of the 2024 Observability Pulse survey. In the survey report, 36% of respondents say Kubernetes poses a challenge, and just 10% of organizations say they have full observability into their environments.

Microsoft SLA for Teams Telephony - 99.999% Uptime Guarantee

This week at Enterprise Connect, Microsoft announced many compelling new Teams features to drive productivity, collaboration and to simplify the lives of its users. One of the most noteworthy announcements is that Microsoft is now delivering a 99.999% Microsoft Teams SLA uptime guarantee for Teams telephony. This covers uptime for calls that take place over the PSTN, including Microsoft Teams Phone, Teams Calling Plans and Audio Conferencing.

Avoid flaky end-to-end tests due to poorly hydrated Frontends with Playwright's toPass()

In this video we'll dive into the world of flaky tests in Playwright and synthetic monitoring with Checkly. We examine a site with poor Frontend hydration patterns, their effect on test stability, and how to work around them. Learn how to avoid using artificial delays and implementing a retry mechanism with Playwright's 'toPass()' method to achieve stable testing instead.