The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.
In the latest episode of the Network AF podcast, your host Avi Freedman welcomes his friend and networking pro Cat Gurinski to the show. As a senior network engineer with loads of experience, Cat is most passionate about automation and troubleshooting, and especially loves to use Python and Arista’s pyeapi frameworks in her pursuits. She’s also the current chair of the NANOG Program Committee, and previously worked for companies like Best Buy, Switch and Data, and Equinix.
I have a good sense of how to use traces to understand my system’s behavior within request/response cycles. What about multi-request processes? What about async tasks spawned within a request? Is there a higher-level or more holistic approach?
As an update to.conf’s announcement of our continuous code profiling preview, we’re excited to share that today Splunk APM’s AlwaysOn Profiling is generally available for Java applications, included in APM with no additional cost. Here’s a quick walkthrough of the feature, and how you can get started now.
With Kubernetes emerging as a strong choice for container orchestration for many organizations, monitoring in Kubernetes environments is essential to application performance. Kubernetes allows developers to develop applications using distributed microservices introducing new challenges not present with traditional monolithic environments. Understanding your microservices environment requires understanding how requests traverse between different layers of the stack and across multiple services.
If you are trying to compare all of the best solutions for application performance monitoring and management you may have found that it can be highly complicated to compare all of the available observability tools whilst also trying to keep within a reasonable budget.
Telegraf comes included with over 200+ input plugins that collect metrics and events from a comprehensive list of sources. While these plugins cover a large number of use cases, Telegraf provides another mechanism to give users the power to meet nearly any use case: the Exec and Execd input plugins. These plugins allow users to collect metrics and events from custom commands and sources determined by the user.
A common DevOps use case involves alerting when hosts stop reporting metrics, aka a deadman alert. This can be done using the monitor.deadman() Flux function. One can easily create a deadman (or threshold) check in the InfluxDB UI Alerts section or craft a custom task to alert as well. Check out InfluxDB’s Checks and Notifications system post for more details. It’s also possible to use the monitor.deadman() function directly in a dashboard cell.
I can’t remember the last time I drove down highway 101 between San Francisco and the South Bay and didn’t see a billboard claiming to be the single tool to solve all of my data problems.