Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Network Tips to Ensure a Successful AWS Migration

Join Kentik Cloud Solutions Architect, Ted Turner, and Kentik Solutions Architect, Jim Muggli, for this on-demand webinar discussion about the network solutions needed to ensure a successful AWS migration. Watch learn how to ensure your infrastructure team can overcome the top three AWS network challenges.

Network AF, Episode 6: Cat Gurinski on mentorship and the shared languages of network engineering

In the latest episode of the Network AF podcast, your host Avi Freedman welcomes his friend and networking pro Cat Gurinski to the show. As a senior network engineer with loads of experience, Cat is most passionate about automation and troubleshooting, and especially loves to use Python and Arista’s pyeapi frameworks in her pursuits. She’s also the current chair of the NANOG Program Committee, and previously worked for companies like Best Buy, Switch and Data, and Equinix.

Announcing the GA of Splunk APM's AlwaysOn Profiling

As an update to.conf’s announcement of our continuous code profiling preview, we’re excited to share that today Splunk APM’s AlwaysOn Profiling is generally available for Java applications, included in APM with no additional cost. Here’s a quick walkthrough of the feature, and how you can get started now.

How to Deploy the Splunk OpenTelemetry Collector to Gather Kubernetes Metrics

With Kubernetes emerging as a strong choice for container orchestration for many organizations, monitoring in Kubernetes environments is essential to application performance. Kubernetes allows developers to develop applications using distributed microservices introducing new challenges not present with traditional monolithic environments. Understanding your microservices environment requires understanding how requests traverse between different layers of the stack and across multiple services.

Plugin Spotlight: Exec & Execd

Telegraf comes included with over 200+ input plugins that collect metrics and events from a comprehensive list of sources. While these plugins cover a large number of use cases, Telegraf provides another mechanism to give users the power to meet nearly any use case: the Exec and Execd input plugins. These plugins allow users to collect metrics and events from custom commands and sources determined by the user.

TL;DR InfluxDB Tech Tips - Visualizing Uptime with Flux deadman() Function in InfluxDB Dashboards

A common DevOps use case involves alerting when hosts stop reporting metrics, aka a deadman alert. This can be done using the monitor.deadman() Flux function. One can easily create a deadman (or threshold) check in the InfluxDB UI Alerts section or craft a custom task to alert as well. Check out InfluxDB’s Checks and Notifications system post for more details. It’s also possible to use the monitor.deadman() function directly in a dashboard cell.

Superfast Troubleshooting of Network User Performance Issues

In our first edition of our Work From Anywhere series, we look at the value of troubleshooting end-user hardware and application issues. Exploring the granular detail that the solution provides, we look at how understanding information around the end-users hardware can help reduce mean-time-to-resolution and increasing productivity of service/support desk teams.