Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Synthetic Monitoring vs. APM Stack Trace Tools

The complexity of an application’s digital architecture is increasing dramatically every day. In an era of cloud infrastructure, the goal is to integrate all your web services into one place: CDN, DNS, third-party API services, QA tools, analytics tools, and any other component you can think of work together to make your services function. With so many components needed for an application to run, each one behaves like its own black box within your IT infrastructure.

Common Operations Problems Solved by OpsRamp Discovery and Monitoring

OpsRamp provides hundreds of out-of-the-box IT infrastructure monitoring templates that capture behavioral and performance metrics for applications, servers, networks, storage, and database instances across hybrid and multi-cloud environments. With these templates and powerful AIOps capabilities, modern IT operations teams can leverage both native monitors (pre-built instrumentation for managing IT infrastructure) and custom monitors (user-defined instrumentation for specialized workloads) for proactive IT operations management as a service and responsive troubleshooting.

GrafanaCONline Day 7 recap: The past, present and future of Loki, and making dashboards that tell stories

GrafanaCONline is live! We hope you’re able to check out all of our great online sessions. If you aren’t up-to-date on the presentations, here’s what you missed on day 7 of the conference.

New in Grafana 7.0: Trace viewer and integrations with Jaeger and Zipkin

Moving to a scalable, distributed microservice architecture poses many challenges for any organization. It gets harder to understand the system and pinpoint where errors originate. Logs get much messier, and stitching together a coherent picture of a particular request can be time-consuming or downright impossible. Distributed tracing can help with all of that.

A note of appreciation from our CEO - Business Continuity Edition

Dear SCOM community, It's been 8 weeks since we launched the free Business Continuity edition (BCE) of our SCOM dashboarding product, and I’m glad to see that many community members have taken us up on the offer. The early feedback we received was encouraging and heartening, and we are happy to have helped ease the pressure on IT teams during these uncertain times, in however small a way.

How to Monitor if a Process is Running

PA Server Monitor's process monitor checks how many instances of a specified target process are running on Windows or Linux servers. It then compares that count to the threshold and fires actions as needed. The process may be running locally or remotely. PA Server Monitor can monitor remote processes on Windows servers via WMI or SNMP, as well as processes on remote Linux/Unix servers via SNMP. Process up or down data is recorded every time the monitor runs. You can define a time period, and optionally a summarization (hourly, daily, weekly, monthly), to create an uptime report for the process.
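The count-and-compare logic and the uptime summarization described above can be sketched in a few lines of Python. This is only an illustration of the idea, not PA Server Monitor's implementation: the function names, the `min_instances` parameter, and the sample data are all hypothetical, and a real collector would obtain the process list via WMI or SNMP rather than from a hard-coded list.

```python
from collections import Counter
from datetime import datetime


def count_instances(process_names, target):
    """Count how many running processes match the target name (case-insensitive)."""
    return sum(1 for name in process_names if name.lower() == target.lower())


def check_threshold(count, min_instances=1):
    """Return "up" when at least min_instances copies are running, else "down"."""
    return "up" if count >= min_instances else "down"


def daily_uptime(samples):
    """Summarize (timestamp, status) samples into percent-up per calendar day."""
    totals, ups = Counter(), Counter()
    for ts, status in samples:
        day = ts.date().isoformat()
        totals[day] += 1
        if status == "up":
            ups[day] += 1
    return {day: 100.0 * ups[day] / totals[day] for day in totals}


# Hypothetical snapshot of process names, as a collector might return them.
snapshot = ["nginx", "nginx", "sshd", "cron"]
status = check_threshold(count_instances(snapshot, "nginx"), min_instances=2)

# Hypothetical up/down samples recorded each time the monitor ran.
samples = [
    (datetime(2020, 5, 20, 9, 0), "up"),
    (datetime(2020, 5, 20, 9, 5), "down"),
    (datetime(2020, 5, 21, 9, 0), "up"),
]
report = daily_uptime(samples)
```

Swapping the day key for an hour, week, or month key would give the other summarization levels mentioned above.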

Add Event ID and Text Filter to Event Log Monitor

How to Audit Windows Logons and Logon Failures

When a user logs on to a Windows computer, or fails to log on, an event can be written to the Windows Event Log. This feature is built in to Windows. The Event Log monitor in PA Server Monitor can tell you when one of these events occurs, thus alerting you to a server logon or a failed server logon. And because the Event Log monitor has a configurable monitoring cycle (the Schedule button in the lower right corner), you can find out about the logon in nearly real time.
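As a rough sketch of what an event ID and text filter does, the Python below filters a list of event records by ID and by a case-insensitive substring. The `filter_events` function and the dictionary layout are assumptions made for illustration, not PA Server Monitor's configuration or API. The sample records use the Windows Security log IDs 4624 (successful logon) and 4625 (failed logon).

```python
def filter_events(events, event_ids=None, text=None):
    """Return events whose ID is in event_ids and whose message contains text.

    Either filter may be None, in which case it is not applied.
    """
    matches = []
    for event in events:
        if event_ids is not None and event["id"] not in event_ids:
            continue
        if text is not None and text.lower() not in event["message"].lower():
            continue
        matches.append(event)
    return matches


# Sample records; a real monitor would read these from the Windows Event Log.
events = [
    {"id": 4624, "message": "An account was successfully logged on."},
    {"id": 4625, "message": "An account failed to log on."},
    {"id": 7036, "message": "The Print Spooler service entered the running state."},
]

failed = filter_events(events, event_ids={4625})
logons = filter_events(events, event_ids={4624, 4625}, text="account")
```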

How to Monitor Anti-Virus and Alert

The Inventory Alerter monitor in PA Server Monitor can help you watch for changes in your Anti-Virus software. The Inventory Collector collects system information, including Anti-Virus product information. With the Inventory Alerter monitor, you can then alert when the running status changes, when the Pattern File Date is out of date, or when the Version changes.

How to Create a Graph Using Tags and Time Aggregation | Datadog Tips & Tricks

In this video, you’ll learn how to use tag-based grouping and time aggregation (with the rollup function) to create actionable time-series graphs. Datadog offers various ways to manipulate your metric graphs so that they are specific and actionable for all of your use cases; this video explores two of them, tag-based grouping and time aggregation.
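To make the two ideas concrete, here is a small Python sketch of grouping metric points by a tag and rolling them up into fixed time buckets. It is a conceptual illustration only and does not use Datadog's query language or API: the `rollup_by_tag` function, the point layout, and the sample values are all hypothetical.

```python
from collections import defaultdict


def rollup_by_tag(points, tag, interval, agg):
    """Group points by one tag's value, bucket by time interval, then aggregate.

    points   -- iterable of (timestamp_seconds, value, tags_dict)
    tag      -- tag key to group by (for example "host")
    interval -- bucket width in seconds
    agg      -- function reducing a list of values to one number
    """
    buckets = defaultdict(list)
    for ts, value, tags in points:
        bucket_start = ts - (ts % interval)  # align to the start of the bucket
        buckets[(tags.get(tag), bucket_start)].append(value)
    return {key: agg(values) for key, values in buckets.items()}


def average(values):
    return sum(values) / len(values)


# Hypothetical metric points: (timestamp in seconds, value, tags).
points = [
    (0, 10.0, {"host": "a"}),
    (30, 20.0, {"host": "a"}),
    (60, 40.0, {"host": "a"}),
    (10, 5.0, {"host": "b"}),
]

# One series per host, averaged over 60-second buckets.
series = rollup_by_tag(points, "host", 60, average)
```

Passing `max` or `sum` as `agg` changes the aggregation method, and a wider `interval` produces a smoother, coarser graph.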

How To Monitor Containers in Real-Time with Datadog Live Containers | Datadog Tips & Tricks

In this video, you’ll learn how to use Datadog’s Live Container View to monitor and troubleshoot the performance of the containers underlying your applications. Datadog makes it easy to monitor ephemeral, containerized infrastructure. Using this view, you can sort and group your containers by tags or labels imported from Kubernetes, such as container name.