Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

How Often Should You Ping Your Site?

How often should you ping your site? Should you be checking every few minutes, or every hour? Surely you have other ways to detect problems, so maybe just a daily check of your API and main page would be enough, right? While there’s no single right answer for everyone, this post tries to break down how you can find the right cadence for your site checks.

5 Steps to Troubleshoot Issues in Modern Networks

Networks are becoming more elastic, flexible, and agile than ever before. Organizations can now run network functions on commodity hardware, making network design and implementation less rigid and less expensive. By modernizing and virtualizing their networks, teams are able to increase capacity and improve security.

Elastic APM for iOS and Android Native apps

Elastic APM for native apps provides auto-instrumentation of outgoing HTTP requests and view-loads, captures custom events, errors, and crashes, and includes pre-built dashboards for data analysis and troubleshooting purposes. Elastic® APM for iOS and Android native apps is generally available in the stack release v8.12. The Elastic iOS and Android APM agents are open-source and have been developed on top of, i.e., as a distribution of, the OpenTelemetry Swift and Android SDK/API, respectively.

The Top 8 Network Monitoring Tools

Network monitoring is a process that supplies the information and data that network administrators need to determine, in real time, the status of their network and whether it is running optimally. This enables these administrators to work proactively to highlight deficiencies, enhance efficiency, and more. By utilizing network monitoring you can attain complete visibility into your network.

Resolving a Critical Incident in Core Banking: A Deep Dive into Application Patch Malfunction

In the dynamic environment of core banking systems, maintaining seamless operations is crucial. However, unforeseen complications can arise, leading to critical incidents that demand immediate and effective resolution. A recent incident involving an application patch malfunction presents a compelling study on the intricacies of managing and resolving system anomalies in real time.

Unlocking the Power of IIoT with Time Series Databases

This article was originally published on IIoT World and is reprinted here with permission. In the rapidly evolving world of Industrial Internet of Things (IIoT), organizations face numerous challenges when it comes to managing and analyzing the vast amounts of data generated by their industrial processes. Data generated by instrumented industrial equipment is consistent, predictable, and inherently time-stamped.

Building resilience in cloud: Strategies, advantages, and considerations

When it comes to cloud computing, resilience is an infrastructure's ability to bounce back from setbacks seamlessly, ensuring uninterrupted operations in the face of outages, malfunctions, software bugs, and even natural disasters. We'll explore measures you can take to enhance resilience in your cloud, plus discuss the advantages and limitations of building a resilient cloud system.

How Better DEX Benefits People + Performance: 3 Key Use Cases

Adopting a digital employee experience (DEX) solution delivers benefits for everyone in an organization, from the C-suite on down. A DEX solution provides contextual insights and intelligent automation capabilities that allow an IT team to proactively detect and resolve security vulnerabilities and other IT issues. This improves IT operations and an organization's cybersecurity and compliance posture.

Streamlining Cloud Operations by Unifying Security & Observability

Many companies are using cloud technologies to become more agile, scalable, and cost-effective during their digital transformation. However, this change brings new challenges in maintaining the security and performance of applications and infrastructure in the cloud. Security and observability go hand-in-hand.