Latest Posts

With AppNeta, ResultsCX Decreases Network Performance Triage Time by 90%

In order to deliver its differentiated, boutique level of customer care services, the team at ResultsCX has had to navigate some challenges in recent years that teams in many organizations can relate to. The organization relies extensively and constantly on its network connections—and outages and poor performance can be a big problem. This post offers an introduction to the challenges the company was facing, and it reveals how AppNeta by Broadcom delivered the solution they needed.

Introducing a New, Zero-Touch Way to Manage Your DX NetOps Upgrades

For every customer who has an existing DX NetOps solution deployed, an upgrade can be a daunting task. Even for seasoned administrators, the process of logging into each box, running the pre-checks, and then executing the installers can be tedious. With the solution’s support for zero-touch administration (ZTA), the effort becomes easier. Now, you can plan, test, and then finally upgrade your deployment versions in one session.

What is Enterprise Incident Management? Process and Software

Enterprise Incident Management (EIM) is a game-changer for organizations that want to keep their IT operations running smoothly. Whether it's a minor glitch or a full-blown system outage, managing incidents efficiently is crucial to minimizing downtime and keeping your business on track. But what exactly is Enterprise Incident Management, and why should you care?

Understanding Network Traffic Blockages in AWS

In this post, explore the challenges of diagnosing network traffic blockages in AWS, which stem from the complex and dynamic nature of cloud networks. Learn how Kentik addresses these issues by integrating AWS flow data, metrics, and security policies into a single view, allowing engineers to quickly identify the source of blockages, enhancing visibility and speeding up the resolution process.

PIR in Incident Management: How to Conduct a Successful Review

Incidents are inevitable. No matter how well-prepared your team is, something will eventually go wrong. But what separates high-performing IT teams from the rest is how they handle these incidents after the dust settles. Enter the Post-Incident Review (PIR) in Incident Management—a crucial process that not only helps teams understand what went wrong but also ensures that they’re better prepared next time.

How to Build an Effective DevOps Roadmap: A CTO Guide

As a CTO stepping into the fast-paced world of startups or scaling tech organizations, one of your first and most critical tasks is to chart a clear DevOps roadmap. This roadmap isn’t just a technical guideline—it’s a strategic blueprint that aligns your tech initiatives with your business goals. To succeed, your approach needs to be systematic, focusing on immediate wins while building a foundation for long-term scalability and resilience.

Syslog 101: Everything You Need To Know

System logging protocol, abbreviated as Syslog, is a standard protocol used for message logging. Put simply, it is a standard for collecting and storing log information. A Syslog server collects, parses, stores, examines, and dispatches log messages from devices including routers, switches, firewalls, Linux/Unix hosts, and Windows machines.
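
As a rough illustration of the parsing step a Syslog server performs, the sketch below decodes the priority (PRI) field that prefixes a classic BSD-style syslog message. The PRI value encodes facility * 8 + severity. The function name `parse_pri` and the sample message are illustrative assumptions, not taken from the post, and a real server handles far more than this one field.

```python
import re

# Severity names in numeric order (0 = most severe).
SEVERITIES = ["emerg", "alert", "crit", "err", "warning", "notice", "info", "debug"]

# A message starts with "<PRI>", where PRI is a decimal number of 1 to 3 digits.
_PRI_PATTERN = re.compile(r"^<(\d{1,3})>(.*)$", re.DOTALL)


def parse_pri(line):
    """Split a raw syslog line into (facility, severity_name, rest).

    Hypothetical helper for illustration: it only decodes the PRI
    prefix and leaves the timestamp, host, and message text untouched.
    """
    match = _PRI_PATTERN.match(line)
    if match is None:
        raise ValueError("line does not start with a <PRI> field")
    pri = int(match.group(1))
    if pri > 191:  # highest valid value: facility 23, severity 7
        raise ValueError("PRI value out of range")
    # PRI = facility * 8 + severity, so divmod recovers both parts.
    facility, severity = divmod(pri, 8)
    return facility, SEVERITIES[severity], match.group(2)


if __name__ == "__main__":
    sample = "<34>Oct 11 22:14:15 mymachine su: 'su root' failed"
    print(parse_pri(sample))
```

Here a PRI of 34 decodes to facility 4 and severity 2 (critical), since 34 = 4 * 8 + 2.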

Observability vs Monitoring [Understanding the Key Differences in 2024]

When systems fail, it's not just a technical hiccup – it's a business problem. Downtime means unhappy customers and lost revenue. That's why teams need effective ways to spot issues fast and fix them even faster. This is where monitoring and observability come into play. Monitoring and observability are two key approaches to keeping your systems running smoothly. Monitoring is like your system's alarm bell – it tells you when something's wrong.

OpsRamp and HPE-One Year Later: An Analyst's Perspective

In March 2023, Hewlett Packard Enterprise (‘HPE’) announced the acquisition of OpsRamp, subsequently closing the deal in May that year. Founded in 2014, OpsRamp is an award-winning solution that enables IT operations, site reliability engineering (SRE), cloud operations, and DevOps teams, along with other stakeholders, to better detect, remediate, predict, and prevent slowdowns and outages across physical, virtual, and cloud systems.

Streamline code quality: Integrating SonarCloud and SonarQube scanning with Kosli for automated compliance

Static code analysis is an important part of testing your software to ensure it is release-ready. In contrast to dynamic testing, which involves executing your code to find errors, static analysis uses automated tools to “look” through the code, without executing it, to find potential errors (including potential security issues) and bugs. Since the code does not need to be executed, static testing can begin much earlier in development than dynamic testing.
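
To make the idea of inspecting code without executing it concrete, here is a minimal sketch using Python's standard `ast` module. It is not how SonarCloud, SonarQube, or Kosli work internally; the function name `find_eval_calls` and the single rule it checks (direct calls to `eval`) are assumptions chosen purely for illustration.

```python
import ast


def find_eval_calls(source):
    """Return the sorted line numbers of direct eval(...) calls in source.

    The source is only parsed into a syntax tree; it is never executed.
    Hypothetical single-rule checker, for illustration only.
    """
    tree = ast.parse(source)
    hits = []
    for node in ast.walk(tree):
        # Match calls whose callee is the bare name "eval".
        if (
            isinstance(node, ast.Call)
            and isinstance(node.func, ast.Name)
            and node.func.id == "eval"
        ):
            hits.append(node.lineno)
    return sorted(hits)


if __name__ == "__main__":
    code = "x = 1\ny = eval('x + 1')\nprint(y)\n"
    print(find_eval_calls(code))
```

Because the checker only parses the source, it can run on code that would fail or have side effects if executed, which is why this kind of check can start early in development.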