Operations | Monitoring | ITSM | DevOps | Cloud

AIOps

The latest News and Information on AIOps, alerting in complex systems and related technologies.

Networking Field Day 35: Selector AI Introduction with Debashis Mohanty

Selector's customer base includes 50 deployments across service providers as well as large enterprises in retail, media distribution, colocation services, and multi-cloud networking services. These customers aim to correlate events across their network, applications, and infrastructure; eliminate the need for human intervention in RCS and remediation; and democratize access to insights using conversational natural language interfaces. Selector delivers on these outcomes, while accelerating incident remediation through smart, actionable alerting and a GenAI-based conversational interface.

Networking Field Day 35: Solving the Query Problem with Selector AI

Selector translates English phrases to SQL queries through the use of an LLM. Each SQL query includes the table, or data set to be searched, along with filters, or conditions which prune the search results. We walk through a number of SQL queries and sample search results, before considering the LLM-based translation of a sample English phrase processed by Selector.

Networking Field Day 35: Selector AI and the Workings of an LLM

An LLM differs from a function in that it takes output and imputes, or infers, a function and its arguments. We first consider how this process works within Selector for an English phrase converted to a query. We then step through the design of Selector's LLM, which relies on a base LLM trained with English phrases and SQL translation, then fine-tuned, on-premises, with customer-specific entities. In this way, each of Selector's deployments relies on an LLM tailored to the customer at hand.

AI-powered incident management copilots: A guide

All eyes are on generative AI. Enterprise IT teams are looking to Gen AI to translate the high volume of data from their services architecture into actionable insights. The goal: Improve operational efficiency and quality of work. But it’s challenging to sort through the hype (and confusion) to identify which vendors have GenAI capabilities that can provide true impact and value to their IT and service operations. One capability in particular is AI-powered copilots.

Elevate Digital Employee Experience with Advanced Workspace Management

In today’s dynamic IT environment, effective Digital Workspace Management and Digital Experience Monitoring (DEM) are critical for maintaining operational efficiency and optimizing Digital Employee Experience. For IT Operations and Service Desk teams, navigating the complexities of hybrid work environment and ensuring seamless service delivery is challenging now more than ever.

How Nationwide Building Society boosted system resiliency & saved $1 2M with Digitate

Join us for an insightful conversation with Andrew Pringle, Delivery Lead at Nationwide Building Society (NBS), as we dive into how Nationwide transformed their system resiliency and achieved substantial savings. By partnering with Digitate, NBS identified 50 critical scenarios to monitor and alert in their core customer data systems, resulting in enhanced reliability and cost savings of $1.2 million.

Improving documentation with content reuse

Anyone who’s worked in a customer-facing role knows the pressure to find the correct answers quickly. Emotions are high when something is broken, or there’s an outage. The customer is angry. You’re stressed. And your boss is watching and wondering why the problem hasn’t been fixed. You need to troubleshoot quickly and provide the right information ASAP. As a support professional, you want to give customers and stakeholders the best possible experience.

Why Full AI-Stack Visibility is Key to High-Performing GPUs and AI Models

The generative AI market is poised to explode. From AI-based co-pilots and assistants to new use cases across healthcare, marketing, sales, software development, and more, generative AI is unleashing a new wave of productivity, efficiency, and transformative employee and customer experiences.

Crowdstrike outage and Security Posture Management with Descriptive Analytics

Last Updated on 15 hours The recent outage caused by Crowdstrike on Jul 18, 2024 has proved how the fallout was unforeseen and unthinkable, across the globe. In this era of zero trust, the leading cyber security company Crowdstrike sent an update to its Falcon sensor agent and another IT leader Microsoft which had Crowdstrike sensors installed crashed with Blue Screen of Death(BSOD) as soon as the update was received caused by a null pointer issue.

Six ways Australian local government IT teams can benefit from AIOps in monitoring

Running IT operations in an Australian city council is a complex role that faces a unique set of challenges and opportunities. Typically, a city council in an advanced country like Australia runs its IT on a hybrid model, with a combination of continuing on-premise installations working in tandem with modern cloud platforms, such as Azure.