Operations | Monitoring | ITSM | DevOps | Cloud

Pragmatic Incident Response: 3 Lessons Learned from Failures

In my past experience as an SRE I’ve learned some valuable lessons about how to respond and learn from incidents. Declare and run retros for the small incidents. It's less stressful, and action items become much more actionable. Decrease the time it takes to analyze an incident. You'll remember more, and will learn more from the incident. Alert on pain felt by people — not computers. The only reason we declare incidents at all is because of the people on the other side of them.

Monitoring Apache Kafka Clusters with Sumo Logic

Apache Kafka® is one of the most popular streaming and messaging platforms, commonly used in a pub-sub (publish-subscribe) model, where consumer software applications send data via messages that producer software applications can consume. Teams use Kafka for a variety of use cases, including monitoring user activity, sending notifications, and concurrently processing streams of incoming data such as financial transactions.

Accelerate Hybrid Threat Protection Using Sumo Logic Cloud SIEM powered by AWS

It has been off to the races for the Sumo Logic and AWS teams since the general availability of the Sumo Logic Cloud SIEM powered by AWS solution on June 1, 2021. We are excited for the overwhelming response from customers from across all segments, industries and geographies.

The Business Case for Switching from the ELK Stack

Last year we published a popular paper on how to calculate the true cost of an Elasticsearch, or ELK (for Elasticsearch, Logstash, Kibana) stack environment. The paper helps readers calculate their overall annual cost of ownership for their ELK environment, and reveals how the cost burden of ELK is much higher than anticipated for most customers. That paper clearly hit a nerve — it’s been, by far, our most downloaded piece of content.

Accountable but Not Informed - Bring Clarity to Your Desktop Virtualization Environments

Why is it that when IT has to manage a virtual desktop environment, their job becomes infinitely harder? If you were to poll every major enterprise IT department, there’s always one team (or person) that’s ultimately held accountable for the organization’s Digital Employee Experience.

Google's Core Web Vitals: LCP, FID & CLS explained

You may or may not have heard of Google Core Web Vitals, but the importance of getting them right for your website is like Everest for website owners right now. So what are the Core Web Vitals and what should you do to make sure my website meets them? Google Core Web Vitals consist of 3 components that relate to page responsiveness, speed, stability, and how they affect the user experience. Already scrambling to Google what these mean? Don’t worry, all 3 will be explained in detail below.

How and Why Has Cloud Computing Surged During the Last Few Years?

Cloud computing is undergoing a surge in adoption. In 2020, more money was spent on cloud computing than on-site data centers - a huge milestone. Increasingly, computing and storage are being routed off-site. How and why is this transition taking place? This article takes a quick look at some of the factors driving this watershed moment.

Managing reusable pipeline configuration with object parameters

CircleCI pipelines are defined in configuration files using the YAML syntax, which has been widely adopted by many software tools and solutions. YAML is a human-readable declarative data structure commonly used in configuration files and in applications where data is being stored or transmitted. The data in pipeline configuration files specifies and controls how workflows and jobs are executed when triggered on the platform.

Intro to AIOps: Leveraging AI and Machine Learning in DevOps

AIOps is a DevOps strategy that brings the power of machine learning to bear on observability and system management. It’s not surprising that an increasing number of companies are now adopting this approach. AIOps first came onto the scene in 2015 (coincidentally the same year as Coralogix) and has been gaining momentum for the past half-decade. In this post, we’ll talk about what AIOps is, and why a business might want to use it for their log analytics.