Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

A Cool Milestone for Monitoring as Code: Checkly Recognized a THIRD Time by Gartner!

Hello, Checkly community and Monitoring as Code (MaC) aficionados! We have some exhilarating news that we can't wait to share. Our mascot is sporting sunglasses today because Checkly has been named in Gartner®'s 2023 Cool Vendors in Monitoring and Observability: Where Awareness Meets Understanding report!

Predictive Maintenance: What's the Economic Value?

The global predictive maintenance market is expected to grow to $6.3 billion by 2022, according to a report by Market Research Future. However, a new paradigm is required for analyzing real-time IoT data. Predictive maintenance, the use of data-driven analytics to optimize capital equipment upkeep, is already used, or will be used within the next two years, by 83 percent of manufacturing companies.

How Vydia Uses Serverless with Stackery

Vydia is dedicated to helping creators gain more control over their audio and video content with a centralized tool for distributing, managing, protecting, and optimizing AV files. Vydia’s software team describes itself as a “DevOps team first and foremost,” delivering new features and updates in a tight loop. The team is always in search of new ways to improve and modernize the development process.

Taloflow Founder Presents at Vancouver AWS Meetup on Driving AWS Infrastructure Insights into Kafka

Last week, our CTO, Todd Kesselman, presented on "Driving AWS Infrastructure Insights into Kafka" in downtown Vancouver, Canada. In his presentation, he revealed an unobtrusive way to share a wide range of operational information between organizations in a way that can easily be incorporated into your event pipeline. The featured technology is the AWS Event Bus. To clarify, the Event Bus is a message bus that enables multiple AWS accounts to publish and receive events to and from each other.

Intel's latest security vulnerability - our steps and yours

This week Intel released a statement regarding Microarchitectural Data Sampling (MDS), another vulnerability in the "speculative execution" feature of modern processors. It relates to HyperThreading, the feature that allows the CPU to work out which commands will run next and whether they would affect the currently running command and, if they would not, to run them on the same core.

Five Things Your APM Platform Should Do for Your Container Application Deployments

One of the chief complexities in running large-scale containerized applications is the need for continuous system and application monitoring. Containers are very different from traditional VMs and the 3-tier applications that run on them. Monitoring needs to ensure that the SLAs promised to the business are being met, and it must also forecast usage trends while identifying problem areas such as bugs, capacity challenges, slowing performance, and potential downtime.

Dynamic Sampling by Example

Last week, Rachel published a guide describing the advantages of dynamic sampling. In it, we discussed varying sample rates to achieve a target collection rate overall, and having different sample rates for distinct kinds of keys. We also teased the idea of combining the two techniques to preserve the most important events and traces for debugging without drowning them out in a sea of noise.
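As a rough illustration of combining the two ideas (a target collection rate plus distinct rates per key), here is a minimal Python sketch. It is not taken from the guide itself: the function names, the equal-share budgeting rule, and the example traffic counts are all assumptions made for illustration.

```python
import random
from collections import Counter


def per_key_sample_rates(counts, target_total):
    """Return a 1-in-N sample rate per key.

    Assumption: the sampled-event budget (target_total) is split evenly
    across keys, so rare keys are kept more often than common ones.
    """
    if not counts:
        return {}
    share = target_total / len(counts)  # sampled-event budget per key
    return {key: max(1, round(n / share)) for key, n in counts.items()}


def should_keep(key, rates, rng=random):
    """Keep roughly 1 in rates[key] events; unknown keys are always kept."""
    rate = rates.get(key, 1)
    return rng.random() < 1.0 / rate


# Hypothetical traffic seen in the previous window, keyed by HTTP status.
window_counts = Counter({"200": 9000, "404": 900, "500": 100})
rates = per_key_sample_rates(window_counts, target_total=300)
```

With these made-up counts, the common "200" events are sampled far more aggressively than the rare "500" errors, which are all kept. A real system would also record each event's sample rate so that aggregate counts can be reconstructed later.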

Why Your Lambda Functions May Be Doomed To Fail

AWS Lambda has a cool feature that can be both a blessing and a nightmare for a serverless application, depending on whether it’s properly handled by our code: the retry behavior. A retry occurs when an invocation of a Lambda function results in an error and the AWS Lambda platform automatically invokes the function again, with the same event payload. Before we get deeper, make sure you are familiar with the AWS documentation on the subject.
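One common way to make a function safe under this retry behavior is to make the handler idempotent, so that a repeated delivery of the same event payload does no extra work. The sketch below is an illustration only, not the article's code: the `id` field, the `charge_customer` helper, and the in-memory set are hypothetical, and a real deployment would need a durable external store because separate Lambda containers do not share memory.

```python
# In-memory stand-in for a durable store (for example, a database table).
# Assumption: production code would use external storage, since state held
# in memory is not shared between Lambda execution environments.
processed_ids = set()
side_effects = []  # records work actually performed, for illustration


def charge_customer(event):
    """Hypothetical side effect that must not happen twice."""
    side_effects.append(event["id"])


def handler(event, context):
    """Lambda-style entry point that tolerates repeated deliveries."""
    event_id = event["id"]  # hypothetical unique key carried in the payload
    if event_id in processed_ids:
        return {"status": "duplicate", "id": event_id}
    charge_customer(event)
    processed_ids.add(event_id)
    return {"status": "processed", "id": event_id}


# Simulate the platform invoking the function again with the same payload.
first = handler({"id": "evt-1"}, None)
retry = handler({"id": "evt-1"}, None)
```

Here the second invocation is recognized as a duplicate and the side effect runs only once.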

Alert escalation - How it works in SIGNL4

Part of any manager's role is to make sure their team takes accountability. Managers are not the front-line resolvers who handle issues; that is what they have a team for. However, managers do need to be aware of incidents as they occur and to make sure their team is taking ownership of and resolving those issues. SIGNL4 takes the managerial work out of being a manager by providing alert ownership transparency.