Operations | Monitoring | ITSM | DevOps | Cloud

Our first ML based anomaly alert

Over the last few years we have slowly and methodically been building out the ML based capabilities of the Netdata agent, dogfooding and iterating as we go. To date, these features have mostly been somewhat reactive and tools to aid once you are already troubleshooting. Now we feel we are ready to take a first gentle step into some more proactive use cases, starting with a simple node level anomaly rate alert. note You can read a bit more about our ML journey in our ML related blog posts.

Unlocking IT: Considerations for a Powerful Observability Strategy

In today's cloud-native landscapes, observability is more than a buzzword; it's a critical element for software development teams looking to master the complexities of modern environments like Kubernetes. There’s a multi-faceted nature to observability with all its various levels and dimensions — from basic metrics to comprehensive business insights. It’s complex and can continue indefinitely…if you let it.

Accessibility testing with Cypress

Effective user experience (UX) design is a key factor in creating compelling software products. UX considers the quality of interaction that users have with a product and takes the user’s point of view as the most sacred thing in software and product design. A great UX includes accessibility, which ensures that software is inclusive and usable by the widest possible audience.

Connect Over Coffee | Cross-Cloud with Megaport Cloud Router (MCR)

Misha Cetrone, VP of Strategic Partnerships at Megaport, shows us that connecting to and moving between multiple public clouds like AWS, Azure, Google Cloud, and other leading cloud computing services using Megaport Cloud Router (MCR) is as quick and easy as having a cup of coffee.

Expert Insights: Navigating Outages Like A Pro

Large enterprises need Internet Resilience solutions to limit damage from the outages and incidents that are an unavoidable part of doing business. Proactive deployments can get ahead of the problem to prevent damage, while reactive ones after the fact can put a cap on losses. Luckily, Internet Resilience in a cloud-enabled world is easier than you think! Tune in for an engaging discussion with Howard Holton & Howard Beader, where they discuss.

A Simple Scheduler via an Interrupt-driven Actor Model

Using an RTOS is often a tradeoff between the ease of decomposing tasks, with the complexity of the scheduler itself. There exists a middle ground between highly complex systems that may require an RTOS, and simpler ones that can be easily modeled using a super loop.Since ARM is the most popular embedded CPU and almost every ARM processor has a hardware scheduler, it would be interesting to make a compact framework utilizing these features.