Modern distributed applications are composed of potentially hundreds of disparate services, all containing code from different internal development teams as well as from third-party libraries and frameworks with limited external visibility. Instrumenting your code is essential for ensuring the operational excellence of all these different services. However, keeping your instrumentation up to date can be challenging when new issues arise outside the scope of your existing logs.
Missed the last Netdata updates? Here is what is new.
Recently, Elastic Universal ProfilingTM became generally available. It is the part of our Observability solution that allows users to do whole system, continuous profiling in production environments. If you're not familiar with continuous profiling, you are probably wondering what Universal Profiling is and why you should care. That's what we will address in this post.
System outages: they are an inevitable problem that every single IT team will encounter at some point. Whether they come about due to technical issues, act-of-god natural disasters, or simply random human error, system outages happen to the best of us. Though the cause of system outages is not always in your control, you can control your team’s processes for response and resolution.