Operations | Monitoring | ITSM | DevOps | Cloud

ScaleUP 2021: Taking the Logz.io Observability Platform to the Next Level

Today was a very exciting day for Logz.io, as we held ScaleUP 2021 – our second annual user conference – dedicated to elevating our customers’ success, discussing best practices for modern observability, and unveiling Logz.io’s latest product updates. These product advancements were presented by our Co-Founder and VP of Product Asaf Yigal, and members of the Logz.io software engineering team.

5 ways incidents made me a better engineer

Incidents are a great opportunity to gather both context and skill. They take people out of their day-to-day roles, and force ephemeral teams to solve unexpected and challenging problems. In my career, I've found incidents can be a great accelerator - for both myself and others around me. It was after leading my first incident at GoCardless that I started to feel really comfortable in the codebase and the team.

A Simple Guide to Taming the Beast That Is Kubernetes

Containers are amazing. But when you start to orchestrate them in a complex environment, they can become quite the beast. Kubernetes is one of the best tools to tame that beast, but few resources exist to help you manage your big data workloads on Kubernetes. If you want to learn how you can optimize your big data workloads on Kubernetes, this is for you.

Lower Your Google Cloud Costs with These 5 Google Dataproc Best Practices

Thinking about using Google Dataproc as your cloud vendor? We can see why. Google Dataproc is a powerful tool for analytics and data processing, but to get the most out of it you have to ensure you use it properly. We’re going to explore five best practices you can use to lower your Google cloud costs while maximizing efficiency: Following these tips will ensure the best performance and help keep your cloud costs in line.

4 Best Tools to Measure and Reduce Network Latency

If your network operations are slow and inefficient, you could be having issues with your latency. Latency measures how long it takes for data to move from one place to another. There are many reasons why you could be experiencing network latency—propagation time, transmission delays, and processing delays are all common causes of latency. When latency occurs, it negatively affects the performance of your network.

TensorFlow Python Code Injection: More eval() Woes

JFrog security research team (formerly Vdoo) has recently disclosed a code injection issue in one of the utilities shipped with Tensorflow, a popular Machine Learning platform that’s widely used in the industry. The issue has been assigned to CVE-2021-41228. This disclosure is hot on the heels of our previous, similar disclosure in Yamale which you can read about in our previous blog post.

Risky Business: Implementing a Redundant Networking and Multi-CDN Monitoring Strategy

Last month, we partnered with AWS to put together a webinar on the importance of implementing a comprehensive redundant networking and multi-CDN monitoring strategy. You can replay the event in full here. In this article, we’ll recap the key takeaways covered by the panel of experts who included Leo Vasiliou, Director of Product Marketing at Catchpoint, and Steve Campbell, our Chief Strategy Officer.