Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Resources for Your Capacity Planning Project

It’s budgeting season for many of us. As a result, capacity planning and cost justification for hardware purchases are likely on your mind. But don’t fear, because Galileo has you covered. We’ve curated a bunch of our most popular and helpful resources for your capacity planning project, and they’re all here in one handy place.

Expand your monitoring reach with Datadog's enhanced Azure integration

Microsoft Azure is one of the fastest-growing cloud platforms in the world, offering a wide range of products for deployment, testing, and cloud storage. Datadog is committed to maintaining an extensible monitoring solution for Azure’s growing ecosystem that doesn’t require lots of manual configuration. That’s why we are excited to announce the following enhancements to Datadog’s Azure integration.

How Mercari Scales Vision, Culture, & Reliability

In a recent fireside chat with Mohan Bhatkar, Head of Engineering for the Customer Reliability Platform at Mercari, Inc. sat down with Blameless Co-Founder Ashar Rizqi. They talked about scaling while avoiding silos, exciting day-to-day challenges, instilling a culture of empowerment, and more. Here are their top insights and the lightly edited transcript of their conversation.

How to Choose your Monitoring Solution

When we talk about IT support in an organization, it’s not only about resetting passwords or fixing desktops and printers. The most important task for your IT staff is to regularly monitor your network for any emerging issues or threats, and respond on time to ensure that the problem does not interfere with your productivity. However, in an age of growing networks and small budgets, it’s a challenging task that is not easy to handle manually at all times.

Fast scaling for containerized workloads with automatic headroom

High performing container workloads rely on infrastructure to match application demands at a moment’s notice. From scaling bursts that require instant compute availability, to traffic lulls that create infrastructure waste, it’s important to keep both availability and cost in mind during the life of a production application.

Secure Chaos Engineering on Kubernetes Clusters Without being a Noisy Neighbor

Get started with Gremlin's Chaos Engineering tools to safely, securely, and simply inject failure into your systems to find weaknesses before they cause customer-facing issues. Kubernetes is a powerful open source platform to build scalable, reliable systems, designed to be extensible and customizable for many use cases. Kubernetes provides the ability to scale individual pods, swap out runtimes, and control access to objects using namespaces.

Keeping cloud costs low and availability high during Black Friday

During Black Friday 2017 there seemed to be a lot of missing capacity in several AWS regions, even for on-demand instances. As such, some AWS users are wary of using EC2 spot instances going into Black Friday. In this post we will explain how Spot by NetApp can help ensure high availability while fully leveraging spot instances.