Cloud

The latest news and information on cloud monitoring, security and related technologies.

Learning from the AWS Outage: Internal Monitoring Alone Isn't Enough

If you have set up your own monitoring services with Amazon CloudWatch, Azure Monitor or another internal tool, we suggest you consider looking beyond the horizon. These services often provide internal web monitoring only. Perhaps they validate HTTP availability from locations outside their networks, but HTTP checks won’t give you a 360° view into the state of your services.

Accelerate incident analysis by incorporating Ocean logs in any pipeline

Spot Ocean delivers container-driven autoscaling to continuously monitor and optimize your cloud environment. Positioned at a busy crossroads in the application deployment pipeline, Ocean has a critical role when shipping new containers. Given the highly dynamic nature of Kubernetes environments, events happen constantly and take shape as logs in Ocean. These can help you understand the chain of events in different scaling scenarios, from debugging cluster issues to incident analysis.

Ocean explained: container-driven autoscaling with Kubernetes

Whether you’re using a managed Kubernetes service like AWS EKS, GCP GKE or Azure AKS, or self-managing a DIY cluster deployed with open source tools like kops and Kubespray, the underlying hardware can vary from container to container. Each container requires specific resources (CPU/memory/GPU/network/disk) and as long as the underlying infrastructure can provide those resources, the container will be able to execute its business logic.

Control Plane Demo

Control Plane is a perfect substrate to run microservices. When you deploy on Control Plane, your containerized apps run concurrently on AWS, Azure and GCP. Customers experience ultra-low latency and 99.999% availability, and you can mix and match all the services of AWS, Azure and GCP as if those clouds had virtually merged. The entire "Ops" portion of "DevOps" has been codified, so developers are free to focus on the Dev part and leave the Ops part to the platform.

Discover Database Servers In 3 Steps

I recently had a cloud migration client who was at the beginning stage of their discovery phase and looking to jump straight to “which database platforms should I be using in the cloud?” - a tall ask you might say, but following the three steps below they were able to discover and analyze all of their database servers in just two weeks.

Refined User Experience, New Executive Visibility, and Enhanced Cloud Monitoring with Splunk Enterprise Security 7.0

Just like that, another year has gone by full of remote work, virtual conferences, and lengthy Zoom calls. And, although we were not able to see our fellow Splunkers in person at .conf21, that didn’t stop us from previewing the latest enhancements to Splunk Enterprise Security. And now, it gives us great pleasure to announce that Enterprise Security 7.0 is available!

9 Cloud FinOps Challenges - And The Solutions To Overcome Them

Companies migrate to the cloud to become more productive, respond to market changes, and be flexible — while spending less on cloud infrastructure. But there is one thing that many cloud-based organizations have learned: Cloud costs add up. Fast. As a result, 71% of cloud financial management teams doubt they will achieve expected results at the expected time or at all. Only 29% of cloud FinOps teams expect to achieve their cloud goals.

AWS re:Invent 2021: Scale the Kubernetes-first, cloud-native model

The widespread adoption of Kubernetes has made it table stakes for the modern cloud native stack. Software is now being purpose-built for Kubernetes, and as companies enter this new phase of their cloud journey, they are looking to scale. Building upon years of experience with Kubernetes, Spot by NetApp is continuously innovating to bring new ways to achieve this goal. From container-optimized, serverless infrastructure to Kubernetes-native continuous delivery, Spot is connecting the dots to vastly improve the efficiency and manageability of Kubernetes applications and environments.

How to deploy the Google Cloud Ops Agent with Ansible

Site Reliability Engineering (SRE) and Operations teams responsible for operating virtual machines (VMs) are always looking for ways to provide a more reliable, more scalable environment for their development partners. Part of providing that stable experience is having telemetry data (metrics, logs and traces) from systems and applications so you can monitor and troubleshoot effectively. Many Google Cloud services, including Google Compute Engine, provide basic system metrics out of the box.