The latest news and information on cloud monitoring, security, and related technologies.

How CloudZero Manages Cloud Costs During Our Product Discovery Process

Like any SaaS organization, CloudZero cares about our COGS, margin, and unit cost. As you can imagine, we have a fair amount of insight into how those numbers change every day, thanks to being “customer zero” on our cloud cost intelligence platform. Talking with other companies, I often find that the best organizations understand that measuring the cost implications of new products and features only after you’ve deployed them can be too late.

Verify GKE Service Availability with new dedicated uptime checks

Keeping your end users’ experience in mind is important when developing applications. Observability tools help your team measure the performance indicators that matter to your users, such as uptime. It’s generally good practice to measure your service internally via metrics and logs, which can indicate uptime, but an external signal is also very useful wherever feasible.

Orchestration in Telcos: the multi-vendor and multi-cloud environments...

As NFV migration becomes commonplace, the need for a higher degree of software management, smoother upgrades, and a smoother deployment process is becoming apparent. The complexity of the migration has deterred telcos from adoption. There should be a solution that helps businesses manage and deploy network automation, orchestration, and managed services. In general, a telco network is complex and needs to be managed from multiple perspectives.

Monitor and troubleshoot your VMs in context for faster resolution

Troubleshooting production issues with virtual machines (VMs) can be complex and often requires correlating multiple data points and signals across infrastructure and application metrics, as well as raw logs. When your end users are experiencing latency, downtime, or errors, switching between different tools and UIs to perform a root cause analysis can slow your developers down.

Distributed tracing with OpenTelemetry and Cloud Trace

As more services are involved in serving user traffic and completing transactions, how does each service contribute to overall latency? In this episode of Engineering for Reliability, we’ll show how to use distributed tracing to capture the latency of user requests and how long it takes each service in the path to return a response. Watch to learn how to capture latency in distributed applications using OpenTelemetry and analyze it using Cloud Trace.

Product Explainer Video: Splunk Infrastructure Monitoring for Real-time Monitoring in the Cloud

Wherever you are in your cloud journey and whatever your environment looks like, Splunk Infrastructure Monitoring is a purpose-built metrics platform to address real-time cloud monitoring requirements at scale. Get real-time observability for data from any cloud, any vendor, and any service.

Google Cloud Asset Inventory 101

Cloud Asset Inventory is a metadata inventory service that allows you to view, monitor, and analyze all your Google Cloud and Anthos assets across projects and services. In this video, Sophia Yang, a Google Cloud Product Manager, will show you how Cloud Asset Inventory lets you gain greater visibility into your Google Cloud assets, receive real-time notifications on asset config changes, run analysis on your inventory, get insights from your deployment, and more! Watch to learn how you can use Cloud Asset Inventory to gain greater observability into your Google Cloud and Anthos assets!