Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

How Gremlin monitors its own Chaos Engineering service with Datadog

Reliable systems are vital to meeting customer expectations. Downtime not only hurts a company’s bottom line but can be detrimental to reputation. Our goal at Gremlin is to help enterprises build more reliable systems using Chaos Engineering. Whether your infrastructure is deployed on bare metal in a corporate-owned data center or as Kubernetes-orchestrated microservices in a public cloud, chaos experiments can help you find system weaknesses early, before they affect customers.

Autoscaling Puppet compile masters with AWS

In classic Puppet deployment architecture, compile masters are widely used when the number of managed nodes goes up. Multiple compile masters sit behind a load balancer to take care of the additional workloads. It is not rare to see Puppet adopters launching the compile masters in the public cloud, such as Amazon Web Service (AWS) and Google Cloud Platform.

ITSM Change Management to Control Continuous Cost Optimization

I’ve been writing about continuous cloud optimization for a while now, and recently, I’ve spoken with several organizations to understand any challenges they’re currently facing in their automation journey. Their insights would help us understand how we can improve our technology to better support them. I discovered two fundamental themes behind their challenges.

Q&A with Ziff Media Group: Why They Made the Switch to Kubernetes

Today’s leading companies are one step ahead of their competitors as they adopt new tools and disciplines emerging from the cloud native landscape. That was the case for Ziff Media Group, which is a collection of several media web properties including pcmag.com, mashable.com, deals.com, offers.com, and more.

Best practices for an effective remote team in the world of cloud delivery

Just like countless other business functions, cloud delivery has now shifted entirely to remote working for now and perhaps longer term. Canonical has been a remote-first company for some time, so we thought we’d take this opportunity to share some of the best practices we’ve learned along the way. Communication Effective communication between customers, engineers, and project managers is the most critical element of successful cloud delivery.

Top 5 Reasons for Enterprises to Modernize their Applications to Cloud

While the secret of change is to focus all your energy not on fighting the old, but on building the new, adaptability is about the powerful difference between adapting to cope and adapting to win. Enterprises that overlook modernization are sure to fail in fulfilling critical compliance and security problems thus, endangering both, their businesses and customers.

Featured Post

Appfleet - an edge cloud platform for improving the efficiency of business processes - launched

Appfleet - an edge hosting platform that allows companies to easily deploy docker containers in multiple locations at the same time, improving the efficiency of business processes and reducing latency - is debuting on the market. Ultimately, this solution is to replace limiting and expensive serverless services, managed kubernetes solutions and complex multi-regional cloud deployments.

Introducing the Datadog IoT Agent

From smart thermostats and grocery store checkouts to public utility infrastructures and industrial manufacturing lines, the Internet of Things (IoT) is all around us—and growing larger every day. But with this rapid growth comes a number of operational challenges: IoT devices collect a large amount of data, and are often distributed across harsh, ever-changing environments.