Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Debugging AWS Lambda Timeouts

Some time ago, an ex-colleague of mine at DAZN received an alert through PagerDuty. There was a spike in error rate for one of the Lambda functions his team looks after. He jumped onto the AWS console right away and confirmed that there was indeed a problem. The next logical step was to check the logs to see what the problem was. But he found nothing. And so began an hour-long ghost hunt to find clues as to what was failing and why there were no error messages.

8 Changes Drift Made to Save $2.4M in Cloud Cost

Chief Architect at Drift, Freedom Dumalo, recently spoke with the CloudZero team about how they’ve successfully cut cloud costs by $2.4M in just a few months. When the world rapidly transitioned to work from home, Drift, the conversational marketing platform known for its chatbots, saw a spike in customer utilization. The largest usage spike was among new users experimenting with the free trial. This increase in engagement was a great sign for the business.

4 Reasons Why Carbon Black Co-Founder Ben Johnson Prioritizes Cloud Cost at His Latest Startup

After growing Carbon Black from nothing to over 800 employees, Founder and CTO Ben Johnson turned his attention to the security of SaaS applications with his new startup, Obsidian Security. Obsidian secures applications like Office 365, G Suite, Salesforce, Slack and Zoom. This time around, Ben is a seasoned founder and is proactively thinking about how to scale his company with healthy product margins. He uses CloudZero to monitor his AWS costs, detect anomalies, and enable his engineering team.

Splunking Cisco Webex Meetings Data

The COVID-19 pandemic has had a major impact on our working lives. Companies have adapted by moving their workforce to remote work through video conferencing software. Cisco’s Webex Meetings, one of the most popular video conferencing tools, plays a critical role in helping employees stay connected, enhance collaboration and drive productivity.

Using Splunk to Detect Abuse of AWS Permanent and Temporary Credentials

Amazon Web Services lets its users create temporary credentials through the AWS Security Token Service (AWS STS). These temporary credentials work in much the same way as permanent credentials created with the AWS IAM service. There are, however, two differences.

Interview: Why Applications Fail and What to Do About It

Lee Atchison is a recognized industry thought leader in cloud computing and has significant experience architecting and building high-scale, cloud-based, service-oriented SaaS applications. Formerly the Senior Director for Cloud Architecture at New Relic, Lee is now the owner of Atchison Technology LLC, a cloud consulting and advising firm. Lee is also the author of “Architecting for Scale,” a book published by O’Reilly Media.

Introducing the Sumo Logic Observability Suite with distributed tracing (beta) - a cornerstone of cloud-native APM

Last week Sumo Logic announced our new Observability Suite, which included the public introduction of the closed beta for our distributed tracing capabilities as part of our Microservices Observability solution. This new solution will provide end-to-end visibility into user transactions across services, as well as seamless integration into performance metrics and logs to accelerate issue resolution and root-cause analysis. In this blog, we’ll explore the new solution in detail.

ChaosSearch Announces New Integration With Opsgenie

ChaosSearch is excited to announce its new integration with Opsgenie — Atlassian’s alerting and incident management platform. Using this integration, your teams can leverage the industry’s most powerful and comprehensive data monitoring and analytics capabilities channeled into a unified workflow through Opsgenie’s easy-to-use interface.