Ecommerce monitoring strategy for Black Friday and Cyber Monday 2021

It’s nearly here. The annual mad rush in the wee hours of the morning. The stampede into retail stores to claim really deep discounts on the latest toys, electronics, and gadgets makes headline news every year. The season begins the day after Thanksgiving and usually includes two of the biggest shopping days of the year. Yes, we’re talking about Black Friday and Cyber Monday.

What can you learn from IoT with i2M - Part 3

In the last two installments (Part 1 & Part 2), we discussed the basics of IoT and an example of how the components can be connected and used to provide basic automation and alerting. These seemingly simple steps can build up to provide very advanced control over all aspects of the physical world. The challenge then becomes managing situations that were not expected.

Podcast: Break Things on Purpose | Gustavo Franco, Senior Engineering Manager at VMware

In this episode Jason is joined by Gustavo Franco, Senior Engineering Manager at VMware, to chat about chaos in Gustavo’s early days. Gustavo reflects on everything from Google’s early disaster recovery practices to the contemporary SRE movement.

Top 5 Challenges of Monitoring Complex IT Infrastructures

When monitoring a large IT infrastructure, there are multiple aspects you need to keep under control. Doing things manually and relying on people to ensure infrastructure reliability can be the wrong decision and can mislead you when resolving issues or troubleshooting problems. The complexities of managing a large ecosystem can seem hard to overcome, but in reality, they can be handled.

Best practices for Cloud Operations in the enterprise

How can you get the most value out of Cloud Operations, especially as your Cloud footprint grows? In this episode of Engineering for Reliability, we look at the enterprise best practices for setting up and using Cloud Operations. Watch to learn how to improve the security of your services, better manage capacity, and keep your users happy!

NGINX Monitoring: Best Tools and Key Metrics You Should Know About

NGINX is a popular web server featuring a wide range of capabilities, including reverse proxy, mail proxy, HTTP cache, and load balancing. It offers TLS offloading and backend health checks, and it supports gRPC, WebSocket, and HTTP/2. In short, NGINX is a one-stop solution for most of your web server needs. When using NGINX, monitoring its metrics is crucial for catching and diagnosing issues.
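
As a rough illustration of what collecting NGINX metrics can look like, here is a minimal Python sketch that parses the plain-text output of NGINX's `stub_status` module into counters. The sample text, the `parse_stub_status` function name, and the derived `dropped` field are assumptions made for this example; they are not taken from the article itself.

```python
import re

# Sample text in the format produced by NGINX's stub_status module.
# The numbers are illustrative only.
SAMPLE = """Active connections: 291
server accepts handled requests
 16630948 16630948 31070465
Reading: 6 Writing: 179 Waiting: 106
"""


def parse_stub_status(text):
    """Parse stub_status output into a dict of integer counters.

    parse_stub_status is a hypothetical helper name used for this sketch.
    """
    active = re.search(r"Active connections:\s+(\d+)", text)
    # The line holding the three cumulative counters contains only numbers.
    totals = re.search(r"^\s*(\d+)\s+(\d+)\s+(\d+)\s*$", text, re.MULTILINE)
    states = re.search(
        r"Reading:\s+(\d+)\s+Writing:\s+(\d+)\s+Waiting:\s+(\d+)", text
    )
    if not (active and totals and states):
        raise ValueError("unrecognized stub_status output")

    accepts, handled, requests = (int(v) for v in totals.groups())
    reading, writing, waiting = (int(v) for v in states.groups())
    return {
        "active": int(active.group(1)),
        "accepts": accepts,
        "handled": handled,
        "requests": requests,
        "reading": reading,
        "writing": writing,
        "waiting": waiting,
        # Accepted but not handled connections usually point at a resource limit.
        "dropped": accepts - handled,
    }


metrics = parse_stub_status(SAMPLE)
```

In practice the text would be fetched from whatever location exposes `stub_status` in your NGINX configuration, and the counters would be sent to a monitoring backend rather than kept in a local dict.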

How they SRE: Insights from the Cloudflare SRE team

Cloudflare is a global cloud services provider with offices around the world, from San Francisco, US to London, England to Sydney, Australia. Their mission, as stated front and center on their homepage, is to help build a better Internet. While that may read like hyperbole, their numbers are impressive: Cloudflare has over 126,000 paying customers, and 95% of Internet users in the developed world are within 50ms of their network.

Rollbar Pro Tips: Slack Best Practices

Deliver a better alerting experience for you and your team by utilizing Slack best practices and customizing your alerts. Rollbar is the leading continuous code improvement platform that proactively discovers, predicts, and remediates errors with real-time AI-assisted workflows. With Rollbar, developers continually improve their code and constantly innovate rather than spending time monitoring, investigating, and debugging.

Rollbar Pro Tips: UUIDs

The Rollbar API supports including a UUID with occurrence reports which can later be used to look up that exact occurrence. Learn how you can create a browser bookmark to quickly open an error sent to Rollbar, and how to configure Rollbar to display the error UUID back to the web page.