Latest News

Azure Monitoring Agent: Key Features & Benefits

Aug 13, 2023 By Squadcast Community In Squadcast

In today's rapidly evolving digital landscape, businesses increasingly rely on cloud computing and infrastructure to support their operations. As organizations migrate their workloads to the cloud, robust monitoring and management tools are paramount to ensure optimal performance, security, and efficiency. In response to this demand, Microsoft Azure has introduced the Azure Monitoring Agent (AMA), a powerful and versatile solution designed to enhance the monitoring capabilities of Azure resources.

Read Post

Squadcast

Read more about Azure Monitoring Agent: Key Features & Benefits

Splashing into Data Lakes: The Reservoir of Observability

Aug 11, 2023 By JJ Jeffries, Head of Marketing In ObservIQ

If you’re a systems engineer, SRE, or just someone with a love for tech buzzwords, you’ve likely heard about “data lakes”. Before we dive deep into this concept, let’s debunk the illusion: there aren’t any floaties or actual lakes involved! Instead, imagine a vast reservoir where you store loads and loads of raw data in its natural format. Now, pair this with the idea of observability and telemetry pipelines, and we have ourselves an engaging topic.

Read Post

ObservIQ

Read more about Splashing into Data Lakes: The Reservoir of Observability

Rootly Raises $12 Million from Renegade Partners, Google Gradient Ventures, & XYZ Ventures

Aug 10, 2023 By JJ Tang In Rootly

We are excited to announce that we have raised a $12M round of financing led by Renegade Partners with participation from Google Gradient Ventures (Google’s AI-focused venture fund) and XYZ Ventures. This brings our total funding to date to $15.2M ($20M CAD) alongside our other existing investors Y Combinator and 8VC.

Read Post

Rootly

Read more about Rootly Raises $12 Million from Renegade Partners, Google Gradient Ventures, & XYZ Ventures

How To Write Incident Postmortems

Aug 10, 2023 By Anjali Udasi In Zenduty

Writing a public postmortem regarding an outage is essential to maintaining transparency and accountability when things go wrong in a service or system. The purpose of writing a postmortem is to analyze and document an incident or event that has occurred, usually with a focus on identifying its root causes, understanding what went wrong, and outlining steps to prevent similar issues from happening in the future.

Read Post

Zenduty

Read more about How To Write Incident Postmortems

Tools and Trends in Site Reliability Engineering according to Gartner's 2023 Hype Cycle

Aug 9, 2023 By Halle Katz In OnPage

Gartner recently published its Hype Cycle for Site Reliability Engineering, 2023, report. This blog reviews the future of site reliability engineering based on Gartner’s Hype Cycle. Additionally, the OnPage team is pleased that Gartner mentioned OnPage as a sample vendor in the Automated Incident Response category.

Read Post

OnPage

Read more about Tools and Trends in Site Reliability Engineering according to Gartner's 2023 Hype Cycle

Thanos vs. VictoriaMetrics

Aug 9, 2023 By Last9 In Last9

A deep dive comparison between Thanos and VictoriaMetrics: Performance and Differences.

Read Post

Last9

Read more about Thanos vs. VictoriaMetrics

Observability vs. Telemetry vs. Monitoring

Aug 9, 2023 By Last9 In Last9

Observability vs Telemetry vs Monitoring - What they are, differences and what lies in future.

Read Post

Last9

Read more about Observability vs. Telemetry vs. Monitoring

Unveiling Squadcast's Enhanced Status Pages

Aug 3, 2023 By Sanjog Sandhu In Squadcast

Meet Kevin and Mai (again): Navigating the Troublesome Waters of Platform Downtime. Kevin is a Site Reliability Engineer (SRE), constantly on the lookout for potential downtime that could impact their platform, kryptobro.com. Mai is his adept partner, ever-ready to troubleshoot. In their journey, the previous version of Squadcast Status Pages served as a helpful tool, but they soon found room for improvements.

Read Post

Squadcast

Read more about Unveiling Squadcast's Enhanced Status Pages

SRE Redefines IT Operations as Architect of Sustainable Systems

Aug 3, 2023 By Ari Stowe In Resolve

Site Reliability Engineering (SRE) is a term that’s getting attention and gaining momentum – and for a good reason. SRE takes features of software engineering and applies them to various problems in infrastructures and operations. Organizations look to build SRE teams with a couple goals in mind, including to create and increase scalability and develop solid software systems.

Read Post

Resolve

Read more about SRE Redefines IT Operations as Architect of Sustainable Systems

Kubernetes Incident Management Best Practices

Aug 3, 2023 By Rajesh Tilwani In Rootly

Creating just any infrastructure on Kubernetes is not enough. There are so many basic configurations you could apply and create the infrastructure for your application for the time being and it might work just fine. The incident responses won’t always remain 100% reliable. You will run into newer potholes, and that’s okay.

Read Post

Rootly

Read more about Kubernetes Incident Management Best Practices

Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Azure Monitoring Agent: Key Features & Benefits

Splashing into Data Lakes: The Reservoir of Observability

Rootly Raises $12 Million from Renegade Partners, Google Gradient Ventures, & XYZ Ventures

How To Write Incident Postmortems

Tools and Trends in Site Reliability Engineering according to Gartner's 2023 Hype Cycle

Thanos vs. VictoriaMetrics

Observability vs. Telemetry vs. Monitoring

Unveiling Squadcast's Enhanced Status Pages

SRE Redefines IT Operations as Architect of Sustainable Systems

Kubernetes Incident Management Best Practices

Monthly Archive

Follow Us