%term

Cribl's Blueprint for Secure Software Development.

Jul 19, 2024 By Cribl In Cribl

What does it take to build software for the most security-demanding customers worldwide? At Cribl, building secure products is integral to our engineering identity. We have established a secure software development lifecycle that is both culturally and policy-driven, integrating product security tooling and processes into every architecture review, pull request, and release, whether major or minor.

View Video

Cribl

Read more about Cribl's Blueprint for Secure Software Development.

Integration Spotlight: PagerDuty and Robusta

Jul 19, 2024 By PagerDuty In PagerDuty

Bring powerful AI troubleshooting and cause analysis to your incident response with Robusta's integration with PagerDuty. Join us to learn more from CEO Natan Yellin on how your team can improve your k8s reliability.

View Video

PagerDuty

Read more about Integration Spotlight: PagerDuty and Robusta

Introduction to Ingesting Logs into Loki with Fluentd and Fluent Bit | Zero to Hero: Loki | Grafana

Jul 19, 2024 By Grafana In Grafana

Have you just discovered Grafana Loki and plan to use FluentD or Fluent Bit as your telemetry collector? Or are you trying to decide which agent is right for you? In this "Zero to Hero" episode, we cover the basics of FluentD and Fluent Bit, highlighting their differences and helping you determine when to use one over the other. Additionally, we guide you through configuring both agents' Loki plugins to write logs directly into Loki.

View Video

Grafana

Read more about Introduction to Ingesting Logs into Loki with Fluentd and Fluent Bit | Zero to Hero: Loki | Grafana

Learning Moment: Effective Customer Communication During Incidents - Enhance Visibility & Response with Uptime.com

Jul 19, 2024 By Jonathan Franconi In uptime

The recent global outage caused by an operating system update reminded me of how vulnerable we are today and most importantly, how close we are always teetering on global scale incidents with millions of interconnected dependencies. When the base of the house collapses, everything built on top is impacted. Those of us in IT Operations, Monitoring, Observability (insert the current acronym), etc., know firsthand this risk; we face it every day.

Read Post

uptime

Read more about Learning Moment: Effective Customer Communication During Incidents - Enhance Visibility & Response with Uptime.com

Chaos Testing Explained

Jul 19, 2024 By Shanika Wickramasinghe In Splunk

Chaos testing is a part of site reliability engineering (SRE). In chaos testing, we intentionally break things in and around a given application, in order to: The purpose of chaos testing is to assess how software systems respond to scenarios like network outages, hardware failures, database failures, and server or cluster node failures in the infrastructure.

Read Post

Splunk

Read more about Chaos Testing Explained

How to Build Resilience Throughout Your SDLC Lessons from a Top 10 Bank

Jul 19, 2024 By Gremlin In Gremlin

Are your applications as reliable as you planned? How do you know? The only way to ensure systems are resilient to common failure conditions is to test them, yet many large enterprises struggle with the effort and expense to do so. In this webinar, Anantha Movva, a former head of SRE and Performance Engineering at one of the top 10 North American banks, will share how he drove Chaos Engineering and resilience testing adoption throughout his organization.

View Video

Gremlin

Read more about How to Build Resilience Throughout Your SDLC Lessons from a Top 10 Bank

Monitoring Healthtech Applications with Custom Metrics

Jul 19, 2024 By Lauren Barnes In MetricFire

Staying healthy is essential. Luckily, nowadays, tracking health and wellness is easier than ever. This article will discuss how monitoring allows developers to ensure that their health applications run smoothly so people can stay healthy.

Read Post

MetricFire

Read more about Monitoring Healthtech Applications with Custom Metrics

How to Bounce Back from the CrowdStrike Outage: Expert Tips & Recommendations

Jul 19, 2024 By Spiceworks In Spiceworks

How did this morning's CrowdStrike outage affect you? Aberdeen's Head of Customer Experience Management (CX), Omer Minkara, breaks down four key recommendations to help get your business and its customers back on track.

View Video

Spiceworks

Read more about How to Bounce Back from the CrowdStrike Outage: Expert Tips & Recommendations

Features for Better Code Collaboration #shorts #GitKraken

Jul 19, 2024 By GitKraken In GitKraken

Discover an easier way to review code changes with Code Suggestions! Now, you can approve commits and suggest changes without being limited to specific lines of code. Or, maybe you need quick feedback on WIPs? Cloud Patches let you share your work with your team at any stage of the development process. Collaborate smoothly, get early input, and keep your repos neat and organized.

View Video

GitKraken

Read more about Features for Better Code Collaboration #shorts #GitKraken

Beyond the Headlines: The Unsung Art of Software Outage Management

Jul 19, 2024 By Robert Ross In FireHydrant

Today, the entire world is feeling the pain of a major software outage. While we know a lot about these occurrences—our entire business is built on helping companies manage incidents and outages effectively—we’re not here to share our opinion on it. Instead, we’d like to help those unfamiliar with the incident lifecycle understand what happens when an outage like this occurs, who is responsible for what, and what companies ultimately do to get things working again.

Read Post

FireHydrant

Read more about Beyond the Headlines: The Unsung Art of Software Outage Management

Operations | Monitoring | ITSM | DevOps | Cloud

%term

Cribl's Blueprint for Secure Software Development.

Integration Spotlight: PagerDuty and Robusta

Introduction to Ingesting Logs into Loki with Fluentd and Fluent Bit | Zero to Hero: Loki | Grafana

Learning Moment: Effective Customer Communication During Incidents - Enhance Visibility & Response with Uptime.com

Chaos Testing Explained

How to Build Resilience Throughout Your SDLC Lessons from a Top 10 Bank

Monitoring Healthtech Applications with Custom Metrics

How to Bounce Back from the CrowdStrike Outage: Expert Tips & Recommendations

Features for Better Code Collaboration #shorts #GitKraken

Beyond the Headlines: The Unsung Art of Software Outage Management

Monthly Archive

Follow Us