Operations | Monitoring | ITSM | DevOps | Cloud

Faster Incident Resolution with Context Rich Alerts

Labelling your alert payloads although simple can significantly improve the time it takes for your team to respond to incidents. In this blog learn how Squadcast's auto-tagging feature can be a game changer by enabling intelligent labelling & routing of alerts to ultimately reduce your MTTR. A frequent problem faced by on-call engineers when critical outages occur is pinpointing the exact point of failure.

Performance, Stress, and Load Tests in Rails

Tests are an integral part of most well-working Rails applications where maintenance isn’t a nightmare and new features are consistently added, or existing ones are improved. Unfortunately, for many applications, a production environment is where they are put under heavy workload or significant traffic for the first time. This is understandable as such tests are costly.

20 Best Software Development Tools in 2021

With the increase in popularity and adoption of software development across the industry, developers and organizations are constantly looking for tools to make their lives easier. The right set of tools can quickly help you get the maximum output each day, but the road to finding your arsenal of the best software development tools is not easy. That’s why we researched for you!

CVE-2021-31440: Kubernetes container escape using eBPF

In a recent post by ZDI, researchers found an out-of-bounds access flaw (CVE-2021-31440) in the Linux kernel’s (5.11.15) implementation of the eBPF code verifier: an incorrect register bounds calculation occurs while checking unsigned 32-bit instructions in an eBPF program. The flaw can be leveraged to escalate privileges and execute arbitrary code in the context of the kernel.

Can They Really See That Data? Auditing Access Controls Across Environments

"Who has access to sensitive financial data in our organization?" Assuming you have a definition of what "sensitive financial data" means for your organization, can you easily answer this question? If you needed to perform an audit to verify only the appropriate people have access, what actions would you take?

The Future Demands Full Stack DevOps Engineers at the Epicenter

As we wrap up swampUP 2021, I have never felt more excited about being part of the global DevOps community. My greatest takeaway from swampUP 2021, with all of its great presentations and the participation of thousands of our community peers, is that DevOps today finds itself at a historical inflection point. Let me explain.

Understanding Serverless Observability

Ideally, observability should help you understand the state of your application and how it performs under different circumstances. However, while serverless observability may seem similar to serverless monitoring and testing, the three achieve different goals. Testing helps you check your application for known issues, and monitoring helps you evaluate system health according to known metrics. Observability helps you search and discover unknown issues, providing end-to-end visibility.

What Is Cloud Optimization? (And Why Is It Important?)

Whether you've recently migrated to the cloud, or you've been operating there for awhile now, you've likely realized how challenging it can be to understand exactly where your cloud spend goes and what drives your costs. Using a cloud service provider can offer many benefits, like the ability to easily scale up and down based on need, but unless you're using the right services and tools, your costs can quickly spiral out of control.

AIOps as a modern cockpit, and why that matters

Our human capacity for ingesting information and acting on it, is constant. As the systems we operate grow more complex, we need to make sure we use technology that presents us with only the relevant information we need, exactly when we need it. In aviation, this lesson was learned long ago, and now IT Ops is catching up.