Operations | Monitoring | ITSM | DevOps | Cloud

The latest news and information on cloud monitoring, security, and related technologies.

Trusted SBOMs Delivered with the JFrog Platform and AWS

In this webinar, you’ll learn what an SBOM is, how it will benefit you, what misconceptions exist around it, and why it must be a key element of security and compliance in your software development life cycle (SDLC). We’d also like to invite you to register for a joint JFrog-AWS webinar, where we’ll do a deep dive on SBOMs and share insights and best practices on SBOM creation and usage.

How to use AWS IoT SiteWise Edge and Grafana to collect and monitor industrial data on-premises

The AWS IoT SiteWise plugin for Grafana was created to enable AWS IoT SiteWise customers to visualize and monitor industrial equipment data using Grafana dashboards. Industrial customers use AWS IoT SiteWise to collect, process, and monitor their industrial data at scale. This plugin allows them to use Grafana dashboards to monitor this data, stored by AWS IoT SiteWise in the AWS Cloud.

Elastic and HashiCorp partner to bring infrastructure-as-code to Elastic Cloud

Operations and SRE teams often rely on HashiCorp Terraform to safely manage production-related infrastructure using methodologies such as infrastructure as code, which allows you to apply peer-reviewed infrastructure changes in an automated and controlled fashion.

Announcing support for EKS Anywhere

Amazon Elastic Kubernetes Service (EKS) is a cloud-based compute platform that includes a fully managed Kubernetes control plane to simplify cluster operations. AWS introduced EKS Anywhere to bring the operational ease of EKS to organizations that manage on-premises environments (e.g., to meet data sovereignty requirements).

Maintaining reliable services with advanced Cloud Logging features

We’ve covered ingesting, routing, storing, and viewing logs from your services in Cloud Logging already, but what else can you do with all that data? In this episode of Engineering for Reliability, we show how you can use advanced features like alerting on logs, logs-based metrics, and capturing application exceptions in Error Reporting. Watch to learn how you can find issues faster, make your services more reliable, and keep your users happy.

Creating problem-solving partnerships through a policy of open innovation

The world is full of problems. Any company trying to make a name for itself in the world is going to run right smack into those problems. But the world is also full of solutions. To better find and profit from those solutions, companies are increasingly embracing open innovation, an approach to solving problems in creative and unexpected ways by collaborating with customers, partners, and employees.

Why Migrate to Cloud Now?

The cloud has become the go-to location for businesses to store data and build infrastructure. Many organizations have already shifted their applications to cloud platforms, and many of those that still keep their data on-premises plan to migrate soon. Studies reveal that the main drivers for cloud migration are security, cost-efficiency, and modernization capabilities, but the benefits are not limited to these, and more have yet to be realized. In the current situation, the transition from legacy software to the cloud is a strategic step. It is becoming a must-have for business continuity at most companies.

Cloud or On-Prem? With Monitoring, It's Both-And, Not Either-Or

Despite the migration of services and systems to the cloud (either all or in part), many of the fundamental aspects of the day-to-day work IT practitioners do haven’t changed. They’ve just moved. In this session, SolarWinds Head Geek Leon Adato and Technical Content Manager for Community Kevin M. Sparenberg discuss that state of affairs, as well as what monitoring can do to help view those resources as a contiguous whole, despite possibly being split across the on-prem/cloud divide.

How Lowe's SRE reduced its mean time to recovery (MTTR) by over 80 percent

The stakes of managing Lowes.com have never been higher, and that means spotting, troubleshooting and recovering from incidents as quickly as possible, so that customers can continue to do business on our site. To do that, it’s crucial to have solid incident engineering practices in place. Resolving an incident means mitigating the impact and/or restoring the service to its previous condition.

Top 8 uses of cloud computing

The cloud is gaining widespread adoption. For many organizations, cloud computing has become an indispensable tool for communication and collaboration across distributed teams. Whether you are on Amazon Web Services (AWS), Google Cloud, or Azure, the cloud can reduce costs, increase flexibility, and optimize resources. If you have spent your career in buzzing server rooms full of cable nests, you may be wondering what all the fuss is about.