Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Cortex v1.0 released: The highly scalable, fast Prometheus implementation is generally available for production use

We’re happy to announce that Cortex v1.0 has been released! The horizontally scalable, durable, and fast Prometheus implementation is now generally available for production use. At Grafana Labs, we’ve been using Cortex in production for almost three years, including to power the Prometheus backend for the Grafana Cloud managed logging and metrics platform.

Best Practices for Monitoring Kubernetes using Grafana

Microservices and containers have taken the tech industry by storm. Kubernetes is one of the tools that has evolved to manage these new aspects of software development. It is an open-source system for automating deployment, scaling, and management of containerized applications. One of the biggest challenges that organizations face when adopting Kubernetes is performing monitoring tasks in this dynamic environment.

Best practices to ensure data security while working remotely

Coronavirus has disrupted daily life for so many around the world in a shockingly short span of time. Lifestyles have shifted. A new normal, albeit a panic-stricken one, has set in. One-third of the global population is under lockdown to slow the spread of coronavirus. Many organizations have adopted temporary work-from-home measures to keep themselves up and running.

Ecommerce Security - NutriBullet & Tupperware Suffer Magecart Attacks

The COVID-19 epidemic has brought a 23% rise in visitors to UK independent ecommerce sites. Globally, many companies have likewise moved to a fully ecommerce-based business model and are seeing an increase in online shoppers. At the same time, employees are working remotely, self-isolating, or ill. This shift in how businesses operate means websites are increasingly vulnerable to attack.

How SREs Can Embrace Resilience During Crises

Blameless recently had the privilege of hosting SRE leaders Liz Fong-Jones, Dave Rensin, and Alex Hidalgo to discuss how SREs can embrace resilience during a pandemic, and how the principles of SRE intersect with global trends. The transcript below has been lightly edited, and if you’re interested in watching the full panel, you can do so here.

Fleet Management for Kubernetes is Here

Today I’m excited to announce Fleet, a new open source project from the team at Rancher focused on managing fleets of Kubernetes clusters. Ever since Rancher 1.0 shipped in 2016, Rancher has provided a central control plane for managing multiple clusters. As pioneers of Kubernetes multi-cluster management, we have seen firsthand how users have consistently increased the number of clusters under management.

Announcing Hosted Rancher with Rancher 2.4

We’ve heard from many of our customers and prospects that they love Rancher but just don’t have the staff and expertise to operate the platform. Figuring out the compute, storage and networking architecture can be a challenge. Performing upgrades, backups and troubleshooting can also be time consuming. Monitoring the environment and knowing when to scale up or down, horizontally or vertically, is yet another thing to worry about.

Want to be able to work even faster and smarter? Download Discovery 1.5, live today!

Our ServiceNow Discovery Connector extends your ServiceNow discovery sources into SCOM, allowing you to leverage the rich management pack discoveries and wide SCOM agent deployment to populate your CMDB with everything from servers and devices to databases, clusters, and services. All with no extra agents, so you are up and running in minutes, not months! Our latest version also brings with it some new features.

We've just launched Alert Sync 1.5 and it's even more functionally fantastic than before!

So, what shiny new functionality have we added for you to enjoy? As well as all the great stuff Alert Sync did before, you can now benefit from even more features: Wait Rules allow an incoming SCOM alert to be held for a specified period of time before being evaluated against Incident Creation rules. This is really useful for those instances when a SCOM alert might open and close itself in quick succession (like a CPU usage threshold monitor).
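To make the Wait Rules idea concrete, here is a minimal Python sketch of a hold-then-evaluate buffer. It is not Alert Sync's implementation or API: the `WaitBuffer` class, its method names, and the alert IDs are all hypothetical, and time is passed in explicitly so the behaviour is easy to follow. An alert that closes while it is still being held is dropped, so it never reaches the incident rules.

```python
from dataclasses import dataclass, field


@dataclass
class WaitBuffer:
    """Hypothetical wait rule: hold alerts for hold_seconds before release."""

    hold_seconds: float
    _held: dict = field(default_factory=dict)  # alert_id -> time first seen

    def on_open(self, alert_id: str, now: float) -> None:
        # Start the wait timer the first time the alert arrives.
        self._held.setdefault(alert_id, now)

    def on_close(self, alert_id: str) -> None:
        # An alert that closes while still held is discarded.
        self._held.pop(alert_id, None)

    def release_due(self, now: float) -> list:
        # Return alerts whose hold period has elapsed; a caller would pass
        # these on to whatever evaluates the incident creation rules.
        due = [a for a, t in self._held.items() if now - t >= self.hold_seconds]
        for alert_id in due:
            del self._held[alert_id]
        return due


# Example: a CPU alert that opens and closes quickly never gets released.
buf = WaitBuffer(hold_seconds=300)
buf.on_open("cpu-spike", now=0)
buf.on_open("disk-full", now=10)
buf.on_close("cpu-spike")
```

With this setup, calling `buf.release_due(now=310)` would hand back only the `disk-full` alert, since the CPU alert closed during its hold period.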