Operations | Monitoring | ITSM | DevOps | Cloud

Scaling Prometheus: How we're pushing Cortex blocks storage to its limit and beyond

In a recent blog post, I wrote about the work we’ve done over the past year on Cortex blocks storage. Cortex is a long-term distributed storage for Prometheus. It provides horizontal scalability, high availability, multi-tenancy and blazing fast query performances when querying high cardinality series or large time ranges.

21 new ways we're improving observability with Cloud Ops

We’ve heard from customers about how important it is to be able to reliably operate your applications and infrastructure running on Google Cloud. In particular, observability is critical to reliable operations. To help you quickly gain insight into your Google Cloud environment, we’ve added 21 new features to Cloud Operations, the observability suite we launched earlier this year, which gives you access to all our operations capabilities directly from the Google Cloud Console.

Logstash CSV: Import & Parse Your Data [Hands-on Examples]

The CSV file format is widely used across the business and engineering world as a common file for data exchange. The basic concepts of it are fairly simple, but unlike JSON which is more standardized, you’re likely to encounter various flavors of CSV data. This lesson will prepare you to understand how to import and parse CSV using Logstash before being indexed into Elasticsearch.

How many 9's are enough? Kolton Andrus  CTO Connection: Reducing engineering cycle time

How many nines of availability are enough? In this talk, Gremlin CEO Kolton Andrus shares insights from years at Amazon, Netflix, and now working with a wide array of customers across various disciplines and industries. He’ll describe what each level of availability looks like, the challenges faced at each stage, and the trade-offs required to achieve the next nine of uptime.

InfluxDB Community Office Hours - August 2020

InfluxDB Community Office Hours are one-hour, monthly online sessions, hosted by Influxers to answer your questions about any topic related to InfluxDB or time series. We host this monthly live webinar so that users can directly ask a panel of Influxers questions and talk in real time. We record these sessions and post them on YouTube. InfluxDB Community Office Hours are part of our commitment to open source, developer happiness, and time to awesome. In our August 2020 session, Tim Hall discusses InfluxDB OSS 2.0 and the path to upgrading.

Effortless Load Testing | Simon Aronsson (Load Impact / k6)

Load testing and performance monitoring used to be really hard and bothersome. Not any more! With modern code-first tools and visualisation, being on top of your service scalability and performance is no longer something that's reserved for the QA department. According to research by Google, 53% of mobile website visitors will leave if the page load duration exceeds three seconds. Armed with this knowledge, We'll go through how to implement load tests and performance monitoring around it as well as how to efficiently visualize it.