Operations | Monitoring | ITSM | DevOps | Cloud

Maximising the potential of business intelligence with data warehouse automation

Data is a business's most valuable asset. When given the ability to harness its power, organisations can transform their data into meaningful business intelligence (BI)-and act on this to stay ahead in an ever-more competitive landscape. This is where the enterprise data warehouse (EDW) comes in. An EDW centralises and consolidates data from many sources-storing it in a denormalised structure called a star schema-making it easy to analyze, visualise, and forecast essential business metrics.

Introduction to Apache Arrow

A look at what Apache Arrow is, how it works, and some of the companies using it as a critical component in their architecture. Over the past few decades, leveraging big datasets required businesses to perform increasingly complex analysis. Advancements in query performance, analytics, and data storage are largely a result of greater access to memory. Demand, manufacturing process improvements, and technological advances all contributed to cheaper memory.

How Apache Arrow is Changing the Big Data Ecosystem

This article was originally published in The New Stack and is reposted here with permission. Arrow makes analytics workloads more efficient for modern CPU and GPU hardware, which makes working with large data sets easier and less costly. One of the biggest challenges of working with big data is the performance overhead involved with moving data between different tools and systems as part of your data processing pipeline.

DataOps vs. DevOps - Similar or Completely Different?

Over the past several years, there has been a big focus on streamlining infrastructure operations. Many organizations have begun using DevOps principles in their software development and operations activities. We've also seen a new Ops in town recently: DataOps. Good things come from new methods to help teams collaborate across an organization.

What Is a Column Database and When Should You Use One?

If you are working with large amounts of data that will primarily be used for analytics, a column database might be a good option. There are a lot of different options when it comes to choosing a database for your application. A common discussion seems to be the high-level SQL vs. NoSQL database argument of whether data should be stored in a relational database or in a NoSQL alternative like key-value, document or graph databases.

An Introduction to Apache Superset: An Open Source BI solution

With native SQL support coming to InfluxDB, we can broaden the scope of developer tools used to analyze and visualize our time series data. One of these tools is Apache Superset. So let’s break down the basics of what Superset is, look at its features and benefits, and run a quick demo of Superset in action.