An Introduction to Apache Parquet
A look at what Parquet is, how it works and some of the companies using its optimization techniques as a critical component in their architecture. As the amount of data being generated and stored for analysis grows at an increasing rate, developers are looking to optimize performance and reduce costs at every angle possible. At the petabyte scale, even marginal gains and optimizations can save companies millions of dollars in hardware costs when it comes to storing and processing their data.