upvote
The ‘why’ is referenced in the bibliography at the end of the readme; this repo is not meant to be consumed standalone. Start with the paper instead:

https://doi.org/10.1145/3749163

reply
I also had no idea what they were talking about, but there's good points about how hardware oblivious and somewhat global is Parquet around metadata.

I found this post interesting,

- https://medium.com/@reliabledataengineering/f3-the-future-pr...

reply
Yeah it seems like most of this can be handled by some more dev hours to Parquet
reply
Paper mentions Parquet, ORC, Nimble, Lance, TSFile, Bullion, and BtrBlocks.
reply