Hardwood is a new open-source Apache Parquet parser for Java 21+, built from scratch to address the heavy Hadoop dependency and single-threaded limitations of parquet-java. It offers two APIs: a row-oriented RowReader for complex nested schemas and a columnar ColumnReader for peak throughput via typed primitive arrays.

8m read timeFrom morling.dev
Post cover image

Sort: