A developer building a custom dataframe library explores the theoretical foundations of dataframe operations. Starting from Petersohn et al.'s dataframe algebra (15 operators covering 200+ pandas methods), the author discovers that the relational core maps onto exactly three categorical patterns: Delta

16m read time. From mchav.github.io
Table of contents
Petersohn’s dataframe algebra
Three shapes of schema change
Why these three
Designing an API around the three patterns
Where this is going
