Postgres to Iceberg in 13 minutes: How Supermetal compares to Flink, Kafka Connect, and Spark

This sponsored benchmark compares Supermetal's new Iceberg sink against Apache Flink, Kafka Connect (with Debezium), and Apache Spark for CDC-based Postgres-to-Iceberg pipelines. Using TPC-H data at scale factor 50 on identical single-node AWS infrastructure, Supermetal completed the initial snapshot in 13 minutes with no tuning, while Flink took 90–116 minutes, Kafka Connect 120 minutes, and Spark over 3 hours. The key differentiators are Supermetal's fast CDC source, its low serialization overhead, and its unique ability to switch Iceberg sink behavior between phases: append-only writes with a target file size during the snapshot, and merge-on-read writes with a time-based flush during live CDC. Flink required aggressive fetch and split size tuning, Kafka Connect needed careful batch tuning, and Spark struggled on a single node because its architecture is built to scale out. All tools produced correct data with matching row counts.
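To make the phase-dependent sink behavior concrete, here is a minimal Python sketch of the idea. It is not Supermetal's implementation or API: the names `Phase`, `FlushPolicy`, `POLICIES`, and `should_flush` are hypothetical, and the 512 MiB and 60 second thresholds are illustrative assumptions that do not come from the benchmark.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Phase(Enum):
    SNAPSHOT = "snapshot"   # initial bulk copy of existing rows
    LIVE_CDC = "live_cdc"   # ongoing stream of inserts, updates, deletes


@dataclass(frozen=True)
class FlushPolicy:
    write_mode: str                      # "append" or "merge-on-read"
    target_file_bytes: Optional[int]     # size-based flush threshold, if any
    max_buffer_seconds: Optional[float]  # time-based flush threshold, if any


# Hypothetical thresholds, chosen only for illustration.
POLICIES = {
    # Snapshot: append-only, flush once a buffer reaches the target file size.
    Phase.SNAPSHOT: FlushPolicy("append", 512 * 1024 * 1024, None),
    # Live CDC: merge-on-read, flush on a timer to bound data freshness.
    Phase.LIVE_CDC: FlushPolicy("merge-on-read", None, 60.0),
}


def should_flush(phase: Phase, buffered_bytes: int, seconds_since_flush: float) -> bool:
    """Return True when the buffer should be written out under the phase's policy."""
    policy = POLICIES[phase]
    if policy.target_file_bytes is not None and buffered_bytes >= policy.target_file_bytes:
        return True
    if policy.max_buffer_seconds is not None and seconds_since_flush >= policy.max_buffer_seconds:
        return True
    return False
```

Under these assumed policies, a snapshot buffer is flushed only when it is large enough to produce a well-sized file, while a live CDC buffer is flushed on elapsed time regardless of size, trading file size for lower latency.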

#postgresql

#apache-kafka

#apache-flink

#apache-iceberg

#change-data-capture

Apr 15 • 14m read time • From thenewstack.io

Table of contents

Test setup
Supermetal
Flink
Kafka Connect
Spark
Data correctness
Summary
