pg_infer 1.0.0 released -- transformer model knowledge as SQL relations

pg_infer 1.0.0 is a new extension for PostgreSQL 18+ that embeds the internals of small transformer language models (gate activations, feature labels, learned associations, and embeddings) directly in PostgreSQL as SQL-queryable relations and a custom index access method. Unlike pgvector, which stores user-supplied embeddings, or RAG pipelines, which call external services, pg_infer stores the model itself in WAL-logged 8KB pages and exposes it to the planner as a first-class operator. The `<~>` operator is index-backed and composes with WHERE, JOIN, aggregation, and partitioning. pg_infer targets CPU-only hardware, using BitNet b1.58 ternary-weight transformers and OpenBLAS, which makes inference viable on existing PostgreSQL replica hosts without GPUs. Functions such as `describe(entity)`, `walk(prompt)`, and `implies(a, b)` expose the model's learned knowledge directly in SQL. It builds on the LARQL project's vindex format and gate-KNN algorithm.

May 22 • 5m read time • From postgresql.org

Table of contents

Quick example
What pg_infer does that other extensions do not
CPU inference, BitNet, and idle-cluster compute
A few queries that are uniquely pg_infer
Acknowledgements
A note on stability and feedback
Links
