Introducing Apache Arrow Support in mssql-python

mssql-python now supports fetching SQL Server data directly as Apache Arrow structures, enabling zero-copy, columnar data transfer to Polars, pandas, DuckDB, and other Arrow-native libraries. Three new Cursor APIs are introduced: cursor.arrow_batch() for incremental batch fetching, cursor.arrow() for eager retrieval of the full result set as a pyarrow.Table, and cursor.arrow_reader() for lazy streaming via a RecordBatchReader. The Arrow path avoids creating a Python object per row, which reduces GC pressure and memory usage. Temporal types such as DATETIME and DATETIMEOFFSET show the largest performance gains because timezone normalization runs entirely in C++. One known limitation: NVARCHAR columns are slower on Linux because of a slower UTF-16 to UTF-8 conversion path, and a fix is planned. The feature was contributed by community developer Felix Graßl and is purely additive: existing fetchone/fetchmany/fetchall code is unaffected.
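As a rough illustration of how the eager and streaming APIs named above might be used, here is a minimal sketch. Only the method names cursor.arrow() and cursor.arrow_reader() and their return types come from the summary; calling them with no arguments is an assumption, and the connection string, table name, and helper functions (fetch_table, count_rows_streaming, main) are hypothetical.

```python
def fetch_table(cursor, query):
    """Eagerly fetch the whole result set.

    Per the summary, cursor.arrow() returns a pyarrow.Table.
    Assumption: it can be called with no arguments.
    """
    cursor.execute(query)
    return cursor.arrow()


def count_rows_streaming(cursor, query):
    """Stream the result set lazily and count its rows.

    Per the summary, cursor.arrow_reader() returns a RecordBatchReader;
    iterating one yields record batches, each exposing num_rows.
    Assumption: it can be called with no arguments.
    """
    cursor.execute(query)
    total = 0
    for batch in cursor.arrow_reader():
        total += batch.num_rows  # only one batch is held at a time
    return total


def main():
    # Hypothetical connection string and table name, for illustration only.
    # The import is local so the helpers above can be exercised without a database.
    import mssql_python

    conn = mssql_python.connect(
        "Server=localhost;Database=demo;Trusted_Connection=yes;"
    )
    cursor = conn.cursor()
    table = fetch_table(cursor, "SELECT id, created_at FROM dbo.events")
    print(table.num_rows, table.schema)
    print(count_rows_streaming(cursor, "SELECT id FROM dbo.events"))
    conn.close()
```

Calling main() requires a reachable SQL Server instance. The eager helper suits result sets that fit in memory, while the streaming helper keeps memory bounded by processing one batch at a time.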

#python

#microsoft-sql-server

#polars

May 04 • 7m read time • From devblogs.microsoft.com

Table of contents

What Is Apache Arrow?
The Arrow Fetch APIs
Testing
Getting Started
What’s Next
A Note of Thanks
Resources
