Minimaxir's blog is a hub for machine learning enthusiasts, offering tutorials, project showcases, and insights into the latest trends in AI and data science. With a focus on practical applications of machine learning, Minimaxir shares tips, tools, and resources for building and deploying ML models. Developers can learn about deep learning frameworks, natural language processing techniques, and AI-powered creativity, gaining skills to tackle real-world problems.

Max Woolf's Blog

Text embeddings, which represent words and documents as numeric vectors, are useful in many applications. Vector databases like faiss or Pinecone are the usual way to handle embeddings, but for smaller projects, simpler approaches built on numpy and file formats like Parquet can be more efficient. Parquet files, combined with polars, are a powerful alternative for storing embeddings and running similarity searches, with the added flexibility of keeping metadata alongside the vectors.
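
To make the "simpler methods involving numpy" concrete, here is a minimal sketch of a brute-force similarity search. It assumes the embeddings are unit-normalized, so a dot product equals cosine similarity. The random data, the dimensions, and the `most_similar` helper are illustrative assumptions, not code from the blog.

```python
import numpy as np

# Stand-in data (an assumption): 1,000 random 64-dimensional "embeddings".
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(1000, 64)).astype(np.float32)

# Unit-normalize each row so that a dot product equals cosine similarity.
embeddings /= np.linalg.norm(embeddings, axis=1, keepdims=True)


def most_similar(query, matrix, k=3):
    """Return (indices, scores) of the k rows of `matrix` closest to `query`."""
    scores = matrix @ query                    # one dot product per row
    top = np.argpartition(scores, -k)[-k:]     # unordered top-k candidates
    top = top[np.argsort(scores[top])[::-1]]   # order them best-first
    return top, scores[top]


# Searching with row 42 as the query should rank row 42 itself first.
indices, scores = most_similar(embeddings[42], embeddings, k=3)
```

For a few thousand to a few hundred thousand rows, a single matrix-vector product like this is often fast enough that no dedicated index is needed.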

The Best Way to Use Text Embeddings Portably is With Parquet and Polars

The Intended-But-Not-Great Way to Store Embeddings

How do you use Parquet files in Python for embeddings?