Best of MLOpsOctober 2025

  1. 1
    Article
    Avatar of javarevisitedJavarevisited·29w

    I’ve Read 20+ Books on AI and LLM — Here Are My Top 5 Recommendations for 2026

    A curated list of five essential books for learning AI and LLM engineering, covering practical topics from building and fine-tuning models to production deployment. The recommendations include hands-on guides for prompt optimization, retrieval-augmented generation, model evaluation, infrastructure design, and understanding transformer architectures from scratch. Each book emphasizes production-ready engineering practices including monitoring, cost optimization, and system design rather than pure theory.

  2. 2
    Article
    Avatar of huggingfaceHugging Face·29w

    huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

    The huggingface_hub Python library has reached v1.0 after five years of development, now powering 200,000 dependent libraries and providing access to over 2 million models, 500,000 datasets, and 1 million Spaces. Major changes include migration from requests to httpx for modern HTTP infrastructure, a redesigned CLI replacing huggingface-cli with expanded features, and full adoption of hf_xet for file transfers with chunk-level deduplication. The release removes legacy patterns like the Git-based Repository class while maintaining backward compatibility for most ML libraries, though transformers v5 will be required for full v1.x support.

  3. 3
    Article
    Avatar of dailydoseofdsDaily Dose of Data Science | Avi Chawla | Substack·32w

    AI Agent Deployment Strategies

    Four deployment patterns for AI agents are explored: batch deployment for scheduled bulk processing with high throughput, stream deployment for continuous real-time data pipeline processing, real-time deployment via APIs for instant user interactions, and edge deployment on user devices for privacy and offline functionality. Each pattern serves different performance requirements, with batch optimizing throughput, stream enabling continuous monitoring, real-time providing sub-second responses, and edge ensuring data privacy without server dependencies.