Best of Big DataJuly 2025

  1. 1
    Article
    Avatar of hnHacker News·46w

    kepler.gl

    Kepler.gl is a WebGL-powered geospatial data visualization tool designed for analyzing and visualizing large-scale datasets in web browsers. Built with high-performance rendering capabilities, it enables interactive exploration of geographic data. Foursquare Studio extends kepler.gl's framework as a free analytics platform with regular feature updates.

  2. 2
    Article
    Avatar of duckdbDuckDB·46w

    DuckLake 0.2

    DuckLake 0.2 introduces significant improvements including secrets management for credentials, enhanced Parquet file settings, relative schema/table paths for better organization, name mapping for existing Parquet files, scoped settings at schema and table levels, and partition transforms. The update includes automatic migration from v0.1 and adds new functions like ducklake_list_files for better system integration.

  3. 3
    Article
    Avatar of salesforceengSalesforce Engineering·47w

    Architecting AI Agent Auditing Systems in Agentforce

    Salesforce's Feedback and Audit Trail team built an AI auditing system for Agentforce that handles 20 million model interactions monthly across 500 enterprise customers. The system overcame significant integration challenges with Data Cloud by using Kafka-based ingestion to manage unpredictable AI traffic patterns. Key technical solutions included dynamic flow control mechanisms, Tiger Team coordination across 8-10 cross-functional teams, and iterative development approaches. The architecture prioritizes trust, security, and compliance while maintaining scalability through continuous performance monitoring and architectural improvements.