Best of LinkedInNovember 2024

  1. 1
    Article
    Avatar of detlifeData Engineer Things·2y

    DataHub: The Metadata Platform Developed at LinkedIn

    LinkedIn developed DataHub, an open-sourced metadata catalog platform that evolved through three generations. Initially, LinkedIn used a monolithic application which later split into a metadata service with an API. The third generation emphasized a stream-based, real-time architecture and decentralization, enabling efficient, trustworthy metadata handling. DataHub supports various APIs, real-time metadata changes, and federated metadata services, making it reliable and suitable for large enterprises.

  2. 2
    Article
    Avatar of collectionsCollections·1y

    How LinkedIn Scaled Their System to 5 Million Queries Per Second

    LinkedIn scaled their Restrictions and Enforcement System to handle 5 million queries per second by using advanced techniques such as BitSets, Bloom Filters, and full refresh-ahead caching strategies. The architecture includes components like the Venice Database and Kafka for real-time data streaming, ensuring high availability, low latency, and efficient memory usage.