Best of ELK 2025

  1.
    Article
    BigData Boutique blog · 1y

    Elasticsearch vs OpenSearch - 2025 update

    An in-depth 2025 update comparing Elasticsearch and OpenSearch, touching on project status, performance, licensing, vector search capabilities, cost efficiency, and ecosystem solutions. OpenSearch has gained traction with open-source governance and additional vector search engines, while Elasticsearch maintains proprietary features and extensive integration solutions.

  2.
    Article
    roadmap.sh · 25w

    NEW ROADMAP: Elasticsearch

    A new learning roadmap for Elasticsearch has been released on roadmap.sh, providing a structured guide for developers working with search and database applications to learn and master Elasticsearch.

  3.
    Video
    Fireship · 1y

    Is Elon’s Grok 3 the new AI king?

    Elon's new AI model, Grok 3, has topped benchmarks and reached the number one spot on the LMArena leaderboard, outperforming other leading models on various tasks. It is unusual in having access to Twitter data and generates uncensored content, and a subscription-based Super Grok is planned for release soon. The model was trained on the world’s largest AI supercomputer in Memphis, Tennessee.

  4.
    Article
    Lobsters · 1y

    How Discord Indexes Trillions of Messages

    Discord scaled its search infrastructure to efficiently index trillions of messages using Elasticsearch and Kubernetes. They faced challenges like message drops, fault-intolerant bulk indexing, and performance issues due to large clusters. Solutions included deploying Elasticsearch on Kubernetes, adopting a multi-cluster architecture with dedicated nodes, and using PubSub for message queuing. Key achievements include improved indexing throughput, reduced query latency, and seamless cluster upgrades.

  5.
    Article
    Zalando · 23w

    The Day Our Own Queries DoS’ed Us: Inside Zalando Search

    Zalando's Search & Browse team experienced a self-inflicted denial of service when an internal application sent resource-intensive faceting queries on high-cardinality fields to their Elasticsearch cluster. The incident caused search slowdowns and empty results for customers. The team mitigated it by splitting markets across clusters and implementing load shedding, and eventually traced the issue to a bug in a maintenance workload that generated 50x the normal query volume. Key lessons included improving per-client monitoring with X-Opaque-Id headers, implementing query-level rate limiting, adding aggregation size controls, and recognizing that performance issues can stem from unexpected sources rather than the usual causes.
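The per-client rate limiting mentioned above can be sketched as a token bucket keyed by the `X-Opaque-Id` request header. This is only an illustrative sketch, not Zalando's implementation: the `PerClientLimiter` class, its parameters, and the idea of running it in a layer in front of the cluster are all assumptions made for the example.

```python
import time
from dataclasses import dataclass
from typing import Callable, Dict, Mapping


@dataclass
class TokenBucket:
    """Holds up to `capacity` tokens, refilled at `rate` tokens per second."""
    capacity: float
    rate: float
    tokens: float
    updated: float

    def allow(self, now: float, cost: float = 1.0) -> bool:
        # Refill for the time elapsed since the last call, capped at capacity.
        elapsed = max(0.0, now - self.updated)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.updated = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


class PerClientLimiter:
    """Hypothetical limiter: one bucket per X-Opaque-Id value."""

    def __init__(self, capacity: float, rate: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.capacity = capacity
        self.rate = rate
        self.clock = clock
        self.buckets: Dict[str, TokenBucket] = {}

    def allow(self, headers: Mapping[str, str]) -> bool:
        # Requests without the header share a single "unknown" bucket.
        client = headers.get("X-Opaque-Id", "unknown")
        now = self.clock()
        bucket = self.buckets.get(client)
        if bucket is None:
            bucket = TokenBucket(self.capacity, self.rate, self.capacity, now)
            self.buckets[client] = bucket
        return bucket.allow(now)
```

With a limiter like this, a runaway maintenance job exhausts only its own bucket, while requests tagged with another client id keep being admitted.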

  6.
    Article
    It's Foss · 22w

    Watch Out Elasticsearch! Tiger Data's PostgreSQL BM25 Search Extension Goes Open Source

    Tiger Data has open-sourced pg_textsearch, a PostgreSQL extension that enables BM25 relevance-ranked keyword searches directly within PostgreSQL. Previously available only on Tiger Cloud, the extension is now released under The PostgreSQL License on GitHub. It supports 29+ languages, works with partitioned tables, and uses a memtable architecture for efficient indexing. The extension allows developers to run modern search capabilities without setting up external systems like Elasticsearch, and can be combined with pgvector for hybrid keyword and semantic search within a single database.
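For readers unfamiliar with BM25, the ranking function can be sketched in a few lines. This is the textbook Okapi BM25 formula with the Lucene-style IDF, not the code used by pg_textsearch; the function name `bm25_scores`, the pre-tokenized input, and the defaults `k1=1.2` and `b=0.75` are assumptions chosen for illustration.

```python
import math
from collections import Counter
from typing import List, Sequence


def bm25_scores(query: Sequence[str], docs: Sequence[Sequence[str]],
                k1: float = 1.2, b: float = 0.75) -> List[float]:
    """Score each tokenized document against the tokenized query with BM25."""
    n_docs = len(docs)
    if n_docs == 0:
        return []
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if tf[term] == 0:
                continue
            n = doc_freq[term]
            idf = math.log(1 + (n_docs - n + 0.5) / (n + 0.5))
            # Length normalization: longer-than-average documents are penalized.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores
```

Term frequency saturates through `k1`, and `b` controls how strongly document length is normalized, which is what distinguishes BM25 from plain TF-IDF ranking.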

  7.
    Article
    elastic · 25w

    Elastic Stack 9.1.8 released

    Elastic Stack version 9.1.8 has been released with bug fixes and updates. The release is recommended as an upgrade over version 9.1.7, with full details available in the official release notes.

  8.
    Article
    elastic · 28w

    Elastic Stack 9.2.1 released

    Elastic Stack version 9.2.1 has been released with bug fixes and updates. The team recommends upgrading from previous versions, particularly 9.2.0, to this latest release. Full details of fixes and changes are available in the official release notes.

  9.
    Article
    Community Picks · 1y

    Pulse for Elasticsearch and OpenSearch - Search Cluster Monitoring & Maintenance, Redefined

    Pulse enhances the management of Elasticsearch and OpenSearch clusters by offering detailed cluster health assessments, actionable maintenance insights, and automated root cause analysis to optimize cost and performance.

  10.
    Article
    BigData Boutique blog · 1y

    Elasticsearch ELSER vs External Vector Embeddings

    Elasticsearch's ELSER is a native embedding model for hybrid search, offering memory efficiency and simple integration. However, external models like OpenAI and Cohere provide richer semantic understanding, complex relationship handling, and multilingual support. The choice between ELSER and external embeddings depends on specific application needs, including database commitment, language constraints, and query complexity. ELSER is cost-effective and easy to set up, whereas external models might involve extra costs and complexity due to additional computational requirements.

  11.
    Article
    Neon · 49w

    Comparing Native Postgres, ElasticSearch, and pg_search for Full-Text Search

    PostgreSQL's native full-text search works for small datasets but struggles with performance and search quality at scale. ElasticSearch provides superior search capabilities but introduces operational complexity with dual data stores and sync challenges. pg_search offers a middle ground by embedding a modern search engine (powered by Tantivy) directly into PostgreSQL, delivering BM25 ranking, fuzzy matching, and real-time indexing without the operational overhead of managing separate systems.

  12.
    Article
    BigData Boutique blog · 1y

    OpenSearch & Elasticsearch Scroll and Deep Paging Methods Compared

    Deep paging in Elasticsearch or OpenSearch can be resource-intensive. Various methods are available for deep paging, including the `from` and `size` parameters, Scroll API, PIT API, and `search_after`. Each method has its pros and cons, with `search_after` being the best for real-time pagination and the Scroll API suitable for bulk data retrieval. Choosing the right method based on the use case is crucial for maintaining cluster performance.
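As a rough sketch of how `search_after` pagination works: each request repeats the same sort and passes the sort values of the last hit from the previous page as the cursor. The helpers `build_page_request` and `iterate_hits` are hypothetical names, and `search` stands in for whatever client call sends the body to the cluster; only the `size`, `sort`, and `search_after` request keys and the `hits.hits[].sort` response shape come from the search API itself.

```python
from typing import Any, Callable, Dict, Iterator, List, Optional


def build_page_request(size: int, sort: List[Dict[str, str]],
                       search_after: Optional[List[Any]] = None) -> Dict[str, Any]:
    """Build a search body for one page; the first page has no cursor."""
    body: Dict[str, Any] = {"size": size, "sort": sort}
    if search_after is not None:
        body["search_after"] = search_after
    return body


def iterate_hits(search: Callable[[Dict[str, Any]], Dict[str, Any]],
                 size: int, sort: List[Dict[str, str]]) -> Iterator[Dict[str, Any]]:
    """Yield every hit, fetching pages until an empty page comes back."""
    cursor: Optional[List[Any]] = None
    while True:
        response = search(build_page_request(size, sort, cursor))
        hits = response["hits"]["hits"]
        if not hits:
            return
        yield from hits
        # The sort values of the last hit become the cursor for the next page.
        cursor = hits[-1]["sort"]
```

The sort should end with a unique tiebreaker field, otherwise documents with equal sort values can be skipped or repeated across pages. Unlike `from` and `size`, the cost of each page does not grow with how deep the pagination has gone.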

  13.
    Article
    Laravel News · 52w

    ElasticLens: Eloquent-Powered Elasticsearch for Laravel

    ElasticLens is a Laravel package that integrates Elasticsearch with Eloquent models, providing advanced search capabilities while keeping familiar Laravel syntax. The package introduces an Index-Model architecture that automatically synchronizes with your database and supports complex queries, including full-text search, geolocation filtering, and field boosting. It offers zero-configuration setup, custom field mapping, index migrations, and automatic model synchronization through observers.

  14.
    Article
    Noted · 1y

    Diskover: Search Files and Analyze Their Storage Usage

    Diskover helps users efficiently analyze file sizes and search data on their servers. Utilizing Elasticsearch, it offers rapid indexing and data retrieval. The Community Edition enables crawling and indexing data, sorting files by various attributes, and searching within one index. Installation is facilitated by Docker Compose. Enhanced features are behind a paywall. Basic functionalities include size and date searches, with more advanced analytics requiring paid versions.

  15.
    Article
    Noted · 1y

    Index and Search Every File on Your Homelab Server using Sist2

    Sist2 is a high-performance file indexing and search tool built with C and Vue.js, using Elasticsearch for efficient content retrieval. Its features include scanning and scheduling, multi-platform support, OCR integration, and more. This post provides a guide on how to set up Sist2 using Docker, index files, and customize settings to optimize your search experience on a homelab server.