Rerankers Aren’t Magic Either: When the Cross-Encoder Layer Is Worth the Cost

An empirical comparison of four embedding models (GloVe through text-embedding-3-large) and three cross-encoder rerankers (bge-base, bge-large, ms-marco-MiniLM) across five challenging RAG query shapes reveals that the expected cost-performance gradient mostly doesn't hold. On four of five test cases, rerankers either match or underperform strong embeddings. Only signal dilution in long context is a clear reranker win. Negation, out-of-domain vocabulary, listing queries, and exact identifiers at scale remain broken regardless of scorer. The article argues that upstream architectural choices — question parsing, classify-before-retrieve, and expert keyword dictionaries — deliver more value per dollar than stacking a reranker on weak retrieval.

#rag

#embeddings

Yesterday•20m read time•From towardsdatascience.com

Table of contents

1. What a reranker actually is 2. The cost-perf gradient, tested on the same cases 3. Where the cross-encoder still breaks 4. Where rerankers actually justify their cost 5. Conclusion 6. Further reading

Comment

Bookmark

Copy

Sort: