Cross-encoder reranker trained on the MS MARCO passage retrieval dataset, designed to score query-document pairs jointly rather than encoding them independently. Distilled from a 12-layer cross-encoder into 6 layers to reduce latency while retaining re-ranking accuracy. Used as a second-stage ranker on top of fast first-stage retrieval (BM25 or bi-encoder).
80,423,371 ↓ · 269 ♡
A 4-layer MiniLM cross-encoder fine-tuned on MS MARCO passage ranking for fast query-document relevance scoring. Offers the lowest latency in the MS MARCO cross-encoder series at the cost of some ranking accuracy.
3,809,022 ↓ · 15 ♡
The 12-layer variant of MiniLM fine-tuned on MS MARCO, providing the highest accuracy in the MiniLM cross-encoder family at the cost of latency. A strong default choice for reranking when compute allows.
2,508,734 ↓ · 106 ♡
gte-reranker-modernbert-base is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
2,210,907 ↓ · 95 ♡
Qwen3-Reranker-0.6B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
2,039,531 ↓ · 368 ♡
jina-reranker-v2-base-multilingual is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
1,790,615 ↓ · 352 ♡
Qwen3-Reranker-4B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
1,734,504 ↓ · 145 ♡
mmarco-mMiniLMv2-L12-H384-v1 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
1,458,314 ↓ · 75 ♡
ms-marco-MiniLM-L2-v2 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
1,251,918 ↓ · 14 ♡
Qwen3-Reranker-8B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
1,092,486 ↓ · 244 ♡
jina-reranker-v3 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
970,944 ↓ · 140 ♡
Qwen3-VL-Reranker-8B is Qwen's multimodal cross-encoder reranker built on the Qwen3-VL-8B architecture, capable of reranking both text-only and text+image query-document pairs. It is trained with contrastive reranking objectives to score relevance between a query and a set of retrieved passages or image-text chunks, making it the first VLM-based reranker in the MTEB ecosystem.
646,940 ↓ · 153 ♡
mxbai-rerank-xsmall-v1 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
554,878 ↓ · 57 ♡
mxbai-rerank-base-v1 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
515,428 ↓ · 46 ♡
rank1-7b is a 7B-parameter reranker fine-tuned from Qwen2.5-7B by Johns Hopkins CLSP, designed to score candidate passages for retrieval-augmented generation and information retrieval pipelines. It is described in arxiv:2502.18418 and trained on the jhu-clsp/rank1-training-data dataset. The MIT license and Text Embeddings Inference compatibility make it straightforward to deploy in production retrieval stacks.
468,220 ↓ · 4 ♡
japanese-reranker-cross-encoder-small-v1 is an openly licensed reranking and retrieval scoring model in the sentence transformers family. japanese-reranker-cross-encoder-small-v1 is MIT-licensed, clearing it for closed-source and paid products. Like most open checkpoints, japanese-reranker-cross-encoder-small-v1 rewards a quick in-domain eval before commitment.
367,802 ↓ · 5 ♡
llama-nemotron-rerank-1b-v2 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
357,535 ↓ · 52 ♡
crossencoder-camembert-base-mmarcoFR is a sentence transformers-based open-weight model aimed at reranking and retrieval scoring. Permissive MIT terms let crossencoder-camembert-base-mmarcoFR go straight into commercial pipelines. The weights start from camembert-base and specialize it for the target task. Read crossencoder-camembert-base-mmarcoFR's card for hardware requirements and licensing fine print before deploying.
343,937 ↓ · 7 ♡
As a sentence transformers-based open-weight model, stsb-roberta-large focuses on reranking and retrieval scoring. The Apache 2.0 license keeps stsb-roberta-large unrestricted for commercial reuse. Before relying on stsb-roberta-large, reproduce its key numbers on representative inputs.
341,196 ↓ · 14 ♡
Qwen3-VL-Reranker-2B scores query-document pairs for relevance. Used as a cross-encoder reranker, it jointly encodes the pair and outputs a single relevance score.
318,777 ↓ · 194 ♡
mxbai-rerank-large-v2 is Mixedbread AI's cross-encoder reranker built on Qwen2, covering 100+ languages. Rerankers score query-document pairs for relevance, improving retrieval quality when used after a fast bi-encoder retrieval step. Apache-2.0 licensed with text-embeddings-inference compatibility.
314,720 ↓ · 140 ♡
A sequence-classification-format wrapper of the Qwen3-Reranker-0.6B by Tom Aarsen (sentence-transformers maintainer), making it compatible with the CrossEncoder class without generative decoding. This format is faster for reranking than the generative format.
306,747 ↓ · 30 ♡
A 2-layer TinyBERT cross-encoder fine-tuned on MS MARCO passage ranking by the sentence-transformers team. Extremely small and fast, it trades accuracy for speed in reranking pipelines where a larger cross-encoder would be the throughput bottleneck.
300,817 ↓ · 44 ♡
Built for reranking and retrieval scoring, mxbai-rerank-large-v1 is a sentence transformers-based model with publicly available weights. mxbai-rerank-large-v1 is Apache 2.0-licensed, clearing it for closed-source and paid products. Read mxbai-rerank-large-v1's card for hardware requirements and licensing fine print before deploying.
299,638 ↓ · 142 ♡