text ranking models

24 models · ranked by HuggingFace downloads

ms-marco-MiniLM-L6-v2

Cross-encoder reranker trained on the MS MARCO passage retrieval dataset, designed to score query-document pairs jointly rather than encoding them independently. Distilled from a 12-layer cross-encoder into 6 layers to reduce latency while retaining re-ranking accuracy. Used as a second-stage ranker on top of fast first-stage retrieval (BM25 or bi-encoder).

80,423,371 ↓ · 269 ♡

ms-marco-MiniLM-L4-v2

A 4-layer MiniLM cross-encoder fine-tuned on MS MARCO passage ranking for fast query-document relevance scoring. Offers the lowest latency in the MS MARCO cross-encoder series at the cost of some ranking accuracy.

3,809,022 ↓ · 15 ♡

ms-marco-MiniLM-L12-v2

The 12-layer variant of MiniLM fine-tuned on MS MARCO, providing the highest accuracy in the MiniLM cross-encoder family at the cost of latency. A strong default choice for reranking when compute allows.

2,508,734 ↓ · 106 ♡

gte-reranker-modernbert-base

gte-reranker-modernbert-base is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

2,210,907 ↓ · 95 ♡

Qwen3-Reranker-0.6B

Qwen3-Reranker-0.6B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

2,039,531 ↓ · 368 ♡

jina-reranker-v2-base-multilingual

jina-reranker-v2-base-multilingual is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

1,790,615 ↓ · 352 ♡

Qwen3-Reranker-4B

Qwen3-Reranker-4B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

1,734,504 ↓ · 145 ♡

mmarco-mMiniLMv2-L12-H384-v1

mmarco-mMiniLMv2-L12-H384-v1 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

1,458,314 ↓ · 75 ♡

ms-marco-MiniLM-L2-v2

ms-marco-MiniLM-L2-v2 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

1,251,918 ↓ · 14 ♡

Qwen3-Reranker-8B

Qwen3-Reranker-8B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

1,092,486 ↓ · 244 ♡

jina-reranker-v3

jina-reranker-v3 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

970,944 ↓ · 140 ♡

Qwen3-VL-Reranker-8B

Qwen3-VL-Reranker-8B is Qwen's multimodal cross-encoder reranker built on the Qwen3-VL-8B architecture, capable of reranking both text-only and text+image query-document pairs. It is trained with contrastive reranking objectives to score relevance between a query and a set of retrieved passages or image-text chunks, making it the first VLM-based reranker in the MTEB ecosystem.

646,940 ↓ · 153 ♡

mxbai-rerank-xsmall-v1

mxbai-rerank-xsmall-v1 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

554,878 ↓ · 57 ♡

mxbai-rerank-base-v1

mxbai-rerank-base-v1 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

515,428 ↓ · 46 ♡

rank1-7b

rank1-7b is a 7B-parameter reranker fine-tuned from Qwen2.5-7B by Johns Hopkins CLSP, designed to score candidate passages for retrieval-augmented generation and information retrieval pipelines. It is described in arxiv:2502.18418 and trained on the jhu-clsp/rank1-training-data dataset. The MIT license and Text Embeddings Inference compatibility make it straightforward to deploy in production retrieval stacks.

468,220 ↓ · 4 ♡

japanese-reranker-cross-encoder-small-v1

japanese-reranker-cross-encoder-small-v1 is an openly licensed reranking and retrieval scoring model in the sentence transformers family. japanese-reranker-cross-encoder-small-v1 is MIT-licensed, clearing it for closed-source and paid products. Like most open checkpoints, japanese-reranker-cross-encoder-small-v1 rewards a quick in-domain eval before commitment.

367,802 ↓ · 5 ♡

llama-nemotron-rerank-1b-v2

llama-nemotron-rerank-1b-v2 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

357,535 ↓ · 52 ♡

crossencoder-camembert-base-mmarcoFR

crossencoder-camembert-base-mmarcoFR is a sentence transformers-based open-weight model aimed at reranking and retrieval scoring. Permissive MIT terms let crossencoder-camembert-base-mmarcoFR go straight into commercial pipelines. The weights start from camembert-base and specialize it for the target task. Read crossencoder-camembert-base-mmarcoFR's card for hardware requirements and licensing fine print before deploying.

343,937 ↓ · 7 ♡

stsb-roberta-large

As a sentence transformers-based open-weight model, stsb-roberta-large focuses on reranking and retrieval scoring. The Apache 2.0 license keeps stsb-roberta-large unrestricted for commercial reuse. Before relying on stsb-roberta-large, reproduce its key numbers on representative inputs.

341,196 ↓ · 14 ♡

Qwen3-VL-Reranker-2B

Qwen3-VL-Reranker-2B scores query-document pairs for relevance. Used as a cross-encoder reranker, it jointly encodes the pair and outputs a single relevance score.

318,777 ↓ · 194 ♡

mxbai-rerank-large-v2

mxbai-rerank-large-v2 is Mixedbread AI's cross-encoder reranker built on Qwen2, covering 100+ languages. Rerankers score query-document pairs for relevance, improving retrieval quality when used after a fast bi-encoder retrieval step. Apache-2.0 licensed with text-embeddings-inference compatibility.

314,720 ↓ · 140 ♡

Qwen3-Reranker-0.6B-seq-cls

A sequence-classification-format wrapper of the Qwen3-Reranker-0.6B by Tom Aarsen (sentence-transformers maintainer), making it compatible with the CrossEncoder class without generative decoding. This format is faster for reranking than the generative format.

306,747 ↓ · 30 ♡

ms-marco-TinyBERT-L2-v2

A 2-layer TinyBERT cross-encoder fine-tuned on MS MARCO passage ranking by the sentence-transformers team. Extremely small and fast, it trades accuracy for speed in reranking pipelines where a larger cross-encoder would be the throughput bottleneck.

300,817 ↓ · 44 ♡

mxbai-rerank-large-v1

Built for reranking and retrieval scoring, mxbai-rerank-large-v1 is a sentence transformers-based model with publicly available weights. mxbai-rerank-large-v1 is Apache 2.0-licensed, clearing it for closed-source and paid products. Read mxbai-rerank-large-v1's card for hardware requirements and licensing fine print before deploying.

299,638 ↓ · 142 ♡