text classification models

47 models · ranked by HuggingFace downloads

bge-reranker-v2-m3

BGE-Reranker-v2-M3 is BAAI's multilingual cross-encoder reranker built on XLM-RoBERTa, designed for re-ranking retrieved passages in multilingual RAG or search pipelines. It jointly encodes query-passage pairs to produce relevance scores, providing higher accuracy than bi-encoder similarity for the same candidate set. Apache 2.0 licensed with text-embeddings-inference support.

16,278,800 ↓ · 1,056 ♡

finbert

FinBERT is a BERT model fine-tuned on financial news and financial communications text for financial sentiment analysis, classifying text as positive, negative, or neutral from a finance domain perspective. Developed by Prosus AI (Naspers), it targets applications where general-purpose sentiment models fail on financial jargon and market-specific framing.

7,479,376 ↓ · 1,184 ♡

bge-reranker-base

Cross-encoder reranker trained on multilingual data (English and Chinese) using XLM-RoBERTa. It scores query-document pairs directly rather than comparing embeddings, making it more accurate than bi-encoders for retrieval pipelines at the cost of higher latency.

4,148,348 ↓ · 239 ♡

twitter-roberta-base-sentiment-latest

RoBERTa-base fine-tuned on ~124M tweets for three-class sentiment classification (positive/neutral/negative). Trained by Cardiff NLP on the TweetEval benchmark, it consistently ranks among the top-performing tweet-specific sentiment models.

3,912,317 ↓ · 815 ♡

distilbert-base-uncased-finetuned-sst-2-english

distilbert-base-uncased-finetuned-sst-2-english is a DistilBERT model fine-tuned on the Stanford Sentiment Treebank v2 (SST-2) for binary positive/negative sentiment classification of English text. It is one of the most downloaded sentiment classifiers on HuggingFace and commonly used as a demonstration or fast baseline. The model achieves approximately 91% accuracy on the SST-2 validation set.

3,636,844 ↓ · 912 ♡

Prompt-Guard-86M

As a llama-based compact model, Prompt-Guard-86M focuses on text classification. Weighing in near 86M parameters, Prompt-Guard-86M trades some ceiling for cheaper, faster inference. Prompt-Guard-86M is subject to Llama 3.1 Community terms, so confirm licensing before commercial use. Prompt-Guard-86M ships without a hosted SLA, so budget for self-managed deployment and monitoring.

2,415,338 ↓ · 348 ♡

tiny-Qwen2ForSequenceClassification-2.5

tiny-Qwen2ForSequenceClassification-2.5 maps input sequences to one or more labels. Fine-tuned on labeled data, it covers tasks like sentiment analysis, topic detection, and intent classification.

1,572,281 ↓ · 1 ♡

sentiment-polish-gpt2-small

sentiment-polish-gpt2-small is a GPT2-small fine-tuned for Polish sentiment classification, trained on the PolEmo 2.0 dataset with four sentiment classes (positive, negative, ambivalent, neutral). GPT2 architecture fine-tunes cleanly for sequence classification despite being a causal LM by adding a classification head. It is one of the few openly available Polish-specific sentiment models at small scale.

1,395,046 ↓ · 1 ♡

fasttext-language-identification

Meta's fastText-based language identification model, capable of identifying 176 languages from short text strings. Extremely fast CPU inference makes it practical for preprocessing pipelines that need to route text by language.

1,367,827 ↓ · 269 ♡

twitter-xlm-roberta-base-sentiment

twitter-xlm-roberta-base-sentiment classifies text into predefined label categories using a RoBERTa encoder fine-tuned with a classification head. It outputs per-class logits.

1,328,187 ↓ · 269 ♡

RADAR-Vicuna-7B

RADAR-Vicuna-7B maps input sequences to one or more labels. Fine-tuned on labeled data, it covers tasks like sentiment analysis, topic detection, and intent classification.

1,326,427 ↓ · 13 ♡

roberta-base-go_emotions

This RoBERTa-base model fine-tuned on Google's GoEmotions dataset classifies English text into 28 emotion categories or groups them into sentiment buckets. It supports multi-label classification, meaning a sentence can be tagged with several emotions simultaneously. The model is widely used in sentiment analytics pipelines and social media monitoring tools.

1,253,504 ↓ · 680 ♡

twitter-roberta-base-sentiment

twitter-roberta-base-sentiment maps input sequences to one or more labels. Fine-tuned on labeled data, it covers tasks like sentiment analysis, topic detection, and intent classification.

1,111,818 ↓ · 337 ♡

bert-base-multilingual-uncased-sentiment

As a bert-based open-weight model, bert-base-multilingual-uncased-sentiment focuses on text classification. Training spans multiple languages, so bert-base-multilingual-uncased-sentiment covers cross-lingual text classification from one checkpoint. The MIT license keeps bert-base-multilingual-uncased-sentiment unrestricted for commercial reuse. Before relying on bert-base-multilingual-uncased-sentiment, reproduce its key numbers on representative inputs.

951,882 ↓ · 479 ♡

roberta-base-openai-detector

roberta-base-openai-detector is a RoBERTa-base binary classifier trained to distinguish GPT-2-generated text from human-written text. It was released by the OpenAI Grover team and works by fine-tuning on paired human/machine samples from the GPT-2 output corpus. As an early-generation AI text detector, it is most accurate on GPT-2 output and significantly less reliable on newer LLMs.

904,396 ↓ · 134 ♡

emotion-english-distilroberta-base

emotion-english-distilroberta-base classifies text into predefined label categories using a RoBERTa encoder fine-tuned with a classification head. It outputs per-class logits.

798,466 ↓ · 500 ♡

bge-reranker-v2-gemma

bge-reranker-v2-gemma is BAAI's cross-encoder reranker built on a Gemma backbone, trained for multilingual passage reranking. Unlike bi-encoders that embed query and document independently, it attends jointly to both as a cross-encoder for higher reranking accuracy. It is designed to be used as the second stage after a bi-encoder retrieval step in two-stage RAG pipelines.

783,623 ↓ · 85 ♡

finbert-tone

Built for text classification, finbert-tone is a bert-based model with publicly available weights. Read finbert-tone's card for hardware requirements and licensing fine print before deploying.

753,638 ↓ · 220 ♡

turn-detector

turn-detector is an open-weight text classification model in the llama family. turn-detector is multilingual by design rather than English-only. Licensing for turn-detector is unspecified or custom — clear it before commercial use. turn-detector is community-maintained, so track upstream changes and pin a known-good revision.

682,274 ↓ · 113 ♡

distilbert-base-multilingual-cased-sentiments-student

As a distilbert-based open-weight model, distilbert-base-multilingual-cased-sentiments-student focuses on text classification. Training spans multiple languages, so distilbert-base-multilingual-cased-sentiments-student covers cross-lingual text classification from one checkpoint. The Apache 2.0 license keeps distilbert-base-multilingual-cased-sentiments-student unrestricted for commercial reuse. Before relying on distilbert-base-multilingual-cased-sentiments-student, reproduce its key numbers on representative inputs.

661,109 ↓ · 314 ♡

jina-reranker-m0

jina-reranker-m0 is a multimodal reranker fine-tuned from Qwen2-VL-2B-Instruct, capable of scoring text and image relevance for retrieval pipelines. It is tagged for the Vidore benchmark, indicating focus on document and visual retrieval reranking. The CC-BY-NC-4.0 license restricts commercial use.

631,985 ↓ · 120 ♡

xlm-roberta-base-language-detection

xlm-roberta-base-language-detection maps input sequences to one or more labels. Fine-tuned on labeled data, it covers tasks like sentiment analysis, topic detection, and intent classification.

527,179 ↓ · 375 ♡

robertuito-sentiment-analysis

robertuito-sentiment-analysis classifies text into predefined label categories using a RoBERTa encoder fine-tuned with a classification head. It outputs per-class logits.

463,885 ↓ · 100 ♡

Qwen2.5-1.5B-apeach

Qwen2.5-1.5B-apeach is a Korean hate speech detection classifier fine-tuned on the APEACH dataset using Qwen2.5-1.5B as the backbone. APEACH (Automated Pipeline for Evaluation Against Crowdsourced Hate speech) is a Korean benchmark for detecting offensive and hateful content. This model converts the LLM into a sequence classifier for binary or multi-class Korean content moderation.

459,152 ↓ · 6 ♡

Qwen3-Reranker-4B-W4A16-G128

A W4A16 (4-bit weights, 16-bit activations) quantized version of the Qwen3-Reranker-4B, enabling efficient cross-encoder reranking inference. The group-size-128 quantization balances compression ratio against accuracy retention for reranking tasks.

456,135 ↓ · 2 ♡

MedCPT-Cross-Encoder

MedCPT-Cross-Encoder is a BERT-based cross-encoder from NCBI fine-tuned for medical text relevance scoring, trained on PubMed query-article pairs. It takes a query and a candidate passage and scores their relevance, making it a reranking component in medical information retrieval pipelines. The model is typically paired with MedCPT's query encoder for a full retrieval-reranking system.

440,075 ↓ · 31 ♡

cryptobert

CryptoBERT is a RoBERTa-based sentiment classifier fine-tuned on the StockTwits crypto dataset, designed specifically for cryptocurrency-related social media text. It outputs three-class sentiment labels (bullish, bearish, neutral) and targets the linguistic patterns common in crypto discourse on platforms like Twitter and StockTwits. The model is trained exclusively on English-language crypto content, making it narrow in scope but well-suited for that domain.

385,681 ↓ · 191 ♡

KR-FinBert-SC

As a bert-based open-weight model, KR-FinBert-SC focuses on text classification. KR-FinBert-SC ships without a hosted SLA, so budget for self-managed deployment and monitoring.

361,071 ↓ · 39 ♡

OTel-Reranker-0.6B

A 0.6B cross-encoder reranker from farbodtavakkoli specialized for reranking OpenTelemetry log, trace, and metric documents. Pairs with OTel-Embedding models for a two-stage observability retrieval pipeline.

360,544 ↓ · 1 ♡

roberta-large-mnli

RoBERTa-large fine-tuned on the Multi-Genre Natural Language Inference (MNLI) corpus, commonly used for zero-shot text classification via the NLI entailment trick. One of the most frequently used models for zero-shot classification before dedicated models like DeBERTa-MNLI improved further.

349,227 ↓ · 210 ♡

koelectra-small-v3-nsmc

KoELECTRA-small-v3 fine-tuned on the Naver Sentiment Movie Corpus (NSMC) for Korean binary sentiment classification. It uses the ELECTRA discriminator architecture, which is more parameter-efficient than BERT for the same task performance.

343,464 ↓ · 7 ♡

deberta-large-mnli

deberta-large-mnli classifies text into predefined label categories using a DeBERTa encoder fine-tuned with a classification head. It outputs per-class logits.

341,460 ↓ · 32 ♡

internlm2-1_8b-reward

internlm2-1_8b-reward is an open-weight model aimed at text classification. internlm2-1_8b-reward's 8000M-parameter size keeps hosting requirements modest relative to frontier models. internlm2-1_8b-reward lists a non-standard license, so confirm permissions before deployment. internlm2-1_8b-reward ships without a hosted SLA, so budget for self-managed deployment and monitoring.

339,215 ↓ · 16 ♡

deberta-v3-base-prompt-injection-v2

deberta-v3-base-prompt-injection-v2 targets text classification and is shipped as an open-weight, self-hostable checkpoint. It is a fine-tune of deberta-v3-base, inheriting that base model's general competence. Permissive Apache 2.0 terms let deberta-v3-base-prompt-injection-v2 go straight into commercial pipelines. Treat deberta-v3-base-prompt-injection-v2's published metrics as a starting point and validate against your workload.

329,160 ↓ · 108 ♡

beto-sentiment-analysis

BETO-based sentiment analysis model from finiteautomata, fine-tuned for Spanish sentiment classification. BETO is the Spanish BERT model, and this checkpoint targets positive/negative/neutral classification on Spanish text.

324,795 ↓ · 35 ♡

xlm-emo-t

Built for text classification, xlm-emo-t is a xlm roberta-based model with publicly available weights. Training spans multiple languages, so xlm-emo-t covers cross-lingual text classification from one checkpoint. Check the xlm-emo-t model card for benchmarks and intended use before adopting it.

309,766 ↓ · 11 ♡

phishing-email-detection-distilbert_v2.4.1

phishing-email-detection-distilbert_v2.4.1 is an open-weight checkpoint for text classification, distributed on the HuggingFace Hub. The Apache 2.0 license keeps phishing-email-detection-distilbert_v2.4.1 unrestricted for commercial reuse. It is a fine-tune of distilbert-base-uncased, inheriting that base model's general competence. Like most open checkpoints, phishing-email-detection-distilbert_v2.4.1 rewards a quick in-domain eval before commitment.

308,997 ↓ · 26 ♡

inclusively-classification

inclusively-classification is a bert-based open-weight model aimed at text classification. Because inclusively-classification uses CC BY-NC-SA 4.0, vet the conditions against your deployment plan. Check the inclusively-classification model card for benchmarks and intended use before adopting it.

307,175 ↓ · 1 ♡

distilroberta-finetuned-financial-news-sentiment-analysis

distilroberta-finetuned-financial-news-sentiment-analysis is an openly licensed text classification model in the roberta family. distilroberta-finetuned-financial-news-sentiment-analysis is Apache 2.0-licensed, clearing it for closed-source and paid products. Evaluate distilroberta-finetuned-financial-news-sentiment-analysis on your own data before trusting it in production.

304,970 ↓ · 458 ♡

distilbert-base-uncased-emotion

distilbert-base-uncased-emotion is an open-weight checkpoint for text classification, distributed on the HuggingFace Hub. The Apache 2.0 license keeps distilbert-base-uncased-emotion unrestricted for commercial reuse. Like most open checkpoints, distilbert-base-uncased-emotion rewards a quick in-domain eval before commitment.

303,852 ↓ · 164 ♡

rubert-base-cased-sentiment-rusentiment

rubert-base-cased-sentiment-rusentiment is an open-weight text classification model in the bert family. Evaluate rubert-base-cased-sentiment-rusentiment on your own data before trusting it in production.

302,698 ↓ · 15 ♡

phobert-base-vietnamese-sentiment

Built for text classification, phobert-base-vietnamese-sentiment is a roberta-based model with publicly available weights. phobert-base-vietnamese-sentiment is MIT-licensed, clearing it for closed-source and paid products. Check the phobert-base-vietnamese-sentiment model card for benchmarks and intended use before adopting it.

302,680 ↓ · 16 ♡

multilingual-sentiment-analysis

multilingual-sentiment-analysis classifies text into predefined label categories using a DistilBERT encoder fine-tuned with a classification head. It outputs per-class logits.

300,736 ↓ · 373 ♡

FinBERT-PT-BR

FinBERT-PT-BR is an openly licensed text classification model in the bert family. FinBERT-PT-BR is Apache 2.0-licensed, clearing it for closed-source and paid products. Treat FinBERT-PT-BR's published metrics as a starting point and validate against your workload.

295,424 ↓ · 29 ♡

roberta_toxicity_classifier

roberta_toxicity_classifier classifies text into predefined label categories using a RoBERTa encoder fine-tuned with a classification head. It outputs per-class logits.

287,679 ↓ · 72 ♡

twitter-xlm-roberta-base-sentiment-multilingual

twitter-xlm-roberta-base-sentiment-multilingual is an open-weight checkpoint for text classification, distributed on the HuggingFace Hub. Treat twitter-xlm-roberta-base-sentiment-multilingual's published metrics as a starting point and validate against your workload.

286,538 ↓ · 31 ♡

deberta-xlarge-mnli

DeBERTa-XLarge fine-tuned on the Multi-Genre Natural Language Inference (MNLI) dataset for zero-shot text classification via entailment. DeBERTa's disentangled attention mechanism improves on BERT for NLI tasks. The xlarge variant (900M parameters) provides strong NLI accuracy but is expensive for high-throughput use.

261,138 ↓ · 23 ♡