speaker-diarization-3.1
speaker-diarization-3.1 is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
32 models · ranked by HuggingFace downloads
speaker-diarization-3.1 is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
whisperkit-coreml is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
whisper-large-v3-turbo is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
whisper-large-v3 is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
wav2vec2-large-xlsr-53-russian is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
mms-300m-1130-forced-aligner is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
wav2vec2-large-xlsr-53-portuguese is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
voice-activity-detection is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
speaker-diarization-community-1 is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
whisper-small is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
Qwen3-ASR-1.7B is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
whisper-base is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
wav2vec2-large-xlsr-53-polish is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
distil-large-v3 is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
mms-1b-all is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
wav2vec2-base-960h is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
wav2vec2-large-xlsr-53-japanese is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
wav2vec2-large-xlsr-53-chinese-zh-cn is an open-source automatic-speech-recognition model available on HuggingFace. Details are sourced from the public model registry.
A wav2vec2-large XLSR model fine-tuned on Korean speech data from the Zeroth Korean dataset. Performs automatic speech recognition for Korean audio, leveraging cross-lingual self-supervised pretraining followed by supervised fine-tuning on Korean-specific acoustic patterns.