VibeVoice-Realtime-0.5B

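VibeVoice-Realtime-0.5B is an open-source text-to-speech model available on Hugging Face. According to its registry tags, it is fine-tuned from Qwen/Qwen2.5-0.5B, targets English, and supports realtime TTS with streaming text input and long-form speech generation. Details are sourced from the public model registry.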

Use cases

  • Building text-to-speech applications
  • Research and experimentation
  • Open-source AI prototyping

Pros

  • Open weights available
  • Community support on Hugging Face

Cons

  • Requires manual evaluation before production use
  • The registry tags list an MIT license; confirm the exact terms on the model card

FAQ

What is VibeVoice-Realtime-0.5B used for?

It is used for building text-to-speech applications, for research and experimentation, and for open-source AI prototyping.

Is VibeVoice-Realtime-0.5B free to use?

VibeVoice-Realtime-0.5B is an open-source model published on Hugging Face, and its registry tags list the license as MIT. Check the model card to confirm the specific license terms before use.

How do I run VibeVoice-Realtime-0.5B locally?

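Most Hugging Face models can be loaded with transformers or the appropriate framework library. This model is tagged with transformers and safetensors, but it also carries a vibevoice_streaming tag, which suggests a custom model type that may need dedicated inference code. See the model card for framework-specific instructions and hardware requirements.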
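As a starting point, the sketch below only fetches the model files to disk and lists the safetensors weight files; it does not run inference, since the model card is the authority on how to load the model. The repository id is an assumption (this page does not name the publisher), and download_model and list_weight_files are hypothetical helper names. It assumes the huggingface_hub package is installed.

```python
from pathlib import Path

# Assumption: this repository id is a guess; this page does not state the
# publisher, so confirm the exact id on the model card.
REPO_ID = "microsoft/VibeVoice-Realtime-0.5B"


def download_model(repo_id: str = REPO_ID, local_dir: str = "models") -> Path:
    """Download every file in the model repository and return the local folder."""
    # Imported here so the rest of the module works without the package installed.
    from huggingface_hub import snapshot_download

    target = Path(local_dir) / repo_id.split("/")[-1]
    return Path(snapshot_download(repo_id=repo_id, local_dir=str(target)))


def list_weight_files(model_dir: Path) -> list[str]:
    """Return the sorted names of all .safetensors files under model_dir."""
    return sorted(p.name for p in Path(model_dir).rglob("*.safetensors"))


# Example usage (requires network access):
#   model_dir = download_model()
#   print(list_weight_files(model_dir))
```

Downloading first and inspecting the files keeps the step independent of whichever inference code the model card recommends.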

Tags

  • transformers
  • safetensors
  • vibevoice_streaming
  • Realtime TTS
  • Streaming text input
  • Long-form speech generation
  • text-to-speech
  • en
  • arxiv:2508.19205
  • arxiv:2412.08635
  • base_model:Qwen/Qwen2.5-0.5B
  • base_model:finetune:Qwen/Qwen2.5-0.5B
  • license:mit
  • endpoints_compatible
  • region:us