Speech Recognition Python Tutorial

资讯

Scaling Multilingual Visual Speech Recognition

Abstract: Visual Speech Recognition (lip-reading) has witnessed tremendous improvements, reaching word error rates as low as 12.8 WER in English. However, the ...

IEEE

Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly ...

Abstract: There exist three approaches for multilingual and crosslingual automatic speech recognition (MCL-ASR) - supervised pretraining with phonetic or graphemic transcription, and self-supervised ...

GitHub

Real-Time Cascading Speech-to-Speech Chatbot

A real-time cascading speech-to-speech chatbot that combines advanced speech recognition, AI reasoning, and neural text-to-speech capabilities. Built for seamless voice interactions with web ...

GitHub

fish-speech

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

一些您可能无法访问的结果已被隐去。

显示无法访问的结果