-
Updated
May 27, 2022 - Python
#
tts
Here are 1,161 public repositories matching this topic...
Clone a voice in 5 seconds to generate arbitrary speech in real-time
python
text-to-speech
deep-learning
speech
pytorch
tts
vocoder
tacotron
tensorflow2
tacotron2
melgan
speaker-encoder
dataset-analysis
glow-tts
multiband-melgan
gantts
-
Updated
Apr 23, 2022 - Jupyter Notebook
python
text-to-speech
deep-learning
speech
pytorch
tts
speech-synthesis
speech-recognition
vocoder
voice-synthesis
tacotron
voice-cloning
speaker-encodings
melgan
speaker-encoder
multi-speaker-tts
glow-tts
hifigan
tts-model
-
Updated
Jun 2, 2022 - Python
alexa
ai
amazon-echo
muse
tts
google-home
unit
bci
speaker
homeassistant
snowboy
asr
anyq
raspeberry-pi
-
Updated
May 13, 2022 - Python
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
-
Updated
Jan 24, 2022 - Python
text-to-speech
real-time
tts
speech-synthesis
vocoder
tflite
tensorflow2
fastspeech
tacotron2
melgan
multi-speaker-tts
multiband-melgan
fastspeech2
parallel-wavegan
mobile-tts
zh-tts
chinese-tts
korea-tts
german-tts
japanese-tts
-
Updated
Mar 10, 2022 - Python
Lingvo
nlp
research
translation
tensorflow
machine-translation
speech
distributed
tts
speech-synthesis
mnist
speech-recognition
lm
seq2seq
speech-to-text
gpu-computing
language-model
asr
-
Updated
May 27, 2022 - Python
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
-
Updated
Jan 17, 2022 - Python
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
-
Updated
Jun 2, 2022 - Java
WaveRNN Vocoder + TTS
-
Updated
Mar 2, 2022 - Python
Open
Multi GPU Support
4
tanmayb123
commented
Mar 4, 2018
I'd like to train this model on 8 V100 GPUs - does it support multi GPU training?
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
text-to-speech
german
speech
pytorch
tts
speech-synthesis
english
speech-recognition
spanish
colab
speech-to-text
pretrained-models
stt
asr
capitalization
onnx
stt-benchmark
tts-models
torch-hub
repunctuation
-
Updated
May 23, 2022 - Jupyter Notebook
Python library and CLI tool to interface with Google Translate's text-to-speech API
-
Updated
May 11, 2022 - Python
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
-
Updated
Jun 7, 2018 - Python
alex19EP
commented
May 12, 2022
since, at least in Arch Linux, each voice has its own package, the user can install rhvoice without voices, and when switching the synthesis module to RHVoice in orca, for example, the user will be left without speech.
good first issue
This is a good first issue for a new contributor
<Bug>
This needs to be fixed
Code: C++
OS: Linux
Issues related to Linux operating system
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
text-to-speech
realtime
pytorch
tts
speech-synthesis
wavenet
vocoder
parallel-wavenet
neural-vocoder
melgan
hifigan
style-melgan
-
Updated
May 30, 2022 - Jupyter Notebook
-
Updated
May 19, 2022 - Python
Free and open source text-to-speech software
-
Updated
Dec 6, 2021 - Vue
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
-
Updated
Apr 28, 2021 - Python
text-to-speech
tts
speech-synthesis
voice-recognition
speech-recognition
speech-to-text
stt
speech-processing
voice-activity-detection
speech-separation
speech-emotion-recognition
voice-cloning
-
Updated
Jan 25, 2022
an open-source implementation of sequence-to-sequence based speech processing engine
deployment
tensorflow
tts
speech-synthesis
transformer
speech-recognition
sequence-to-sequence
unsupervised-learning
speaker-recognition
asr
ctc
wfst
-
Updated
Mar 20, 2022 - Python
Управление Яндекс.Станцией и другими колонками с Алисой из Home Assistant
-
Updated
May 31, 2022 - Python
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
-
Updated
May 5, 2022 - Python
A Python/Pytorch app for easily synthesising human voices
-
Updated
Apr 23, 2022 - Python
cadavere
commented
Apr 19, 2021
I thought about rearranging items in sentence (using drag & drop) in case of misclick or user fault.
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
-
Updated
Mar 23, 2022 - Python
Improve this page
Add a description, image, and links to the tts topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the tts topic, visit your repo's landing page and select "manage topics."
目前的多音字使用 pypinyin 或者 g2pM,精度有限,想做一个基于 BERT (或者 ERNIE) 多音字预测模型,简单来说就是假设某语言有 100 个多音字,每个多音字最多有 3 个发音,那么可以在 BERT 后面接 100 个 3 分类器(简单的 fc 层即可),在预测时,找到对应的分类器进行分类即可。
参考论文:
tencent_polyphone.pdf
数据可以用 https://github.com/kakaobrain/g2pM 提供的数据
进阶:多任务的 BERT
![image](https://user-images.githubusercontent.com/24568452