# speech
Repositories 457
This is the official location of the Kaldi project.
kaldi
c-plus-plus
cuda
shell
speech-recognition
speech-to-text
speaker-verification
speaker-id
speech
Shell
Updated Jun 3, 2019
JavaScript
Updated Mar 10, 2019
Code examples for new APIs of iOS 10.
ios
ios10
swift-3
swift-4
speech
metal
cnn
image-recognition
convolutional-neural-networks
demo
metal-performance-shaders
metal-cnn
uiviewpropertyanimator
Swift
Updated Apr 11, 2019
Lingvo
speech-recognition
translation
speech-to-text
machine-translation
mnist
seq2seq
language-model
tts
asr
lm
nlp
tensorflow
speech
research
distributed
gpu-computing
speech-synthesis
Python
Updated Jun 3, 2019
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Python
Updated Mar 19, 2018
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
speech
alignment
tts
python
linux
macos
windows
nlp
espeak
espeak-ng
festival
cli
dtw
ffmpeg
forced-alignment
text
audio
srt
smil
text-to-speech
Python
Updated Oct 11, 2018
WaveNet vocoder
Python
Updated May 29, 2019
Python library and CLI tool to interface with Google Translate's text-to-speech API
Python
Updated Feb 20, 2019
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is…
speech-recognition
gru
dnn
kaldi
rnn-model
pytorch
timit
deep-learning
deep-neural-networks
recurrent-neural-networks
multilayer-perceptron-network
lstm
lstm-neural-networks
speech
asr
rnn
dnn-hmm
Perl
Updated May 18, 2019
Open-Source Large Vocabulary Continuous Speech Recognition Engine
C
Updated May 31, 2019
Free, easy, portable audio engine for games
audio
game-development
engine
sound
sound-effects
synthesizer
game
portable
mp3
ogg
flac
opensl-es
python
c
cpp
ruby
gamemaker
blitzmax
speech
speech-to-text
C
Updated Mar 26, 2019
Speech Enhancement Generative Adversarial Network in TensorFlow
speech
gan
tensorflow
deep-learning
deep-neural-networks
generative-model
generative-adversarial-networks
Python
Updated Aug 18, 2018
Simple audio I/O for PyTorch
Python
Updated May 31, 2019
speech
speech-recognition
speech-to-text
voice-control
stt
node
hotword-detection
keyword-spotting
alexa
voice-recognition
JavaScript
Updated Jun 2, 2019
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to pro…
Java
Updated May 2, 2019
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Python
Updated Jun 7, 2018
AAC communication board with text-to-speech for the browser
JavaScript
Updated Jun 3, 2019
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
HTML
Updated Apr 9, 2019
Praat: Doing Phonetics By Computer
C
Updated Jun 2, 2019
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly rec…
vad
dnn
lstm
bdnn
acam
attention
speech
data
voice-detection
speech-recognition
voice-activity-detection
speech-activity-detection
MATLAB
Updated Feb 10, 2019
A Python wrapper for Kaldi
python
wrapper
kaldi
openfst
asr
speech-recognition
speech
language-model
feature-extraction
clif
numpy
Python
Updated May 31, 2019
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
JavaScript
Updated May 30, 2019
A neural network for end-to-end speech denoising
machine-learning
deep-learning
neural-networks
speech-denoising
speech
wavenet
end-to-end
speech-processing
Python
Updated Jun 5, 2018
Android speech recognition and text to speech made easy
Java
Updated Feb 2, 2019
Python interface to CMU Sphinxbase and Pocketsphinx libraries
Python
Updated Apr 9, 2019
The neural network model can detect five different male/female emotions from speech audio. (Deep Learn…
emotion
python3
deep-learning
neural-network
data-science
deep-neural-networks
speech
voice
audio-files
natural-language-processing
natural-language-understanding
speech-recognition
emotion-recognition
speech-emotion-recognition
keras
Jupyter Notebook
Updated Dec 7, 2018
Gender recognition by voice and speech analysis
gender-recognition
gender
machine-learning
data-science
artificial-intelligence
neural-network
logistic-regression
vocal
voice
speech
acoustic-properties
signal
ai
R
Updated Jul 3, 2018
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
keras
deepspeech
asr
ctc
coreml
speechrecognition
speech-to-text
deep-learning
machine-learning
neural-networks
baidu
speech
deeplearning
neural-network
nn
Python
Updated Mar 17, 2018
A Python wrapper for the Speech Signal Processing Toolkit (SPTK).
Python
Updated May 30, 2019
Deep-learning-based speech enhancement using Keras and Python, designed to be easy to use
Python
Updated Mar 31, 2019

