kaldi-asr / kaldi

6k

This is the official location of the Kaldi project.

kaldi c-plus-plus cuda shell speech-recognition speech-to-text speaker-verification speaker-id speech

Shell Updated Jun 3, 2019

TalAter / annyang

5.4k

💬 Speech recognition for your site

speech-recognition speech speech-to-text voice hacktoberfest

JavaScript Updated Mar 10, 2019

shu223 / iOS-10-Sampler

3.4k

Code examples for new APIs of iOS 10.

ios ios10 swift-3 swift-4 speech metal cnn image-recognition convolutional-neural-networks demo metal-performance-shaders metal-cnn uiviewpropertyanimator

Swift Updated Apr 11, 2019

tensorflow / lingvo

1.6k

Lingvo

speech-recognition translation speech-to-text machine-translation mnist seq2seq language-model tts asr lm nlp tensorflow speech research distributed gpu-computing speech-synthesis

Python Updated Jun 3, 2019

Kyubyong / tacotron

1.5k

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

tts tensorflow speech-synthesis-model speech

Python Updated Mar 19, 2018

readbeyond / aeneas

1.4k

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Good first issues

MemoryError on 32-bit systems

documentation bug

#172 opened about 2 years ago by readbeyond

1

Add asciicasts to docs

documentation help wanted

#134 opened over 2 years ago by readbeyond

Investigate mypy (static type checker via type annotations)

documentation code

#104 opened over 2 years ago by pettarin

2

Python Updated Oct 11, 2018

r9y9 / wavenet_vocoder

974

WaveNet vocoder

wavenet speech-synthesis speech-processing pytorch python wavenet-vocoder neural-vocoder tts speech

Good first issues

µ=256 while Wavenet paper used µ=255

good first issue bug

#64 opened about 1 year ago by PetrochukM

2

Python Updated May 29, 2019

pndurette / gTTS

907

Python library and CLI tool to interface with Google Translate's text-to-speech API

speech python tts text-to-speech gtts

Good first issues

Documents suggestion

documentation

#162 opened 6 months ago by GXTony

1

Python Updated Feb 20, 2019

mravanelli / pytorch-kaldi

844

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is…

speech-recognition gru dnn kaldi rnn-model pytorch timit deep-learning deep-neural-networks recurrent-neural-networks multilayer-perceptron-network lstm lstm-neural-networks speech asr rnn dnn-hmm

Perl Updated May 18, 2019

julius-speech / julius

773

Open-Source Large Vocabulary Continuous Speech Recognition Engine

speech recognition audio-processing speech-recognition

Good first issues

Dictionary format

documentation

#18 opened over 3 years ago by nitslp-ri

New Version of Julius Book

documentation ToDo

#3 opened over 3 years ago by franklixuefei

1

C Updated May 31, 2019

jarikomppa / soloud

615

Free, easy, portable audio engine for games

Good first issues

WavStream::getLength reports doubled length for stereo tracks

good first issue help wanted

#201 opened 6 months ago by yevhen8

more filters should be implemented

good first issue help wanted

#167 opened over 1 year ago by brightening-eyes

Seek performance

good first issue help wanted

#156 opened almost 2 years ago by jazzbre

4

C Updated Mar 26, 2019

santi-pdp / segan

458

Speech Enhancement Generative Adversarial Network in TensorFlow

speech gan tensorflow deep-learning deep-neural-networks generative-model generative-adversarial-networks

Python Updated Aug 18, 2018

pytorch / audio

456

simple audio I/O for pytorch

audio python io mp3 wav speech

Python Updated May 31, 2019

evancohen / sonus

449

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

speech speech-recognition speech-to-text voice-control stt node hotword-detection keyword-spotting alexa voice-recognition

JavaScript Updated Jun 2, 2019

lkuza2 / java-speech-api

447

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to pro…

java speech-recognition speech speech-synthesis speech-to-text jarvis api google recognition

Java Updated May 2, 2019

Kyubyong / dc_tts

411

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

speech speech-to-text tts

Python Updated Jun 7, 2018

cboard-org / cboard

387

AAC communication board with text-to-speech for the browser

aac autism cerebral-palsy progressive-web-app communication-board speech tts text-to-speech

Good first issues

[Translation] Proof read machine translations

good first issue help wanted

#89 opened over 1 year ago by shayc

27

JavaScript Updated Jun 3, 2019

google / tacotron

375

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

machine-learning tts speech audio prosody tacotron

HTML Updated Apr 9, 2019

praat / praat

363

Praat: Doing Phonetics By Computer

speech phonetics acoustics

C Updated Jun 2, 2019

jtkim-kaist / VAD

362

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly rec…

vad dnn lstm bdnn acam attention speech data voice-detection speech-recognition voice-activity-detection speech-activity-detection

MATLAB Updated Feb 10, 2019

pykaldi / pykaldi

341

A Python wrapper for Kaldi

python wrapper kaldi openfst asr speech-recognition speech language-model feature-extraction clif numpy

Python Updated May 31, 2019

googleapis / nodejs-speech

336

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

nodejs machine-learning speech-to-text speech

JavaScript Updated May 30, 2019

drethage / speech-denoising-wavenet

289

A neural network for end-to-end speech denoising

machine-learning deep-learning neural-networks speech-denoising speech wavenet end-to-end speech-processing

Python Updated Jun 5, 2018

gotev / android-speech

210

Android speech recognition and text to speech made easy

android speech recognition tts

Java Updated Feb 2, 2019

bambocher / pocketsphinx-python

197

Python interface to CMU Sphinxbase and Pocketsphinx libraries

python sphinxbase pocketsphinx speech speech-recognition voice

Python Updated Apr 9, 2019

MITESHPUTHRANNEU / Speech-Emotion-Analyzer

188

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learn…

emotion python3 deep-learning neural-network data-science deep-neural-networks speech voice audio-files natural-language-processing natural-language-understanding speech-recognition emotion-recognition speech-emotion-recognition keras

Jupyter Notebook Updated Dec 7, 2018

primaryobjects / voice-gender

183

Gender recognition by voice and speech analysis

gender-recognition gender machine-learning data-science artificial-intelligence neural-network logistic-regression vocal voice speech acoustic-properties signal ai

R Updated Jul 3, 2018

robmsmt / KerasDeepSpeech

183

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

keras deepspeech asr ctc coreml speechrecognition speech-to-text deep-learning machine-learning neural-networks baidu speech deeplearning neural-network nn

Python Updated Mar 17, 2018

r9y9 / pysptk

173

A python wrapper for Speech Signal Processing Toolkit (SPTK).

python-wrapper speech-processing python speech-synthesis speech

Python Updated May 30, 2019

yongxuUSTC / sednn

169

deep learning based speech enhancement using keras python, make it easy to use

speech-enhancement deep-neural-networks speech deep-learning

Python Updated Mar 31, 2019

speech

Repositories 457

Good first issues

Good first issues

Good first issues

Good first issues

Good first issues

Good first issues

Related topics