#
sound-classification
Here are 57 public repositories matching this topic...
Audio classification with VGGish as feature extractor in TensorFlow
audio
deep-learning
neural-network
tensorflow
audio-classification
audio-processing
audioset
sound-classification
vggish
-
Updated
Dec 4, 2021 - Python
In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal registered with USB microphone
raspberry-pi
machine-learning
tensorflow
audio-analysis
dataset
raspberry
librosa
tensorflow-models
multilayer-perceptron-network
audio-signals
sound-classification
-
Updated
May 30, 2022 - Python
Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)
machine-learning
sound-processing
classification
urban-sound-classification
audio-processing
sound-synthesis
sound-classification
mel-spectrogram
audio-tagging
sound-classification-spectrograms
urban-sound-8k
-
Updated
May 26, 2022 - Jupyter Notebook
Sound Classification using Neural Networks
machine-learning
deep-learning
sound-processing
neural-networks
convolutional-neural-networks
urban-sound-classification
librosa
cnn-keras
sound-classification
-
Updated
Feb 1, 2022 - Jupyter Notebook
Environmental sound classification with Convolutional neural networks and the UrbanSound8K dataset.
keras
cross-validation
pytorch
librosa
data-augmentation
cnn-model
environmental-sounds
sound-classification
urbansound8k
spectrograms
-
Updated
Apr 25, 2021 - Jupyter Notebook
Building a Sound Classification iOS Application using AI
python
swift
ios
machine-learning
ai
artificial-intelligence
ios-app
classification
sound-classification
sound-recognition
turicreate
esc-50
-
Updated
Aug 26, 2019 - Jupyter Notebook
.NET library to easily create Voice Command Control feature.
csharp
dotnet
wpf
voice-commands
image-processing
voice-recognition
image-classification
spectrogram
machinelearning
fft
asp-net
voice-control
winform
cnn-model
voice-assistant
sound-classification
ml-net
net-library
voice-command-control
voice-control-realtime
-
Updated
Feb 12, 2022 - C#
Deep Multi-Sensory Object Category Recognition Using Interactive Behavioral Exploration
deep-learning
tensorflow
video-classification
sound-classification
multimodal-deep-learning
haptic-classification
-
Updated
Sep 12, 2019 - Jupyter Notebook
Classify bird geni based on their calls or train your own sound classifier using a convolutional neural network
-
Updated
Jun 2, 2021 - Python
A neural network lib in C++20 optimized for CPU. Create, train and use a neural network in less than 10 lines of codes.
cmake
deep-learning
neural-network
cnn
rnn
image-classification
image-recognition
sound-classification
sound-recognition
-
Updated
May 24, 2022 - C++
-
Updated
May 28, 2020 - Jupyter Notebook
Spectrogram for UrbanSound8K audio dataset
deep-learning
dataset
spectrogram
urban-sound-classification
audio-classification
sound-classification
urban-sound
sound-dataset
urban-sound-8k
-
Updated
Feb 23, 2019
Real-time version of sound_classification_demo in OpenVINO toolkit. Captures audio from microphone, do classification, and display result on the screen with illustration.
python
demo
real-time
deep-learning
intel
audio-classification
deep-learning-demo
sound-classification
openvino
-
Updated
Jul 28, 2021 - Python
Learnable Gammatone Filterbank (LGTFB) and Equal-loudness Normalization (EN)
-
Updated
Apr 24, 2020 - Python
-
Updated
May 28, 2020 - Jupyter Notebook
iOS alarm app w/ personal data powered by Core ML - Sound Classification
-
Updated
Oct 7, 2020 - Swift
Official project page of CrissCross: https://pritamqu.github.io/CrissCross/
representation-learning
action-recognition
audioset
dcase
ucf101
hmdb51
sound-classification
self-supervised-learning
kinetics-datasets
kinetics400
esc50
-
Updated
Apr 28, 2022 - Python
Official repository for the research article "Pruning vs XNOR-Net: A ComprehensiveStudy on Deep Learning for AudioClassification in Microcontrollers"
deep-learning
microcontrollers
quantization
audio-classification
xnor-net
model-compression
channel-pruning
sound-classification
filter-pruning
deeplea
-
Updated
Aug 24, 2021 - Python
Classification of Sounds Using Convolutional Neural Networks
python
machine-learning
tensorflow
cnn
classification
windowing
mfcc
sound-classification
kaiser-window
-
Updated
Apr 14, 2018 - Python
Audio Classification on iOS using SoundAnalysis API.
-
Updated
Oct 9, 2019 - Jupyter Notebook
-
Updated
May 28, 2020 - Jupyter Notebook
Classification of Environment Sound using CNN and ImageDataGenerator
-
Updated
Nov 1, 2018 - Jupyter Notebook
Classification of sound dataset
-
Updated
Sep 8, 2019 - Jupyter Notebook
Fully automated machine learning pipeline for bird sound recognition
machine-learning
deep-learning
sound-processing
image-classification
biodiversity-informatics
ecology
biodiversity
bird-monitoring
sound-classification
kubeflow
ornithology
bioacoustics
sound-classification-spectrograms
xeno-canto
-
Updated
Jun 1, 2022 - Jupyter Notebook
-
Updated
May 28, 2020 - Jupyter Notebook
Procesamiento de imágenes de los personajes y procesamiento de sonidos para los tipos de armas del juego Free Fire, con el fin de clasificarlos.
-
Updated
Jan 26, 2021 - JavaScript
Classification of urban sounds such as air conditioner, jackhammer, drilling, siren, street music, engine idling and children playing by using Mel-frequency Cepstral Coefficients (MFCCs) as audio feature and CNN algorithm.
-
Updated
Jan 14, 2022 - Jupyter Notebook
This github repository contains the sample code and exercises of btp-ai-core-bootcamp, which showcases how to build Intelligence and Sustainability into Your Solutions on SAP Business Technology Platform with SAP AI Core and SAP Analytics Cloud for Planning.
sample
sustainability
computer-vision
deep-learning
image-segmentation
sample-code
predictive-maintenance
condition-monitoring
sound-classification
defect-detection
sap-analytics-cloud
sap-ai-core
sap-ai-launchpad
sac-planning
-
Updated
Jun 2, 2022 - Jupyter Notebook
Improve this page
Add a description, image, and links to the sound-classification topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the sound-classification topic, visit your repo's landing page and select "manage topics."


目前的多音字使用 pypinyin 或者 g2pM,精度有限,想做一个基于 BERT (或者 ERNIE) 多音字预测模型,简单来说就是假设某语言有 100 个多音字,每个多音字最多有 3 个发音,那么可以在 BERT 后面接 100 个 3 分类器(简单的 fc 层即可),在预测时,找到对应的分类器进行分类即可。
参考论文:
tencent_polyphone.pdf
数据可以用 https://github.com/kakaobrain/g2pM 提供的数据
进阶:多任务的 BERT
![image](https://user-images.githubusercontent.com/24568452