demucs 是facebook开源的混合声音分离模型
.NET library for creating spectrograms (visual representations of frequency spectrum over time)
Web Audio Spectrogram
Audio spectrogram in canvas.
spectrogram visualization in the browser
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
80MHz bandwidth with LimeSDR-Mini and GQRX
Spectrograms, MFCCs, and Inversion Demo in a jupyter notebook
这里为大家呈现真实的8分频FFT频谱图!
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
翻译 - 基于GAN的Mel频谱图反演网络用于文本到语音合成
Turn an image into sound whose spectrogram looks like the image.
翻译 - 将图像转换成声谱图看起来像图像的声音。
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
Qt based program to create real-time spectrogram
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
LibrosaCpp is a c++ implemention of librosa to compute short-time fourier transform coefficients,mel spectrogram or mfcc
Cough detection with Log Mel Spectrogram, Wavelet Transform, Deep learning and Transfer learning techniques
Implementations of a number of generative models in Tensorflow 2. GAN, VAE, Seq2Seq, VAEGAN, GAIA, Spectrogram Inversion. Everything is self contained in a jupyter notebook for easy export to colab.
翻译 - Tensorflow 2中许多生成模型的实现。GAN,VAE,Seq2Seq,VAEGAN,GAIA,频谱图反演。一切都包含在jupyter笔记本中,可轻松导出到colab。
Example project showcasing how to use a Raspberry Pi Pico, Adafruit's PDM MEMS Microphone Breakout, and Adafruit's 2.0" 320x240 Color IPS TFT Display with microSD Card Breakout to "See sound in real-t...
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the sam...
MNSS (Music Noise Segmentation on a Spectrogram) is a deep-neural network based preprocessing technique that pre-filters unnecessary noise. MNSS is based on the convolutional neural networks and uses ...
本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法