speech-processing

Star

Here are 276 public repositories matching this topic...

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated Apr 14, 2025
Python

snakers4 / silero-vad

Star

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Mar 24, 2025
Python

microsoft / torchscale

Star

Foundation Architecture for (M)LLMs

machine-learning natural-language-processing translation computer-vision transformer speech-processing multimodal pretrained-language-model

Updated Apr 11, 2024
Python

linto-ai / whisper-timestamped

Star

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Updated Mar 31, 2025
Python

r9y9 / wavenet_vocoder

Sponsor

Star

WaveNet vocoder

python speech pytorch speech-synthesis wavenet speech-processing wavenet-vocoder neural-vocoder

Updated Jul 29, 2023
Python

r9y9 / deepvoice3_pytorch

Sponsor

Star

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

python machine-learning end-to-end pytorch tts speech-synthesis speech-processing multi-speaker

Updated Dec 19, 2023
Python

resemble-ai / resemble-enhance

Star

AI powered speech denoising and enhancement

speech-processing denoise speech-enhancement speech-denoising

Updated Dec 3, 2024
Python

DigitalPhonetics / IMS-Toucan

Star

Controllable and fast Text-to-Speech for over 7000 languages!

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated Nov 7, 2024
Python

mravanelli / SincNet

Star

SincNet is a neural architecture for efficiently processing raw audio samples.

Updated Apr 28, 2021
Python

haoheliu / voicefixer

Sponsor

Star

General Speech Restoration

speech tts speech-synthesis super-resolution speech-processing vocoder speech-analysis denoise mel speech-enhancement dereverberation declipping

Updated Feb 17, 2025
Python

ictnlp / StreamSpeech

Star

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Updated Aug 24, 2024
Python

X-LANCE / SLAM-LLM

Star

Speech, Language, Audio, Music Processing with Large Language Model

speech-processing audio-processing peft music-processing large-language-model multimodal-large-language-models

Updated Apr 12, 2025
Python

drethage / speech-denoising-wavenet

Star

A neural network for end-to-end speech denoising

machine-learning deep-learning end-to-end speech neural-networks wavenet speech-processing speech-denoising

Updated Jul 6, 2023
Python

nyrahealth / CrisperWhisper

Star

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

audio recognition detection speech speech-recognition filler transcription whisper speech-processing asr timestamps verbatim

Updated Dec 19, 2024
Python

breizhn / DTLN

Star

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

audio raspberry-pi deep-learning tensorflow keras speech-processing dns-challenge noise-reduction audio-processing real-time-audio speech-enhancement speech-denoising onnx tf-lite noise-suppression dtln-model

Updated Jul 28, 2023
Python

Audio-WestlakeU / FullSubNet

Star

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

audio reproducible-research paper speech pytorch band speech-processing noise-reduction denoising speech-separation speech-enhancement narrow-band single-channel pretrained-model full-band sub-band

Updated Aug 19, 2023
Python

SuperKogito / spafe

Sponsor

Star

🔉 spafe: Simplified Python Audio Features Extraction

Updated Mar 20, 2025
Python

microsoft / UniSpeech

Star

UniSpeech - Large Scale Self-Supervised Learning for Speech

speech pytorch speech-recognition speaker-verification speech-processing speech-separation diarization speech-diarization

Updated Apr 5, 2024
Python

r9y9 / pysptk

Sponsor

Star

A python wrapper for Speech Signal Processing Toolkit (SPTK).

python dsp speech speech-synthesis python-wrapper digital-signal-processing speech-processing sptk

Updated Jul 16, 2024
Python

santi-pdp / pase

Star

Problem Agnostic Speech Encoder

deep-learning pytorch unsupervised-learning speech-processing multi-task-learning waveform-analysis self-supervised-learning

Updated Jul 6, 2023
Python

Improve this page

Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-processing

Here are 276 public repositories matching this topic...

speechbrain / speechbrain

snakers4 / silero-vad

microsoft / torchscale

linto-ai / whisper-timestamped

r9y9 / wavenet_vocoder

r9y9 / deepvoice3_pytorch

resemble-ai / resemble-enhance

DigitalPhonetics / IMS-Toucan

mravanelli / SincNet

haoheliu / voicefixer

ictnlp / StreamSpeech

X-LANCE / SLAM-LLM

drethage / speech-denoising-wavenet

nyrahealth / CrisperWhisper

breizhn / DTLN

Audio-WestlakeU / FullSubNet

SuperKogito / spafe

microsoft / UniSpeech

r9y9 / pysptk

santi-pdp / pase

Improve this page

Add this topic to your repo