LEEYOONHYUNG

LEE YOON HYUNG LEEYOONHYUNG

48 followers · 22 following

Achievements

x2 x2

Achievements

x2 x2

Highlights

Stars

mbzuai-oryx / LLMVoX

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Python 164 18 Updated Mar 14, 2025

lucidrains / vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Python 3,018 243 Updated Mar 11, 2025

google-research / maskgit

Official Jax Implementation of MaskGIT

Jupyter Notebook 492 50 Updated Nov 18, 2022

valeoai / Halton-MaskGIT

[ICLR2025] Halton Scheduler for Masked Generative Image Transformer

Python 197 21 Updated Feb 27, 2025

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,412 184 Updated Feb 14, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,171 105 Updated Jan 2, 2025

Stability-AI / stable-codec

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

Python 335 21 Updated Jan 14, 2025

naver-ai / usdm

Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)

Python 83 3 Updated Dec 3, 2024

liutaocode / TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 385 23 Updated Mar 16, 2025

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,213 278 Updated Nov 5, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,371 1,425 Updated Mar 15, 2025

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,807 634 Updated Mar 13, 2025

supertone-inc / super-monotonic-align

Python 140 9 Updated Sep 19, 2024

taehong-moon / ee-diffusion

Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'

Python 16 Updated Jul 24, 2024

bshall / knn-vc

Voice Conversion With Just Nearest Neighbors

Python 474 67 Updated Mar 18, 2024

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,058 77 Updated Mar 2, 2025

keonlee9420 / evaluate-zero-shot-tts

Evaluation Protocol for Large-Scale Zero-Shot TTS Literature

Python 76 9 Updated Mar 12, 2025

quic / aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,249 396 Updated Mar 15, 2025

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 5,129 541 Updated Dec 10, 2024

VITA-MLLM / VITA

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,155 164 Updated Feb 13, 2025

Edresson / ZS-TTS-Evaluation

Python 36 2 Updated Sep 19, 2024

YangLing0818 / consistency_flow_matching

Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"

Python 196 6 Updated Jan 17, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

32,569 1,775 Updated Aug 1, 2024

maum-ai / nuwave2

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022

Python 285 23 Updated Sep 16, 2023

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

28,135 2,319 Updated Jun 18, 2024

keonlee9420 / DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023

Python 212 13 Updated Mar 13, 2023

gemelo-ai / vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 892 107 Updated Aug 7, 2024

Vaibhavs10 / open-tts-tracker

1,115 70 Updated Feb 13, 2025

keonlee9420 / Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ulti…

Python 146 19 Updated Jun 6, 2022

keonlee9420 / Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python 325 41 Updated Sep 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LEE YOON HYUNG LEEYOONHYUNG

Achievements

Achievements

Highlights

Block or report LEEYOONHYUNG

Stars

mbzuai-oryx / LLMVoX

lucidrains / vector-quantize-pytorch

google-research / maskgit

valeoai / Halton-MaskGIT

modelscope / ClearerVoice-Studio

facebookresearch / flow_matching

Stability-AI / stable-codec

naver-ai / usdm

liutaocode / TTS-arxiv-daily

gpt-omni / mini-omni

SWivid / F5-TTS

kyutai-labs / moshi

supertone-inc / super-monotonic-align

taehong-moon / ee-diffusion

bshall / knn-vc

jishengpeng / WavTokenizer

keonlee9420 / evaluate-zero-shot-tts

quic / aimet

huggingface / parler-tts

VITA-MLLM / VITA

Edresson / ZS-TTS-Evaluation

YangLing0818 / consistency_flow_matching

karpathy / LLM101n

maum-ai / nuwave2

google-research / tuning_playbook

keonlee9420 / DailyTalk

gemelo-ai / vocos

Vaibhavs10 / open-tts-tracker

keonlee9420 / Comprehensive-E2E-TTS

keonlee9420 / Comprehensive-Transformer-TTS