Related Repositories for k2-fsa/libriheavy

Repository	⭐ Stars	🍴 Forks	Ratio (Stars/Forks)
k2-fsa/libriheavy	175	4	43.75
openai/whisper	97	8	12.12
espnet/espnet	91	28	3.25
speechbrain/speechbrain	90	12	7.50
suno-ai/bark	89	6	14.83
coqui-ai/TTS	88	17	5.18
facebookresearch/audiocraft	88	7	12.57
open-mmlab/Amphion	88	6	14.67
2noise/ChatTTS	86	2	43.00
NVIDIA/NeMo	84	14	6.00
RVC-Boss/GPT-SoVITS	80	9	8.89
FunAudioLLM/CosyVoice	79	7	11.29
pyannote/pyannote-audio	78	9	8.67
facebookresearch/encodec	78	0
myshell-ai/OpenVoice	78	6	13.00
fishaudio/fish-speech	77	5	15.40
m-bain/whisperX	76	11	6.91
neonbjb/tortoise-tts	75	8	9.38
jasonppy/VoiceCraft	75	4	18.75
kyutai-labs/moshi	74	4	18.50
descriptinc/descript-audio-codec	73	6	12.17
facebookresearch/seamless_communication	72	3	24.00
jaywalnut310/vits	71	11	6.45
ZhangXInFD/SpeechTokenizer	69	2	34.50
huggingface/transformers	69	19	3.63
netease-youdao/EmotiVoice	69	5	13.80
yl4579/StyleTTS2	69	7	9.86
shivammehta25/Matcha-TTS	67	9	7.44
SWivid/F5-TTS	67	6	11.17
Plachtaa/VALL-E-X	67	3	22.33
lucidrains/audiolm-pytorch	66	2	33.00
snakers4/silero-vad	66	3	22.00
microsoft/unilm	65	8	8.12
haoheliu/AudioLDM	64	2	32.00
lucidrains/naturalspeech2-pytorch	64	3	21.33
archinetai/audio-diffusion-pytorch	62	4	15.50
s3prl/s3prl	62	7	8.86
kan-bayashi/ParallelWaveGAN	62	12	5.17
NVIDIA/BigVGAN	61	5	12.20
jik876/hifi-gan	61	14	4.36
bytedance/SALMONN	60	3	20.00
0nutation/SpeechGPT	60	1	60.00
sh-lee-prml/HierSpeechpp	60	2	30.00
Rikorose/DeepFilterNet	60	2	30.00
yangdongchao/UniAudio	60	2	30.00
aliutkus/speechmetrics	59	1	59.00
bootphon/phonemizer	59	3	19.67
facebookresearch/llama	58	1	58.00
svc-develop-team/so-vits-svc	58	6	9.67
AIGC-Audio/AudioGPT	58	0
ggerganov/whisper.cpp	58	8	7.25
jishengpeng/WavTokenizer	57	2	28.50
microsoft/DeepSpeed	57	0
gpt-omni/mini-omni	57	1	57.00
lifeiteng/vall-e	57	9	6.33
yangdongchao/AcademiCodec	57	6	9.50
CorentinJ/Real-Time-Voice-Cloning	56	10	5.60
meta-llama/llama3	56	1	56.00
huggingface/parler-tts	56	2	28.00
metavoiceio/metavoice-src	56	3	18.67
vllm-project/vllm	55	1	55.00
hpcaitech/Open-Sora	55	3	18.33
kaldi-asr/kaldi	54	25	2.16
QwenLM/Qwen-Audio	54	1	54.00
lhotse-speech/lhotse	54	10	5.40
Camb-ai/MARS5-TTS	54	3	18.00
lucidrains/vector-quantize-pytorch	54	4	13.50
liusongxiang/Large-Audio-Models	53	2	26.50
alibaba-damo-academy/FunCodec	53	2	26.50
KdaiP/StableTTS	53	3	17.67
ddlBoJack/emotion2vec	52	4	13.00
microsoft/SpeechT5	52	0
LAION-AI/CLAP	52	0
facebookresearch/AudioDec	52	3	17.33
archinetai/audio-ai-timeline	52	2	26.00
alibaba-damo-academy/FunASR	52	3	17.33
MontrealCorpusTools/Montreal-Forced-Aligner	51	2	25.50
huggingface/diffusers	51	2	25.50
FunAudioLLM/SenseVoice	51	1	51.00
BlinkDL/RWKV-LM	51	0
TencentGameMate/chinese_speech_pretrain	51	1	51.00
karpathy/minbpe	51	2	25.50
facebookresearch/DiT	51	0
Audio-AGI/AudioSep	51	1	51.00
facebookresearch/libri-light	51	1	51.00
ming024/FastSpeech2	51	6	8.50
DigitalPhonetics/IMS-Toucan	50	3	16.67
LAION-AI/audio-dataset	50	1	50.00
ga642381/speech-trident	50	2	25.00
microsoft/NeuralSpeech	50	1	50.00
hubertsiuzdak/snac	50	2	25.00
karpathy/nanoGPT	50	3	16.67
enhuiz/vall-e	50	2	25.00
iver56/audiomentations	50	3	16.67
jim-schwoebel/voice_datasets	50	6	8.33
haoheliu/versatile_audio_super_resolution	49	2	24.50
haoheliu/AudioLDM2	49	0
MoonInTheRiver/DiffSinger	49	6	8.17
huawei-noah/Speech-Backbones	48	3	16.00
NATSpeech/NATSpeech	48	5	9.60
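
The Ratio column appears to be Stars divided by Forks, rounded to two decimals, and left blank where Forks is 0. Below is a minimal Python sketch under that assumption. The function name `star_fork_ratio` and the three sample rows copied from the table are for illustration only, and this is not necessarily how the listing was generated.

```python
from typing import Optional


def star_fork_ratio(stars: int, forks: int) -> Optional[float]:
    """Return stars / forks rounded to two decimals, or None when forks is 0.

    Assumption: a blank Ratio cell in the table corresponds to forks == 0.
    """
    if forks == 0:
        return None  # ratio is undefined, so the cell stays empty
    return round(stars / forks, 2)


# Sample rows taken from the table above: (repository, stars, forks)
rows = [
    ("k2-fsa/libriheavy", 175, 4),
    ("espnet/espnet", 91, 28),
    ("facebookresearch/encodec", 78, 0),
]

for name, stars, forks in rows:
    ratio = star_fork_ratio(stars, forks)
    # Print tab-separated, leaving the last cell empty when there is no ratio
    print(name, stars, forks, "" if ratio is None else f"{ratio:.2f}", sep="\t")
```

Returning `None` for zero forks, rather than raising or returning infinity, keeps such rows in the listing while marking their ratio as undefined.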