Related Repositories for 0nutation/USLM

Repository	⭐ Stars	🍴 Forks	Ratio
0nutation/USLM	131	7	18.71
ZhangXInFD/SpeechTokenizer	102	8	12.75
suno-ai/bark	84	9	9.33
facebookresearch/audiocraft	82	9	9.11
facebookresearch/encodec	81	3	27.00
open-mmlab/Amphion	81	9	9.00
coqui-ai/TTS	81	17	4.76
openai/whisper	79	8	9.88
yl4579/StyleTTS2	76	6	12.67
espnet/espnet	74	21	3.52
RVC-Boss/GPT-SoVITS	72	6	12.00
lucidrains/audiolm-pytorch	71	3	23.67
speechbrain/speechbrain	69	12	5.75
Plachtaa/VALL-E-X	69	6	11.50
2noise/ChatTTS	68	5	13.60
yangdongchao/UniAudio	68	1	68.00
fishaudio/fish-speech	66	6	11.00
descriptinc/descript-audio-codec	66	7	9.43
0nutation/SpeechGPT	66	2	33.00
microsoft/unilm	66	4	16.50
FunAudioLLM/CosyVoice	65	4	16.25
lucidrains/naturalspeech2-pytorch	64	6	10.67
netease-youdao/EmotiVoice	64	7	9.14
facebookresearch/seamless_communication	63	6	10.50
jik876/hifi-gan	63	13	4.85
jaywalnut310/vits	63	6	10.50
myshell-ai/OpenVoice	62	5	12.40
m-bain/whisperX	62	7	8.86
svc-develop-team/so-vits-svc	61	8	7.62
lifeiteng/vall-e	61	6	10.17
neonbjb/tortoise-tts	61	4	15.25
yangdongchao/AcademiCodec	61	5	12.20
haoheliu/AudioLDM	60	3	20.00
kyutai-labs/moshi	60	5	12.00
shivammehta25/Matcha-TTS	60	7	8.57
pyannote/pyannote-audio	59	6	9.83
AIGC-Audio/AudioGPT	59	0
jasonppy/VoiceCraft	58	5	11.60
lucidrains/vector-quantize-pytorch	58	5	11.60
facebookresearch/llama	57	2	28.50
SWivid/F5-TTS	57	7	8.14
archinetai/audio-ai-timeline	57	3	19.00
archinetai/audio-diffusion-pytorch	57	3	19.00
bytedance/SALMONN	56	3	18.67
NVIDIA/NeMo	56	10	5.60
haoheliu/AudioLDM2	56	1	56.00
huggingface/diffusers	56	4	14.00
LAION-AI/CLAP	55	2	27.50
kan-bayashi/ParallelWaveGAN	55	8	6.88
sh-lee-prml/HierSpeechpp	54	3	18.00
gpt-omni/mini-omni	53	1	53.00
declare-lab/tango	53	5	10.60
microsoft/SpeechT5	52	3	17.33
NVIDIA/BigVGAN	52	8	6.50
facebookresearch/AudioDec	52	5	10.40
MoonInTheRiver/DiffSinger	52	4	13.00
ming024/FastSpeech2	52	8	6.50
CorentinJ/Real-Time-Voice-Cloning	51	8	6.38
ga642381/speech-trident	51	3	17.00
ddlBoJack/emotion2vec	51	1	51.00
s3prl/s3prl	51	11	4.64
pytorch/fairseq	51	13	3.92
Audio-AGI/AudioSep	51	3	17.00
jishengpeng/WavTokenizer	51	3	17.00
microsoft/DeepSpeed	50	3	16.67
huawei-noah/Speech-Backbones	50	4	12.50
yangdongchao/SoundStorm	50	3	16.67
lucidrains/soundstorm-pytorch	50	3	16.67
enhuiz/vall-e	50	3	16.67
SpeechifyInc/Meta-voicebox	49	2	24.50
facebookresearch/segment-anything	49	2	24.50
QwenLM/Qwen-Audio	49	2	24.50
resemble-ai/Resemblyzer	49	2	24.50
vllm-project/vllm	49	0
microsoft/NeuralSpeech	49	3	16.33
hpcaitech/ColossalAI	48	3	16.00
hpcaitech/Open-Sora	48	3	16.00
liusongxiang/Large-Audio-Models	48	2	24.00
VinAIResearch/XPhoneBERT	47	5	9.40
lucidrains/voicebox-pytorch	46	4	11.50
huggingface/transformers	46	12	3.83
MontrealCorpusTools/Montreal-Forced-Aligner	46	5	9.20
EmulationAI/awesome-large-audio-models	46	2	23.00
FunAudioLLM/SenseVoice	46	3	15.33
state-spaces/mamba	46	2	23.00
meta-llama/llama3	46	2	23.00
KdaiP/StableTTS	45	5	9.00
charactr-platform/vocos	45	8	5.62
NATSpeech/NATSpeech	45	2	22.50
haoheliu/versatile_audio_super_resolution	45	3	15.00
facebookresearch/DiT	45	2	22.50
hubertsiuzdak/snac	44	3	14.67
bootphon/phonemizer	44	3	14.67
snakers4/silero-vad	44	1	44.00
ZhangXInFD/soundstorm-speechtokenizer	44	5	8.80
lucidrains/musiclm-pytorch	44	2	22.00
iver56/audiomentations	43	1	43.00
DigitalPhonetics/IMS-Toucan	43	4	10.75
CompVis/stable-diffusion	43	3	14.33
zhenye234/CoMoSpeech	43	4	10.75