GitRelate(d)
Related Repositories for 0nutation/USLM
Repository
⭐ Stars
🍴 Forks
Ratio
0nutation/USLM
131
7
18.71
ZhangXInFD/SpeechTokenizer
102
8
12.75
suno-ai/bark
84
9
9.33
facebookresearch/audiocraft
82
9
9.11
facebookresearch/encodec
81
3
27.00
open-mmlab/Amphion
81
9
9.00
coqui-ai/TTS
81
17
4.76
openai/whisper
79
8
9.88
yl4579/StyleTTS2
76
6
12.67
espnet/espnet
74
21
3.52
RVC-Boss/GPT-SoVITS
72
6
12.00
lucidrains/audiolm-pytorch
71
3
23.67
speechbrain/speechbrain
69
12
5.75
Plachtaa/VALL-E-X
69
6
11.50
2noise/ChatTTS
68
5
13.60
yangdongchao/UniAudio
68
1
68.00
fishaudio/fish-speech
66
6
11.00
descriptinc/descript-audio-codec
66
7
9.43
0nutation/SpeechGPT
66
2
33.00
microsoft/unilm
66
4
16.50
FunAudioLLM/CosyVoice
65
4
16.25
lucidrains/naturalspeech2-pytorch
64
6
10.67
netease-youdao/EmotiVoice
64
7
9.14
facebookresearch/seamless_communication
63
6
10.50
jik876/hifi-gan
63
13
4.85
jaywalnut310/vits
63
6
10.50
myshell-ai/OpenVoice
62
5
12.40
m-bain/whisperX
62
7
8.86
svc-develop-team/so-vits-svc
61
8
7.62
lifeiteng/vall-e
61
6
10.17
neonbjb/tortoise-tts
61
4
15.25
yangdongchao/AcademiCodec
61
5
12.20
haoheliu/AudioLDM
60
3
20.00
kyutai-labs/moshi
60
5
12.00
shivammehta25/Matcha-TTS
60
7
8.57
pyannote/pyannote-audio
59
6
9.83
AIGC-Audio/AudioGPT
59
0
jasonppy/VoiceCraft
58
5
11.60
lucidrains/vector-quantize-pytorch
58
5
11.60
facebookresearch/llama
57
2
28.50
SWivid/F5-TTS
57
7
8.14
archinetai/audio-ai-timeline
57
3
19.00
archinetai/audio-diffusion-pytorch
57
3
19.00
bytedance/SALMONN
56
3
18.67
NVIDIA/NeMo
56
10
5.60
haoheliu/AudioLDM2
56
1
56.00
huggingface/diffusers
56
4
14.00
LAION-AI/CLAP
55
2
27.50
kan-bayashi/ParallelWaveGAN
55
8
6.88
sh-lee-prml/HierSpeechpp
54
3
18.00
gpt-omni/mini-omni
53
1
53.00
declare-lab/tango
53
5
10.60
microsoft/SpeechT5
52
3
17.33
NVIDIA/BigVGAN
52
8
6.50
facebookresearch/AudioDec
52
5
10.40
MoonInTheRiver/DiffSinger
52
4
13.00
ming024/FastSpeech2
52
8
6.50
CorentinJ/Real-Time-Voice-Cloning
51
8
6.38
ga642381/speech-trident
51
3
17.00
ddlBoJack/emotion2vec
51
1
51.00
s3prl/s3prl
51
11
4.64
pytorch/fairseq
51
13
3.92
Audio-AGI/AudioSep
51
3
17.00
jishengpeng/WavTokenizer
51
3
17.00
microsoft/DeepSpeed
50
3
16.67
huawei-noah/Speech-Backbones
50
4
12.50
yangdongchao/SoundStorm
50
3
16.67
lucidrains/soundstorm-pytorch
50
3
16.67
enhuiz/vall-e
50
3
16.67
SpeechifyInc/Meta-voicebox
49
2
24.50
facebookresearch/segment-anything
49
2
24.50
QwenLM/Qwen-Audio
49
2
24.50
resemble-ai/Resemblyzer
49
2
24.50
vllm-project/vllm
49
0
microsoft/NeuralSpeech
49
3
16.33
hpcaitech/ColossalAI
48
3
16.00
hpcaitech/Open-Sora
48
3
16.00
liusongxiang/Large-Audio-Models
48
2
24.00
VinAIResearch/XPhoneBERT
47
5
9.40
lucidrains/voicebox-pytorch
46
4
11.50
huggingface/transformers
46
12
3.83
MontrealCorpusTools/Montreal-Forced-Aligner
46
5
9.20
EmulationAI/awesome-large-audio-models
46
2
23.00
FunAudioLLM/SenseVoice
46
3
15.33
state-spaces/mamba
46
2
23.00
meta-llama/llama3
46
2
23.00
KdaiP/StableTTS
45
5
9.00
charactr-platform/vocos
45
8
5.62
NATSpeech/NATSpeech
45
2
22.50
haoheliu/versatile_audio_super_resolution
45
3
15.00
facebookresearch/DiT
45
2
22.50
hubertsiuzdak/snac
44
3
14.67
bootphon/phonemizer
44
3
14.67
snakers4/silero-vad
44
1
44.00
ZhangXInFD/soundstorm-speechtokenizer
44
5
8.80
lucidrains/musiclm-pytorch
44
2
22.00
iver56/audiomentations
43
1
43.00
DigitalPhonetics/IMS-Toucan
43
4
10.75
CompVis/stable-diffusion
43
3
14.33
zhenye234/CoMoSpeech
43
4
10.75
Show More