GitRelate(d)
Related Repositories for ddlBoJack/Awesome-Speech-Pretraining
Repository
⭐ Stars
🍴 Forks
Ratio
ddlBoJack/Awesome-Speech-Pretraining
192
6
32.00
speechbrain/speechbrain
102
22
4.64
openai/whisper
94
8
11.75
espnet/espnet
87
24
3.62
coqui-ai/TTS
84
8
10.50
microsoft/unilm
84
5
16.80
huggingface/transformers
80
11
7.27
s3prl/s3prl
79
17
4.65
suno-ai/bark
78
5
15.60
ddlBoJack/Speech-Resources
76
9
8.44
NVIDIA/NeMo
76
11
6.91
facebookresearch/audiocraft
72
3
24.00
facebookresearch/encodec
71
4
17.75
pytorch/fairseq
71
15
4.73
open-mmlab/Amphion
69
4
17.25
kaldi-asr/kaldi
63
24
2.62
jaywalnut310/vits
62
6
10.33
AIGC-Audio/AudioGPT
62
3
20.67
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
61
7
8.71
facebookresearch/llama
60
3
20.00
microsoft/DeepSpeed
60
2
30.00
lucidrains/audiolm-pytorch
60
5
12.00
BradyFU/Awesome-Multimodal-Large-Language-Models
59
2
29.50
RVC-Boss/GPT-SoVITS
59
5
11.80
0nutation/SpeechGPT
59
1
59.00
jik876/hifi-gan
59
11
5.36
microsoft/SpeechT5
58
3
19.33
m-bain/whisperX
58
3
19.33
huggingface/diffusers
57
4
14.25
pyannote/pyannote-audio
56
5
11.20
google-research/google-research
54
5
10.80
haoheliu/AudioLDM
53
2
26.50
alibaba-damo-academy/FunASR
53
4
13.25
descriptinc/descript-audio-codec
53
2
26.50
TencentGameMate/chinese_speech_pretrain
53
2
26.50
2noise/ChatTTS
53
1
53.00
facebookresearch/segment-anything
52
1
52.00
snakers4/silero-vad
52
4
13.00
lllyasviel/ControlNet
52
1
52.00
archinetai/audio-diffusion-pytorch
52
2
26.00
microsoft/NeuralSpeech
51
5
10.20
CompVis/stable-diffusion
51
5
10.20
karpathy/nanoGPT
51
1
51.00
ming024/FastSpeech2
51
10
5.10
FunAudioLLM/CosyVoice
51
4
12.75
facebookresearch/seamless_communication
50
3
16.67
CorentinJ/Real-Time-Voice-Cloning
50
14
3.57
liusongxiang/Large-Audio-Models
50
2
25.00
kyutai-labs/moshi
50
2
25.00
neonbjb/tortoise-tts
50
4
12.50
huggingface/peft
50
1
50.00
mli/paper-reading
49
6
8.17
aliutkus/speechmetrics
49
7
7.00
openai/CLIP
49
2
24.50
lucidrains/vector-quantize-pytorch
48
4
12.00
bytedance/SALMONN
48
3
16.00
ga642381/speech-trident
48
4
12.00
fighting41love/funNLP
48
11
4.36
archinetai/audio-ai-timeline
48
4
12.00
PaddlePaddle/PaddleSpeech
48
4
12.00
iver56/audiomentations
47
4
11.75
LAION-AI/CLAP
47
3
15.67
pytorch/pytorch
47
12
3.92
myshell-ai/OpenVoice
47
2
23.50
ddlBoJack/emotion2vec
46
4
11.50
shivammehta25/Matcha-TTS
46
2
23.00
lucidrains/naturalspeech2-pytorch
46
1
46.00
lhotse-speech/lhotse
46
5
9.20
asteroid-team/torch-audiomentations
46
6
7.67
MoonInTheRiver/DiffSinger
46
5
9.20
kan-bayashi/ParallelWaveGAN
45
6
7.50
vllm-project/vllm
45
0
k2-fsa/icefall
45
6
7.50
facebookresearch/fairseq
45
16
2.81
Rikorose/DeepFilterNet
45
7
6.43
lifeiteng/vall-e
45
6
7.50
pytorch/audio
45
8
5.62
facebookresearch/demucs
45
7
6.43
MontrealCorpusTools/Montreal-Forced-Aligner
45
4
11.25
wq2012/awesome-diarization
44
7
6.29
netease-youdao/EmotiVoice
44
3
14.67
haotian-liu/LLaVA
44
1
44.00
nanahou/Awesome-Speech-Enhancement
44
6
7.33
google-research/tuning_playbook
44
0
k2-fsa/k2
43
3
14.33
huggingface/accelerate
43
1
43.00
QwenLM/Qwen-Audio
43
3
14.33
Plachtaa/VALL-E-X
43
2
21.50
wenet-e2e/wenet
43
16
2.69
Vision-CAIR/MiniGPT-4
42
2
21.00
tensorflow/tensorflow
42
14
3.00
SWivid/F5-TTS
42
3
14.00
fishaudio/fish-speech
42
2
21.00
labuladong/fucking-algorithm
42
6
7.00
yl4579/StyleTTS2
42
3
14.00
enhuiz/vall-e
41
1
41.00
heejkoo/Awesome-Diffusion-Models
41
6
6.83
lucidrains/denoising-diffusion-pytorch
41
3
13.67
Audio-AGI/AudioSep
41
0
microsoft/DNS-Challenge
41
7
5.86
Show More