GitRelate(d)
Related Repositories for k2-fsa/libriheavy
Repository
⭐ Stars
🍴 Forks
Ratio
k2-fsa/libriheavy
175
4
43.75
openai/whisper
97
8
12.12
espnet/espnet
91
28
3.25
speechbrain/speechbrain
90
12
7.50
suno-ai/bark
89
6
14.83
coqui-ai/TTS
88
17
5.18
facebookresearch/audiocraft
88
7
12.57
open-mmlab/Amphion
88
6
14.67
2noise/ChatTTS
86
2
43.00
NVIDIA/NeMo
84
14
6.00
RVC-Boss/GPT-SoVITS
80
9
8.89
FunAudioLLM/CosyVoice
79
7
11.29
pyannote/pyannote-audio
78
9
8.67
facebookresearch/encodec
78
0
myshell-ai/OpenVoice
78
6
13.00
fishaudio/fish-speech
77
5
15.40
m-bain/whisperX
76
11
6.91
neonbjb/tortoise-tts
75
8
9.38
jasonppy/VoiceCraft
75
4
18.75
kyutai-labs/moshi
74
4
18.50
descriptinc/descript-audio-codec
73
6
12.17
facebookresearch/seamless_communication
72
3
24.00
jaywalnut310/vits
71
11
6.45
ZhangXInFD/SpeechTokenizer
69
2
34.50
huggingface/transformers
69
19
3.63
netease-youdao/EmotiVoice
69
5
13.80
yl4579/StyleTTS2
69
7
9.86
shivammehta25/Matcha-TTS
67
9
7.44
SWivid/F5-TTS
67
6
11.17
Plachtaa/VALL-E-X
67
3
22.33
lucidrains/audiolm-pytorch
66
2
33.00
snakers4/silero-vad
66
3
22.00
microsoft/unilm
65
8
8.12
haoheliu/AudioLDM
64
2
32.00
lucidrains/naturalspeech2-pytorch
64
3
21.33
archinetai/audio-diffusion-pytorch
62
4
15.50
s3prl/s3prl
62
7
8.86
kan-bayashi/ParallelWaveGAN
62
12
5.17
NVIDIA/BigVGAN
61
5
12.20
jik876/hifi-gan
61
14
4.36
bytedance/SALMONN
60
3
20.00
0nutation/SpeechGPT
60
1
60.00
sh-lee-prml/HierSpeechpp
60
2
30.00
Rikorose/DeepFilterNet
60
2
30.00
yangdongchao/UniAudio
60
2
30.00
aliutkus/speechmetrics
59
1
59.00
bootphon/phonemizer
59
3
19.67
facebookresearch/llama
58
1
58.00
svc-develop-team/so-vits-svc
58
6
9.67
AIGC-Audio/AudioGPT
58
0
ggerganov/whisper.cpp
58
8
7.25
jishengpeng/WavTokenizer
57
2
28.50
microsoft/DeepSpeed
57
0
gpt-omni/mini-omni
57
1
57.00
lifeiteng/vall-e
57
9
6.33
yangdongchao/AcademiCodec
57
6
9.50
CorentinJ/Real-Time-Voice-Cloning
56
10
5.60
meta-llama/llama3
56
1
56.00
huggingface/parler-tts
56
2
28.00
metavoiceio/metavoice-src
56
3
18.67
vllm-project/vllm
55
1
55.00
hpcaitech/Open-Sora
55
3
18.33
kaldi-asr/kaldi
54
25
2.16
QwenLM/Qwen-Audio
54
1
54.00
lhotse-speech/lhotse
54
10
5.40
Camb-ai/MARS5-TTS
54
3
18.00
lucidrains/vector-quantize-pytorch
54
4
13.50
liusongxiang/Large-Audio-Models
53
2
26.50
alibaba-damo-academy/FunCodec
53
2
26.50
KdaiP/StableTTS
53
3
17.67
ddlBoJack/emotion2vec
52
4
13.00
microsoft/SpeechT5
52
0
LAION-AI/CLAP
52
0
facebookresearch/AudioDec
52
3
17.33
archinetai/audio-ai-timeline
52
2
26.00
alibaba-damo-academy/FunASR
52
3
17.33
MontrealCorpusTools/Montreal-Forced-Aligner
51
2
25.50
huggingface/diffusers
51
2
25.50
FunAudioLLM/SenseVoice
51
1
51.00
BlinkDL/RWKV-LM
51
0
TencentGameMate/chinese_speech_pretrain
51
1
51.00
karpathy/minbpe
51
2
25.50
facebookresearch/DiT
51
0
Audio-AGI/AudioSep
51
1
51.00
facebookresearch/libri-light
51
1
51.00
ming024/FastSpeech2
51
6
8.50
DigitalPhonetics/IMS-Toucan
50
3
16.67
LAION-AI/audio-dataset
50
1
50.00
ga642381/speech-trident
50
2
25.00
microsoft/NeuralSpeech
50
1
50.00
hubertsiuzdak/snac
50
2
25.00
karpathy/nanoGPT
50
3
16.67
enhuiz/vall-e
50
2
25.00
iver56/audiomentations
50
3
16.67
jim-schwoebel/voice_datasets
50
6
8.33
haoheliu/versatile_audio_super_resolution
49
2
24.50
haoheliu/AudioLDM2
49
0
MoonInTheRiver/DiffSinger
49
6
8.17
huawei-noah/Speech-Backbones
48
3
16.00
NATSpeech/NATSpeech
48
5
9.60
Show More