Yushen CHEN
|
b03e9b2952
|
Merge pull request #389 from kunci115/main
Bug fix
|
2024-11-04 17:12:23 +08:00 |
|
Yushen CHEN
|
c1c20ed009
|
Update socket_server.py, to pass format check
|
2024-11-04 17:11:29 +08:00 |
|
Rino
|
24cfa9ecb9
|
Update README.md
|
2024-11-04 15:50:15 +07:00 |
|
Rino
|
c129dd7ba4
|
Rename socket.py to socket_server.py
[bug fix] due to circular import, can't use socket as file name
|
2024-11-04 15:48:09 +07:00 |
|
Rino
|
a83e764110
|
Update socket.py
[edit] adjusting mel_spec_type on load_model use case
|
2024-11-04 15:46:00 +07:00 |
|
SWivid
|
ac77a76cd3
|
add issue templates
|
2024-11-04 02:17:52 +08:00 |
|
SWivid
|
61ff2a62d9
|
formatting #363, credit to @JarodMica, also dur_pred check fork repo
|
2024-11-03 16:37:47 +08:00 |
|
Yushen CHEN
|
1085b73f59
|
Merge pull request #354 from kunci115/main
[add] socket stream
|
2024-11-03 16:24:57 +08:00 |
|
Rino
|
6e24f1ea78
|
Merge branch 'SWivid:main' into main
|
2024-11-03 11:40:25 +07:00 |
|
SWivid
|
ea90244d62
|
fix. add dtype check for asr pipeline addressing #356
|
2024-11-02 13:48:37 +08:00 |
|
SWivid
|
f7e248e2ce
|
formatting
|
2024-11-02 12:58:28 +08:00 |
|
Zhikang Niu
|
dc67a6819c
|
Merge pull request #367 from justinjohn0306/main
Ensure tensors are moved to CPU before saving with torchaudio
|
2024-11-02 11:15:41 +08:00 |
|
Rino
|
0fe34a862c
|
Merge branch 'SWivid:main' into main
|
2024-11-02 01:54:18 +07:00 |
|
Justin John
|
183ad09084
|
Ensure tensors are moved to CPU before saving with torchaudio
|
2024-11-01 23:47:00 +05:30 |
|
SWivid
|
b0f482421b
|
fix-patch-2 for #361
|
2024-11-01 19:14:46 +08:00 |
|
Rino
|
e12fe350f5
|
Merge branch 'SWivid:main' into main
|
2024-11-01 17:44:04 +07:00 |
|
SWivid
|
c370c81897
|
Merge branch 'main' of github.com:SWivid/F5-TTS into main
|
2024-11-01 18:43:30 +08:00 |
|
SWivid
|
0622087b82
|
add backward compatibility addressing #361
|
2024-11-01 18:43:02 +08:00 |
|
Yushen CHEN
|
11d2886e47
|
Merge pull request #359 from lpscr/main
small update gradio finetune
|
2024-11-01 18:22:46 +08:00 |
|
Yushen CHEN
|
b664bc7777
|
Update finetune_gradio.py
|
2024-11-01 18:20:39 +08:00 |
|
Yushen CHEN
|
552c0fd99c
|
Update prepare_csv_wavs.py
|
2024-11-01 18:17:57 +08:00 |
|
unknown
|
27d98a52cd
|
clear pipe
|
2024-11-01 12:08:47 +02:00 |
|
lpscr
|
9984a48041
|
Merge branch 'SWivid:main' into main
|
2024-11-01 12:06:52 +02:00 |
|
unknown
|
199c56c23c
|
clear pipe
|
2024-11-01 12:05:12 +02:00 |
|
unknown
|
f7a698bc2f
|
resample when need
|
2024-11-01 11:39:06 +02:00 |
|
unknown
|
5af195f1f9
|
only mono duraction fix value bfp16 to bf16
|
2024-11-01 11:18:05 +02:00 |
|
Rino
|
db902761c3
|
Update socket.py
[remove] unused var
|
2024-11-01 15:14:52 +07:00 |
|
Rino
|
e8f14072ec
|
Merge branch 'SWivid:main' into main
|
2024-11-01 15:06:49 +07:00 |
|
SWivid
|
2a3deaab33
|
fix. update tgt_sr def for log_sample with new mel_spec module
|
2024-11-01 15:42:19 +08:00 |
|
SWivid
|
315230210d
|
minor fix
|
2024-11-01 15:11:48 +08:00 |
|
Yushen CHEN
|
305e3eab35
|
Merge pull request #345 from ZhikangNiu/main
[WIP]Support Bigvgan vocoder
|
2024-11-01 14:37:30 +08:00 |
|
ZhikangNiu
|
18e1ab508f
|
refactor: del global params and set vocos as default vocoder, add dtype check
|
2024-11-01 14:17:22 +08:00 |
|
Rino
|
8713de5ffe
|
Merge branch 'SWivid:main' into main
|
2024-11-01 13:11:20 +07:00 |
|
SWivid
|
7a6fb0eb4e
|
fix address #348
|
2024-11-01 14:09:41 +08:00 |
|
Rino
|
561c67387d
|
Update socket.py
[edit] socket.py calling vocab, ckpt, ref audio
|
2024-11-01 11:22:03 +07:00 |
|
Rino Alfian
|
1c8fe499ef
|
Update README.md
socket client realtime stream
|
2024-11-01 10:13:36 +07:00 |
|
Rino Alfian
|
1e3fac8f5e
|
[add] socket.py
to play stream socket mode
|
2024-11-01 10:10:31 +07:00 |
|
ZhikangNiu
|
b180961782
|
refactor: more details about bigvgan, clear function definition
|
2024-11-01 11:02:39 +08:00 |
|
ZhikangNiu
|
36a4aad668
|
change some infer function to support two vocoder
|
2024-10-31 22:44:45 +08:00 |
|
ZhikangNiu
|
712d52772e
|
update Bigvgan vocoder and F5-bigvgan version, trained on Emilia ZH&EN, 1.25m updates
|
2024-10-31 20:06:36 +08:00 |
|
Yushen CHEN
|
dee0420b59
|
Merge pull request #334 from lpscr/main
small fix stuff
|
2024-10-31 11:44:58 +08:00 |
|
Yushen CHEN
|
6cbb548f9c
|
Update api.py
|
2024-10-31 11:39:23 +08:00 |
|
unknown
|
3dd59b8cdf
|
when ref_text empty automatic transcribing
|
2024-10-30 14:26:13 +02:00 |
|
unknown
|
02d59131c4
|
fix when none tts_api
|
2024-10-30 14:25:16 +02:00 |
|
Yushen CHEN
|
7dd3eecf2d
|
Merge pull request #333 from lpscr/main
Support more formats.
|
2024-10-30 18:28:07 +08:00 |
|
unknown
|
9c4fc38fa4
|
fix
|
2024-10-30 12:23:02 +02:00 |
|
unknown
|
77620f602f
|
suport more format
|
2024-10-30 11:41:42 +02:00 |
|
Yushen CHEN
|
739c4a1823
|
Merge pull request #332 from lpscr/main
gradio quick update suport for create metadata
|
2024-10-30 17:23:47 +08:00 |
|
unknown
|
6970556abf
|
fix logger in settings miss
|
2024-10-30 11:21:47 +02:00 |
|
unknown
|
513e6b466e
|
add note for more format
|
2024-10-30 11:14:39 +02:00 |
|