unknown
|
e6f3e50eb1
|
Fix the glitch effect at the beginning audio
|
2024-11-04 17:05:16 +02:00 |
|
Yushen CHEN
|
c1c20ed009
|
Update socket_server.py, to pass format check
|
2024-11-04 17:11:29 +08:00 |
|
Rino
|
24cfa9ecb9
|
Update README.md
|
2024-11-04 15:50:15 +07:00 |
|
Rino
|
c129dd7ba4
|
Rename socket.py to socket_server.py
[bug fix] due to circular import, can't use socket as file name
|
2024-11-04 15:48:09 +07:00 |
|
Rino
|
a83e764110
|
Update socket.py
[edit] adjusting mel_spec_type on load_model use case
|
2024-11-04 15:46:00 +07:00 |
|
SWivid
|
61ff2a62d9
|
formatting #363, credit to @JarodMica, also dur_pred check fork repo
|
2024-11-03 16:37:47 +08:00 |
|
Rino
|
6e24f1ea78
|
Merge branch 'SWivid:main' into main
|
2024-11-03 11:40:25 +07:00 |
|
SWivid
|
ea90244d62
|
fix. add dtype check for asr pipeline addressing #356
|
2024-11-02 13:48:37 +08:00 |
|
SWivid
|
f7e248e2ce
|
formatting
|
2024-11-02 12:58:28 +08:00 |
|
Rino
|
0fe34a862c
|
Merge branch 'SWivid:main' into main
|
2024-11-02 01:54:18 +07:00 |
|
Justin John
|
183ad09084
|
Ensure tensors are moved to CPU before saving with torchaudio
|
2024-11-01 23:47:00 +05:30 |
|
SWivid
|
b0f482421b
|
fix-patch-2 for #361
|
2024-11-01 19:14:46 +08:00 |
|
Rino
|
e12fe350f5
|
Merge branch 'SWivid:main' into main
|
2024-11-01 17:44:04 +07:00 |
|
SWivid
|
c370c81897
|
Merge branch 'main' of github.com:SWivid/F5-TTS into main
|
2024-11-01 18:43:30 +08:00 |
|
SWivid
|
0622087b82
|
add backward compatibility addressing #361
|
2024-11-01 18:43:02 +08:00 |
|
Yushen CHEN
|
b664bc7777
|
Update finetune_gradio.py
|
2024-11-01 18:20:39 +08:00 |
|
Yushen CHEN
|
552c0fd99c
|
Update prepare_csv_wavs.py
|
2024-11-01 18:17:57 +08:00 |
|
unknown
|
27d98a52cd
|
clear pipe
|
2024-11-01 12:08:47 +02:00 |
|
lpscr
|
9984a48041
|
Merge branch 'SWivid:main' into main
|
2024-11-01 12:06:52 +02:00 |
|
unknown
|
199c56c23c
|
clear pipe
|
2024-11-01 12:05:12 +02:00 |
|
unknown
|
f7a698bc2f
|
resample when need
|
2024-11-01 11:39:06 +02:00 |
|
unknown
|
5af195f1f9
|
only mono duraction fix value bfp16 to bf16
|
2024-11-01 11:18:05 +02:00 |
|
Rino
|
db902761c3
|
Update socket.py
[remove] unused var
|
2024-11-01 15:14:52 +07:00 |
|
Rino
|
e8f14072ec
|
Merge branch 'SWivid:main' into main
|
2024-11-01 15:06:49 +07:00 |
|
SWivid
|
2a3deaab33
|
fix. update tgt_sr def for log_sample with new mel_spec module
|
2024-11-01 15:42:19 +08:00 |
|
SWivid
|
315230210d
|
minor fix
|
2024-11-01 15:11:48 +08:00 |
|
Yushen CHEN
|
305e3eab35
|
Merge pull request #345 from ZhikangNiu/main
[WIP]Support Bigvgan vocoder
|
2024-11-01 14:37:30 +08:00 |
|
ZhikangNiu
|
18e1ab508f
|
refactor: del global params and set vocos as default vocoder, add dtype check
|
2024-11-01 14:17:22 +08:00 |
|
Rino
|
8713de5ffe
|
Merge branch 'SWivid:main' into main
|
2024-11-01 13:11:20 +07:00 |
|
SWivid
|
7a6fb0eb4e
|
fix address #348
|
2024-11-01 14:09:41 +08:00 |
|
Rino
|
561c67387d
|
Update socket.py
[edit] socket.py calling vocab, ckpt, ref audio
|
2024-11-01 11:22:03 +07:00 |
|
Rino Alfian
|
1c8fe499ef
|
Update README.md
socket client realtime stream
|
2024-11-01 10:13:36 +07:00 |
|
Rino Alfian
|
1e3fac8f5e
|
[add] socket.py
to play stream socket mode
|
2024-11-01 10:10:31 +07:00 |
|
ZhikangNiu
|
b180961782
|
refactor: more details about bigvgan, clear function definition
|
2024-11-01 11:02:39 +08:00 |
|
ZhikangNiu
|
36a4aad668
|
change some infer function to support two vocoder
|
2024-10-31 22:44:45 +08:00 |
|
ZhikangNiu
|
712d52772e
|
update Bigvgan vocoder and F5-bigvgan version, trained on Emilia ZH&EN, 1.25m updates
|
2024-10-31 20:06:36 +08:00 |
|
Yushen CHEN
|
6cbb548f9c
|
Update api.py
|
2024-10-31 11:39:23 +08:00 |
|
unknown
|
3dd59b8cdf
|
when ref_text empty automatic transcribing
|
2024-10-30 14:26:13 +02:00 |
|
unknown
|
02d59131c4
|
fix when none tts_api
|
2024-10-30 14:25:16 +02:00 |
|
unknown
|
9c4fc38fa4
|
fix
|
2024-10-30 12:23:02 +02:00 |
|
unknown
|
77620f602f
|
suport more format
|
2024-10-30 11:41:42 +02:00 |
|
unknown
|
6970556abf
|
fix logger in settings miss
|
2024-10-30 11:21:47 +02:00 |
|
unknown
|
513e6b466e
|
add note for more format
|
2024-10-30 11:14:39 +02:00 |
|
unknown
|
9eb0d1d226
|
add to suport audio name , audio.wav , full audio path , add note
|
2024-10-30 11:09:22 +02:00 |
|
unknown
|
e9419764d7
|
add to suport audio name , audio.wav , full audio path
|
2024-10-30 11:03:26 +02:00 |
|
SWivid
|
aaa92f6e6d
|
finish trainer modification
|
2024-10-30 03:57:09 +08:00 |
|
SWivid
|
87c4f9ff06
|
minor fix
|
2024-10-30 03:23:22 +08:00 |
|
SWivid
|
381ea0c82c
|
fix vocoder loading
|
2024-10-30 03:16:09 +08:00 |
|
SWivid
|
da1b40968a
|
basic structure
|
2024-10-30 02:38:58 +08:00 |
|
SWivid
|
5b10099d33
|
Merge branch 'main' of github.com:lpscr/F5-TTS into lpscr-main
|
2024-10-29 23:36:58 +08:00 |
|