Commit Graph

115 Commits

Author SHA1 Message Date
unknown
e6f3e50eb1 Fix the glitch effect at the beginning audio 2024-11-04 17:05:16 +02:00
Yushen CHEN
c1c20ed009 Update socket_server.py, to pass format check 2024-11-04 17:11:29 +08:00
Rino
24cfa9ecb9 Update README.md 2024-11-04 15:50:15 +07:00
Rino
c129dd7ba4 Rename socket.py to socket_server.py
[bug fix] due to circular import, can't use socket as file name
2024-11-04 15:48:09 +07:00
Rino
a83e764110 Update socket.py
[edit] adjusting mel_spec_type on load_model use case
2024-11-04 15:46:00 +07:00
SWivid
61ff2a62d9 formatting #363, credit to @JarodMica, also dur_pred check fork repo 2024-11-03 16:37:47 +08:00
Rino
6e24f1ea78 Merge branch 'SWivid:main' into main 2024-11-03 11:40:25 +07:00
SWivid
ea90244d62 fix. add dtype check for asr pipeline addressing #356 2024-11-02 13:48:37 +08:00
SWivid
f7e248e2ce formatting 2024-11-02 12:58:28 +08:00
Rino
0fe34a862c Merge branch 'SWivid:main' into main 2024-11-02 01:54:18 +07:00
Justin John
183ad09084 Ensure tensors are moved to CPU before saving with torchaudio 2024-11-01 23:47:00 +05:30
SWivid
b0f482421b fix-patch-2 for #361 2024-11-01 19:14:46 +08:00
Rino
e12fe350f5 Merge branch 'SWivid:main' into main 2024-11-01 17:44:04 +07:00
SWivid
c370c81897 Merge branch 'main' of github.com:SWivid/F5-TTS into main 2024-11-01 18:43:30 +08:00
SWivid
0622087b82 add backward compatibility addressing #361 2024-11-01 18:43:02 +08:00
Yushen CHEN
b664bc7777 Update finetune_gradio.py 2024-11-01 18:20:39 +08:00
Yushen CHEN
552c0fd99c Update prepare_csv_wavs.py 2024-11-01 18:17:57 +08:00
unknown
27d98a52cd clear pipe 2024-11-01 12:08:47 +02:00
lpscr
9984a48041 Merge branch 'SWivid:main' into main 2024-11-01 12:06:52 +02:00
unknown
199c56c23c clear pipe 2024-11-01 12:05:12 +02:00
unknown
f7a698bc2f resample when need 2024-11-01 11:39:06 +02:00
unknown
5af195f1f9 only mono duration fix value bfp16 to bf16 2024-11-01 11:18:05 +02:00
Rino
db902761c3 Update socket.py
[remove] unused var
2024-11-01 15:14:52 +07:00
Rino
e8f14072ec Merge branch 'SWivid:main' into main 2024-11-01 15:06:49 +07:00
SWivid
2a3deaab33 fix. update tgt_sr def for log_sample with new mel_spec module 2024-11-01 15:42:19 +08:00
SWivid
315230210d minor fix 2024-11-01 15:11:48 +08:00
Yushen CHEN
305e3eab35 Merge pull request #345 from ZhikangNiu/main
[WIP]Support Bigvgan vocoder
2024-11-01 14:37:30 +08:00
ZhikangNiu
18e1ab508f refactor: del global params and set vocos as default vocoder, add dtype check 2024-11-01 14:17:22 +08:00
Rino
8713de5ffe Merge branch 'SWivid:main' into main 2024-11-01 13:11:20 +07:00
SWivid
7a6fb0eb4e fix address #348 2024-11-01 14:09:41 +08:00
Rino
561c67387d Update socket.py
[edit] socket.py calling vocab, ckpt, ref audio
2024-11-01 11:22:03 +07:00
Rino Alfian
1c8fe499ef Update README.md
socket client realtime stream
2024-11-01 10:13:36 +07:00
Rino Alfian
1e3fac8f5e [add] socket.py
to play stream socket mode
2024-11-01 10:10:31 +07:00
ZhikangNiu
b180961782 refactor: more details about bigvgan, clear function definition 2024-11-01 11:02:39 +08:00
ZhikangNiu
36a4aad668 change some infer function to support two vocoder 2024-10-31 22:44:45 +08:00
ZhikangNiu
712d52772e update Bigvgan vocoder and F5-bigvgan version, trained on Emilia ZH&EN, 1.25m updates 2024-10-31 20:06:36 +08:00
Yushen CHEN
6cbb548f9c Update api.py 2024-10-31 11:39:23 +08:00
unknown
3dd59b8cdf when ref_text empty automatic transcribing 2024-10-30 14:26:13 +02:00
unknown
02d59131c4 fix when none tts_api 2024-10-30 14:25:16 +02:00
unknown
9c4fc38fa4 fix 2024-10-30 12:23:02 +02:00
unknown
77620f602f support more format 2024-10-30 11:41:42 +02:00
unknown
6970556abf fix logger in settings miss 2024-10-30 11:21:47 +02:00
unknown
513e6b466e add note for more format 2024-10-30 11:14:39 +02:00
unknown
9eb0d1d226 add to support audio name, audio.wav, full audio path, add note 2024-10-30 11:09:22 +02:00
unknown
e9419764d7 add to support audio name, audio.wav, full audio path 2024-10-30 11:03:26 +02:00
SWivid
aaa92f6e6d finish trainer modification 2024-10-30 03:57:09 +08:00
SWivid
87c4f9ff06 minor fix 2024-10-30 03:23:22 +08:00
SWivid
381ea0c82c fix vocoder loading 2024-10-30 03:16:09 +08:00
SWivid
da1b40968a basic structure 2024-10-30 02:38:58 +08:00
SWivid
5b10099d33 Merge branch 'main' of github.com:lpscr/F5-TTS into lpscr-main 2024-10-29 23:36:58 +08:00