Commit Graph

327 Commits

Author SHA1 Message Date
SWivid
be53fb179c fix. force reload from custom cache file to override infer_gradio init loaded value in case webpage refreshed 2024-11-11 14:46:50 +08:00
Yushen CHEN
da059e86b2 Update SHARED.md; typo 2024-11-11 11:20:11 +08:00
SWivid
a23ec25b39 v0.1.1 2024-11-11 11:19:04 +08:00
SWivid
59761ec9f8 Update. Cache last used custom model path #447 2024-11-11 11:13:11 +08:00
Yushen CHEN
e6e7bd01d3 Merge pull request #448 from peanutcocktail/main
Set PYTORCH_ENABLE_MPS_FALLBACK=1 for MPS by default
2024-11-11 07:11:19 +08:00
cocktailpeanut
5635eac12d PYTORCH_ENABLE_MPS_FALLBACK 2024-11-10 15:36:22 -05:00
SWivid
3fcdbc70b4 minor update 2024-11-09 04:54:06 +08:00
SWivid
dcd9a19889 v0.1.0. Add custom model support for local deploy; add share model cards, etc. 2024-11-09 04:35:44 +08:00
SWivid
23409af801 Update infer-gradio with ref_text auto-filling; minor fix 2024-11-09 01:14:54 +08:00
SWivid
c33a83c009 fix. better asr_transcript cache logic; global model select init. 2024-11-08 20:57:47 +08:00
Yushen CHEN
e78ae2ce92 Create config.yml 2024-11-07 14:23:52 +08:00
Yushen CHEN
da8853838f Update question.yml 2024-11-07 13:23:35 +08:00
Yushen CHEN
8a7e8495ff Merge pull request #401 from hcsolakoglu/fix-finetune-cli-bnb-optimizer
Add --bnb_optimizer argument to CLI and pass it to Trainer initialization
2024-11-05 20:20:31 +08:00
Yushen CHEN
dbe35da754 Update finetune_cli.py; formatting 2024-11-05 20:19:53 +08:00
hcsolakoglu
2033993f57 Add --bnb_optimizer argument to CLI and pass it to Trainer initialization
Add `--bnb_optimizer` argument to CLI and pass it to Trainer initialization.

* Add `--bnb_optimizer` argument to `parse_args()` function in `src/f5_tts/train/finetune_cli.py`.
* Pass `bnb_optimizer` argument to `Trainer` initialization in the `main()` function of `src/f5_tts/train/finetune_cli.py`.
2024-11-05 15:11:37 +03:00
SWivid
4a69e6bad2 formatting 2024-11-05 00:02:21 +08:00
Yushen CHEN
8718b1d821 Merge pull request #394 from lpscr/main
Fix the glitch effect at the beginning audio
2024-11-05 00:01:48 +08:00
Yushen CHEN
2f914140f4 Update utils_infer.py 2024-11-05 00:00:29 +08:00
unknown
e6f3e50eb1 Fix the glitch effect at the beginning audio 2024-11-04 17:05:16 +02:00
Yushen CHEN
b03e9b2952 Merge pull request #389 from kunci115/main
Bug fix
2024-11-04 17:12:23 +08:00
Yushen CHEN
c1c20ed009 Update socket_server.py, to pass format check 2024-11-04 17:11:29 +08:00
Rino
24cfa9ecb9 Update README.md 2024-11-04 15:50:15 +07:00
Rino
c129dd7ba4 Rename socket.py to socket_server.py
[bug fix] due to circular import, can't use socket as file name
2024-11-04 15:48:09 +07:00
Rino
a83e764110 Update socket.py
[edit] adjusting mel_spec_type on load_model use case
2024-11-04 15:46:00 +07:00
SWivid
ac77a76cd3 add issue templates 2024-11-04 02:17:52 +08:00
SWivid
61ff2a62d9 formatting #363, credit to @JarodMica, also dur_pred check fork repo 2024-11-03 16:37:47 +08:00
Yushen CHEN
1085b73f59 Merge pull request #354 from kunci115/main
[add] socket stream
2024-11-03 16:24:57 +08:00
Rino
6e24f1ea78 Merge branch 'SWivid:main' into main 2024-11-03 11:40:25 +07:00
SWivid
ea90244d62 fix. add dtype check for asr pipeline addressing #356 2024-11-02 13:48:37 +08:00
SWivid
f7e248e2ce formatting 2024-11-02 12:58:28 +08:00
Zhikang Niu
dc67a6819c Merge pull request #367 from justinjohn0306/main
Ensure tensors are moved to CPU before saving with torchaudio
2024-11-02 11:15:41 +08:00
Rino
0fe34a862c Merge branch 'SWivid:main' into main 2024-11-02 01:54:18 +07:00
Justin John
183ad09084 Ensure tensors are moved to CPU before saving with torchaudio 2024-11-01 23:47:00 +05:30
SWivid
b0f482421b fix-patch-2 for #361 2024-11-01 19:14:46 +08:00
Rino
e12fe350f5 Merge branch 'SWivid:main' into main 2024-11-01 17:44:04 +07:00
SWivid
c370c81897 Merge branch 'main' of github.com:SWivid/F5-TTS into main 2024-11-01 18:43:30 +08:00
SWivid
0622087b82 add backward compatibility addressing #361 2024-11-01 18:43:02 +08:00
Yushen CHEN
11d2886e47 Merge pull request #359 from lpscr/main
small update gradio finetune
2024-11-01 18:22:46 +08:00
Yushen CHEN
b664bc7777 Update finetune_gradio.py 2024-11-01 18:20:39 +08:00
Yushen CHEN
552c0fd99c Update prepare_csv_wavs.py 2024-11-01 18:17:57 +08:00
unknown
27d98a52cd clear pipe 2024-11-01 12:08:47 +02:00
lpscr
9984a48041 Merge branch 'SWivid:main' into main 2024-11-01 12:06:52 +02:00
unknown
199c56c23c clear pipe 2024-11-01 12:05:12 +02:00
unknown
f7a698bc2f resample when need 2024-11-01 11:39:06 +02:00
unknown
5af195f1f9 only mono duraction fix value bfp16 to bf16 2024-11-01 11:18:05 +02:00
Rino
db902761c3 Update socket.py
[remove] unused var
2024-11-01 15:14:52 +07:00
Rino
e8f14072ec Merge branch 'SWivid:main' into main 2024-11-01 15:06:49 +07:00
SWivid
2a3deaab33 fix. update tgt_sr def for log_sample with new mel_spec module 2024-11-01 15:42:19 +08:00
SWivid
315230210d minor fix 2024-11-01 15:11:48 +08:00
Yushen CHEN
305e3eab35 Merge pull request #345 from ZhikangNiu/main
[WIP]Support Bigvgan vocoder
2024-11-01 14:37:30 +08:00