Commit Graph

143 Commits

Author SHA1 Message Date
Yushen CHEN
0f9f878be1 Merge pull request #196 from thunn/add_pre_commit_tooling
add and run pre-commit with ruff
2024-10-21 13:19:15 +08:00
Tom Hunn
a4ca14b5f6 add and run pre-commit with ruff 2024-10-21 14:46:45 +10:00
SWivid
77e00db01b Use main voice if can't find voice tag or specified voice. 2024-10-21 11:53:44 +08:00
SWivid
2c0924378d add sanity check ensuring mono audio input for training 2024-10-21 04:14:52 +08:00
SWivid
5600d9079a minor fix. 2024-10-21 03:40:20 +08:00
SWivid
d3badb95cf fp16 inference only for cuda devices now 2024-10-21 03:34:28 +08:00
SWivid
bd16a8c281 minor fix for hf space 2024-10-21 02:52:11 +08:00
SWivid
03a20e0258 reorganize inference scripts with shared funcs 2024-10-21 02:21:13 +08:00
SWivid
b4f81425f3 disable fp16 for cpu device 2024-10-20 22:45:54 +08:00
SWivid
073092d0d3 Merge branch 'main' of github.com:SWivid/F5-TTS into main 2024-10-20 20:43:25 +08:00
SWivid
28b46d32d3 rewrite without einx & einops; clean up 2024-10-20 20:41:24 +08:00
chigkim
765a2ae390 Load model once in the beginning. 2024-10-20 08:01:22 -04:00
SWivid
554f3189e1 Use default fp16 inference 2024-10-20 16:32:18 +08:00
SWivid
69850fa236 fix. address #179 2024-10-20 12:43:01 +08:00
SWivid
aaf1fa7efa Update README.md 2024-10-20 12:32:58 +08:00
Zhikang Niu
f618db7290 Update README.md 2024-10-20 10:27:03 +08:00
Zhikang Niu
532fbe8f02 Merge pull request #166 from cocktailpeanut/wandb_usability
User-friendly wandb support
2024-10-20 10:25:17 +08:00
chigkim
8831701897 REorganized cli output to be less verbose. 2024-10-19 18:28:31 -04:00
SWivid
a016d6f89c fix address #178 2024-10-19 21:59:52 +08:00
Yushen CHEN
84cb6e5f00 Merge pull request #173 from lpscr/main
add new args in interface-cli.py for pass model and vocab
2024-10-19 17:03:10 +08:00
unknown
60f1b31446 update read me for new arg 2024-10-19 11:29:59 +03:00
unknown
5663bac2a8 add new arg for vocab_file and ckpt_file to easy load any model 2024-10-19 11:24:31 +03:00
Zhikang Niu
925ce4b0dd Update README.md 2024-10-19 12:31:54 +08:00
jpgallegoar
501a566dcf Merge pull request #170 from lpscr/main
fix max_sample
2024-10-19 00:40:16 +02:00
unknown
b87e46095b fix max_sample 2024-10-19 00:14:13 +03:00
cocktailpeanut
ae6e97b836 user-friendly wandb support 2024-10-18 14:59:59 -04:00
Yushen CHEN
182b0f08e4 Merge pull request #149 from lpscr/main
fix problem error about miss parametre in finetune-cli.py
2024-10-18 13:04:40 +08:00
Yushen CHEN
852fb3245a Merge pull request #152 from SWivid/hf-spaces-fix
Fix for HF Spaces demo
2024-10-18 13:02:21 +08:00
Zhikang Niu
5fa56825be Create Dockerfile 2024-10-18 11:23:01 +08:00
unknown
66062f9916 replace python with accelerate 2024-10-18 01:18:22 +03:00
mrfakename
32cdd210a2 Fix for HF Spaces demo 2024-10-17 14:52:42 -07:00
unknown
34ccbcb451 cache all error messages and add support to fix mac gpu issues 2024-10-18 00:42:16 +03:00
lpscr
549ee89b74 Merge branch 'SWivid:main' into main 2024-10-17 22:44:36 +03:00
unknown
3f3743eda4 add finetune miss 2024-10-17 22:43:18 +03:00
Yushen CHEN
18f526dd25 Merge pull request #148 from jpgallegoar/main
Increase number of speech types.
2024-10-18 02:56:06 +08:00
Yushen CHEN
12304bfafa Merge pull request #147 from cocktailpeanut/main
allow multiple audio files for finetune UI
2024-10-18 02:55:42 +08:00
jpgallegoar
8b150b04a4 Increase number of speech types. 2024-10-17 19:29:36 +02:00
cocktailpeanut
a77c244a16 allow multiple files 2024-10-17 13:13:33 -04:00
SWivid
39a513e9dd Update README.md 2024-10-18 01:08:05 +08:00
Yushen CHEN
7dbdedf4a5 Merge pull request #146 from chigkim/multivoice
Multivoice CLI Similar to Gradio App
2024-10-18 01:03:17 +08:00
Yushen CHEN
37d333c528 Update README.md 2024-10-18 00:52:42 +08:00
Yushen CHEN
c0bd2e091a Update README.md 2024-10-18 00:50:31 +08:00
Yushen CHEN
cca3a69d01 Merge pull request #125 from lpscr/main
gradio_finetune
2024-10-18 00:45:49 +08:00
Chi Kim
cfa9382a57 Multivoice 2024-10-17 12:08:46 -04:00
unknown
68718023ea add new tab to check if it is possible to train in this language 2024-10-17 15:51:19 +03:00
unknown
ed0d78e0bf auto settings and reduse new tab 2024-10-17 14:40:39 +03:00
Zhikang Niu
1a09b80482 Update README.md 2024-10-17 12:25:13 +08:00
unknown
bf14be8ea3 remove path test 2024-10-16 23:55:12 +03:00
unknown
44216b443e update 2024-10-16 23:31:39 +03:00
Yushen CHEN
147fc2cfc0 Merge pull request #127 from kunibald413/main
add csv wavs data prep script
2024-10-17 00:15:34 +08:00