lpscr
|
cd3c4afa69
|
fix. #213 correct device initialization
|
2024-10-22 17:48:48 +08:00 |
|
SWivid
|
f8eb8ab740
|
Update README.md
|
2024-10-22 13:15:19 +08:00 |
|
SWivid
|
92e5f55c46
|
Merge branch 'main' of github.com:SWivid/F5-TTS into main
|
2024-10-22 01:16:32 +08:00 |
|
SWivid
|
992dbb5c24
|
fix save last ckpt. make sure work paid off
|
2024-10-22 01:15:45 +08:00 |
|
lpscr
|
99190a03cb
|
add model test in finetune and update some stuff (#207)
* add test model tab and some updates
* small updates label audios
* small updates label text
* small updates and to or in model load
|
2024-10-22 00:28:28 +08:00 |
|
SWivid
|
256f3f1320
|
Update. change asr pipeline back to whisper-large-v3-turbo
|
2024-10-21 22:17:44 +08:00 |
|
lpscr
|
a79199e54d
|
small fix in api remove_silence (#201)
* fix remove_silence
* change remove_silence false
* add seed value
|
2024-10-21 18:43:11 +08:00 |
|
SWivid
|
e80addf1e8
|
fix. utils_infer.py ref_audio_len misplaced
|
2024-10-21 18:36:07 +08:00 |
|
SWivid
|
d15ef3679a
|
fix address #191
|
2024-10-21 17:55:58 +08:00 |
|
SWivid
|
b899a35b88
|
load asr pipeline only if needed
|
2024-10-21 17:45:06 +08:00 |
|
Haitao
|
795cb19e4f
|
allow for passing in custom mel spec module (#200)
|
2024-10-21 17:00:48 +08:00 |
|
lpscr
|
25cdc5182f
|
add api for easy use (#186)
* add api
* update infer limits
|
2024-10-21 16:57:24 +08:00 |
|
Yushen CHEN
|
0f9f878be1
|
Merge pull request #196 from thunn/add_pre_commit_tooling
add and run pre-commit with ruff
|
2024-10-21 13:19:15 +08:00 |
|
Tom Hunn
|
a4ca14b5f6
|
add and run pre-commit with ruff
|
2024-10-21 14:46:45 +10:00 |
|
SWivid
|
77e00db01b
|
Use main voice if can't find voice tag or specified voice.
|
2024-10-21 11:53:44 +08:00 |
|
SWivid
|
2c0924378d
|
add sanity check ensuring mono audio input for training
|
2024-10-21 04:14:52 +08:00 |
|
SWivid
|
5600d9079a
|
minor fix.
|
2024-10-21 03:40:20 +08:00 |
|
SWivid
|
d3badb95cf
|
fp16 inference only for cuda devices now
|
2024-10-21 03:34:28 +08:00 |
|
SWivid
|
bd16a8c281
|
minor fix for hf space
|
2024-10-21 02:52:11 +08:00 |
|
SWivid
|
03a20e0258
|
reorganize inference scripts with shared funcs
|
2024-10-21 02:21:13 +08:00 |
|
SWivid
|
b4f81425f3
|
disable fp16 for cpu device
|
2024-10-20 22:45:54 +08:00 |
|
SWivid
|
073092d0d3
|
Merge branch 'main' of github.com:SWivid/F5-TTS into main
|
2024-10-20 20:43:25 +08:00 |
|
SWivid
|
28b46d32d3
|
rewrite without einx & einops; clean up
|
2024-10-20 20:41:24 +08:00 |
|
chigkim
|
765a2ae390
|
Load model once in the beginning.
|
2024-10-20 08:01:22 -04:00 |
|
SWivid
|
554f3189e1
|
Use default fp16 inference
|
2024-10-20 16:32:18 +08:00 |
|
SWivid
|
69850fa236
|
fix. address #179
|
2024-10-20 12:43:01 +08:00 |
|
SWivid
|
aaf1fa7efa
|
Update README.md
|
2024-10-20 12:32:58 +08:00 |
|
Zhikang Niu
|
f618db7290
|
Update README.md
|
2024-10-20 10:27:03 +08:00 |
|
Zhikang Niu
|
532fbe8f02
|
Merge pull request #166 from cocktailpeanut/wandb_usability
User-friendly wandb support
|
2024-10-20 10:25:17 +08:00 |
|
chigkim
|
8831701897
|
Reorganized CLI output to be less verbose.
|
2024-10-19 18:28:31 -04:00 |
|
SWivid
|
a016d6f89c
|
fix address #178
|
2024-10-19 21:59:52 +08:00 |
|
Yushen CHEN
|
84cb6e5f00
|
Merge pull request #173 from lpscr/main
add new args in interface-cli.py for pass model and vocab
|
2024-10-19 17:03:10 +08:00 |
|
unknown
|
60f1b31446
|
update read me for new arg
|
2024-10-19 11:29:59 +03:00 |
|
unknown
|
5663bac2a8
|
add new args for vocab_file and ckpt_file to easily load any model
|
2024-10-19 11:24:31 +03:00 |
|
Zhikang Niu
|
925ce4b0dd
|
Update README.md
|
2024-10-19 12:31:54 +08:00 |
|
jpgallegoar
|
501a566dcf
|
Merge pull request #170 from lpscr/main
fix max_sample
|
2024-10-19 00:40:16 +02:00 |
|
unknown
|
b87e46095b
|
fix max_sample
|
2024-10-19 00:14:13 +03:00 |
|
cocktailpeanut
|
ae6e97b836
|
user-friendly wandb support
|
2024-10-18 14:59:59 -04:00 |
|
Yushen CHEN
|
182b0f08e4
|
Merge pull request #149 from lpscr/main
fix error about missing parameter in finetune-cli.py
|
2024-10-18 13:04:40 +08:00 |
|
Yushen CHEN
|
852fb3245a
|
Merge pull request #152 from SWivid/hf-spaces-fix
Fix for HF Spaces demo
|
2024-10-18 13:02:21 +08:00 |
|
Zhikang Niu
|
5fa56825be
|
Create Dockerfile
|
2024-10-18 11:23:01 +08:00 |
|
unknown
|
66062f9916
|
replace python with accelerate
|
2024-10-18 01:18:22 +03:00 |
|
mrfakename
|
32cdd210a2
|
Fix for HF Spaces demo
|
2024-10-17 14:52:42 -07:00 |
|
unknown
|
34ccbcb451
|
cache all error messages and add support to fix mac gpu issues
|
2024-10-18 00:42:16 +03:00 |
|
lpscr
|
549ee89b74
|
Merge branch 'SWivid:main' into main
|
2024-10-17 22:44:36 +03:00 |
|
unknown
|
3f3743eda4
|
add finetune miss
|
2024-10-17 22:43:18 +03:00 |
|
Yushen CHEN
|
18f526dd25
|
Merge pull request #148 from jpgallegoar/main
Increase number of speech types.
|
2024-10-18 02:56:06 +08:00 |
|
Yushen CHEN
|
12304bfafa
|
Merge pull request #147 from cocktailpeanut/main
allow multiple audio files for finetune UI
|
2024-10-18 02:55:42 +08:00 |
|
jpgallegoar
|
8b150b04a4
|
Increase number of speech types.
|
2024-10-17 19:29:36 +02:00 |
|
cocktailpeanut
|
a77c244a16
|
allow multiple files
|
2024-10-17 13:13:33 -04:00 |
|