Commit Graph

249 Commits

Author SHA1 Message Date
SWivid
2ccfbdcc61 Merge branch 'main' of github.com:lpscr/F5-TTS into lpscr-main 2024-10-29 23:34:00 +08:00
unknown
d5c307b56a add logger 2024-10-29 17:30:28 +02:00
unknown
8a5041ef9f update 2024-10-29 17:14:03 +02:00
unknown
886500ac97 update 2024-10-29 17:02:21 +02:00
unknown
2ca1fb7c25 update 2024-10-29 16:46:38 +02:00
unknown
3409192662 update 2024-10-29 16:44:23 +02:00
unknown
37eb3b50da add tensorboard and add export sample for mel and audio 2024-10-29 14:20:50 +02:00
jpgallegoar
5f6dcc7e11 Merge pull request #316 from Jxspa/new_branch
Add text chat function
2024-10-29 13:12:39 +01:00
J
6667d6f501 Add optional text chat function 2024-10-29 10:59:42 +00:00
Jarod Mica
d601a70ad1 Revert "."
This reverts commit 5089dfe51b.
2024-10-29 03:25:02 -07:00
Jarod Mica
5089dfe51b . 2024-10-29 09:48:17 +00:00
SWivid
f10ecad766 add ws icon link 2024-10-29 14:59:50 +08:00
SWivid
551857b268 minor fix. 2024-10-29 13:40:40 +08:00
SWivid
91881841dd fix. wider search for silence 2024-10-29 13:26:55 +08:00
SWivid
82d04be12a Update infer README.md 2024-10-29 13:18:04 +08:00
SWivid
85089a276b fix. change 18s threshold to 15s for clipping 2024-10-29 13:11:29 +08:00
SWivid
456456971b fix. proper clip for long infer ref_audio if no silence found 2024-10-29 12:50:55 +08:00
jpgallegoar
6d044f10b0 Merge pull request #308 from lpscr/main
finetune quick fix for the vocab take the vocab from the project
2024-10-28 20:37:11 +01:00
unknown
2dddb10c36 fix vocab file take from the project 2024-10-28 21:09:45 +02:00
lpscr
ec3c35b3e9 Merge branch 'SWivid:main' into main 2024-10-28 14:45:23 +02:00
lpscr
5d8180ea58 gradio stream output ! (#304)
* add stream output
2024-10-28 19:41:10 +08:00
unknown
41eb33c5c6 add stream output 2024-10-28 12:51:51 +02:00
lpscr
ae4ef3f3a7 Merge branch 'SWivid:main' into main 2024-10-28 12:49:02 +02:00
lpscr
700039b554 gradio finetune fix wrong value (#301)
* fix wrong value print vocab
2024-10-28 17:03:10 +08:00
unknown
5427f28a6d fix wrong value print vocab 2024-10-28 10:26:19 +02:00
lpscr
7f2a33127f Merge branch 'SWivid:main' into main 2024-10-28 10:24:33 +02:00
lpscr
a7fd2e7e9a finetune quick note for ema (#298)
* add note about ema
2024-10-28 13:09:37 +08:00
lpscr
a5dbb6e817 Merge branch 'SWivid:main' into main 2024-10-27 21:11:25 +02:00
unknown
9db5de651b add note about ema 2024-10-27 21:10:26 +02:00
lpscr
0641d5d9b3 in gradio finetune fix problem curse problem space in symbols (#296)
* fix space curse problem with utf-8-sig

* fix extend

* Do not overwrite the vocab if it already exists !

* add settings

* add settings

* add settings

* fix path

* change name make more clear the preetain need path
2024-10-28 03:09:26 +08:00
unknown
8c3810a66c change name make more clear the preetain need path 2024-10-27 20:48:45 +02:00
unknown
eb19d9d928 fix path 2024-10-27 20:39:04 +02:00
unknown
3af98f2a52 add settings 2024-10-27 20:21:12 +02:00
unknown
48e3eb1c57 add settings 2024-10-27 20:19:59 +02:00
unknown
0f2a9230ec add settings 2024-10-27 20:16:21 +02:00
unknown
2eae16b4a3 Do not overwrite the vocab if it already exists ! 2024-10-27 19:30:04 +02:00
unknown
0de2e531d4 fix extend 2024-10-27 19:14:04 +02:00
lpscr
7e758f4af0 Merge branch 'SWivid:main' into main 2024-10-27 18:02:14 +02:00
unknown
e1e3b26987 fix space curse problem with utf-8-sig 2024-10-27 18:01:46 +02:00
Yushen CHEN
1888fa4919 Merge pull request #294 from lpscr/main
add use_ema in model test for finetune gradio
2024-10-27 22:45:43 +08:00
unknown
d864425144 add use_ema in model test 2024-10-27 15:50:14 +02:00
lpscr
3941e8102f make happy all other language dont suport the symbols in vocab , now you can finetune by extend (#293)
* fix path

* change name

* change name

* fix path

* fix last per steps and add notes

* change order tab add note in vocab check tab

* add note in reduse checkpoint tab

* note in reduse checkpoint tab update

* extend vocab to train language miss symbols

* change enten to ,

* hide the option create new vocab , change order tab , add some info
2024-10-27 21:03:22 +08:00
jpgallegoar
2056f5de41 Format 2024-10-26 19:11:55 +02:00
jpgallegoar
a8bb34bde2 Format 2024-10-26 19:00:07 +02:00
jpgallegoar
4eb305418d Added Insert button on Multi-Speech tab 2024-10-26 18:58:43 +02:00
lpscr
c1a9986a15 gradio finetune fix last per step and add note (#284)
* fix path

* change name

* change name

* fix path

* fix last per steps and add notes

* change order tab add note in vocab check tab

* add note in reduse checkpoint tab

* note in reduse checkpoint tab update
2024-10-26 17:57:04 +08:00
Yushen CHEN
6c623447b8 Merge pull request #280 from justinjohn0306/main
Add speed control to F5-TTS inference CLI
2024-10-26 13:28:35 +08:00
Justin John
be090b0d0a Cleanup 2024-10-26 10:48:48 +05:30
Justin John
ed179067df Add speed control to F5-TTS inference CLI
- Added support for the --speed argument to control the speed of generated audio.
- Updated the CLI to accept a speed parameter with a default value of 1.0.
- Adjusted the infer_process to apply the speed factor during TTS generation.
2024-10-26 10:45:11 +05:30
Yushen CHEN
e963929b8e Merge pull request #277 from justinjohn0306/main
fix path in finetune_gradio for finetune-cli
2024-10-26 10:12:18 +08:00