SWivid
|
2a3deaab33
|
fix. update tgt_sr def for log_sample with new mel_spec module
|
2024-11-01 15:42:19 +08:00 |
|
SWivid
|
315230210d
|
minor fix
|
2024-11-01 15:11:48 +08:00 |
|
Yushen CHEN
|
305e3eab35
|
Merge pull request #345 from ZhikangNiu/main
[WIP]Support Bigvgan vocoder
|
2024-11-01 14:37:30 +08:00 |
|
ZhikangNiu
|
18e1ab508f
|
refactor: del global params and set vocos as default vocoder, add dtype check
|
2024-11-01 14:17:22 +08:00 |
|
SWivid
|
7a6fb0eb4e
|
fix address #348
|
2024-11-01 14:09:41 +08:00 |
|
ZhikangNiu
|
b180961782
|
refactor: more details about bigvgan, clear function definition
|
2024-11-01 11:02:39 +08:00 |
|
ZhikangNiu
|
36a4aad668
|
change some infer function to support two vocoder
|
2024-10-31 22:44:45 +08:00 |
|
ZhikangNiu
|
712d52772e
|
update Bigvgan vocoder and F5-bigvgan version, trained on Emilia ZH&EN, 1.25m updates
|
2024-10-31 20:06:36 +08:00 |
|
Yushen CHEN
|
dee0420b59
|
Merge pull request #334 from lpscr/main
small fix stuff
|
2024-10-31 11:44:58 +08:00 |
|
Yushen CHEN
|
6cbb548f9c
|
Update api.py
|
2024-10-31 11:39:23 +08:00 |
|
unknown
|
3dd59b8cdf
|
when ref_text empty automatic transcribing
|
2024-10-30 14:26:13 +02:00 |
|
unknown
|
02d59131c4
|
fix when none tts_api
|
2024-10-30 14:25:16 +02:00 |
|
Yushen CHEN
|
7dd3eecf2d
|
Merge pull request #333 from lpscr/main
Support more formats.
|
2024-10-30 18:28:07 +08:00 |
|
unknown
|
9c4fc38fa4
|
fix
|
2024-10-30 12:23:02 +02:00 |
|
unknown
|
77620f602f
|
suport more format
|
2024-10-30 11:41:42 +02:00 |
|
Yushen CHEN
|
739c4a1823
|
Merge pull request #332 from lpscr/main
gradio quick update suport for create metadata
|
2024-10-30 17:23:47 +08:00 |
|
unknown
|
6970556abf
|
fix logger in settings miss
|
2024-10-30 11:21:47 +02:00 |
|
unknown
|
513e6b466e
|
add note for more format
|
2024-10-30 11:14:39 +02:00 |
|
unknown
|
9eb0d1d226
|
add to suport audio name , audio.wav , full audio path , add note
|
2024-10-30 11:09:22 +02:00 |
|
unknown
|
e9419764d7
|
add to suport audio name , audio.wav , full audio path
|
2024-10-30 11:03:26 +02:00 |
|
SWivid
|
b7a1746638
|
Merge branch 'lpscr-main' into main
|
2024-10-30 03:57:16 +08:00 |
|
SWivid
|
aaa92f6e6d
|
finish trainer modification
|
2024-10-30 03:57:09 +08:00 |
|
SWivid
|
87c4f9ff06
|
minor fix
|
2024-10-30 03:23:22 +08:00 |
|
SWivid
|
381ea0c82c
|
fix vocoder loading
|
2024-10-30 03:16:09 +08:00 |
|
SWivid
|
da1b40968a
|
basic structure
|
2024-10-30 02:38:58 +08:00 |
|
SWivid
|
5b10099d33
|
Merge branch 'main' of github.com:lpscr/F5-TTS into lpscr-main
|
2024-10-29 23:36:58 +08:00 |
|
unknown
|
0be49f4967
|
flip axis y in mel
|
2024-10-29 17:35:42 +02:00 |
|
SWivid
|
2ccfbdcc61
|
Merge branch 'main' of github.com:lpscr/F5-TTS into lpscr-main
|
2024-10-29 23:34:00 +08:00 |
|
unknown
|
d5c307b56a
|
add logger
|
2024-10-29 17:30:28 +02:00 |
|
unknown
|
8a5041ef9f
|
update
|
2024-10-29 17:14:03 +02:00 |
|
unknown
|
886500ac97
|
update
|
2024-10-29 17:02:21 +02:00 |
|
unknown
|
2ca1fb7c25
|
update
|
2024-10-29 16:46:38 +02:00 |
|
unknown
|
3409192662
|
update
|
2024-10-29 16:44:23 +02:00 |
|
unknown
|
37eb3b50da
|
add tensorboard and add export sample for mel and audio
|
2024-10-29 14:20:50 +02:00 |
|
jpgallegoar
|
5f6dcc7e11
|
Merge pull request #316 from Jxspa/new_branch
Add text chat function
|
2024-10-29 13:12:39 +01:00 |
|
J
|
6667d6f501
|
Add optional text chat function
|
2024-10-29 10:59:42 +00:00 |
|
Jarod Mica
|
d601a70ad1
|
Revert "."
This reverts commit 5089dfe51b.
|
2024-10-29 03:25:02 -07:00 |
|
Jarod Mica
|
5089dfe51b
|
.
|
2024-10-29 09:48:17 +00:00 |
|
SWivid
|
f10ecad766
|
add ws icon link
|
2024-10-29 14:59:50 +08:00 |
|
SWivid
|
551857b268
|
minor fix.
|
2024-10-29 13:40:40 +08:00 |
|
SWivid
|
91881841dd
|
fix. wider search for silence
|
2024-10-29 13:26:55 +08:00 |
|
SWivid
|
82d04be12a
|
Update infer README.md
|
2024-10-29 13:18:04 +08:00 |
|
SWivid
|
85089a276b
|
fix. change 18s threshold to 15s for clipping
|
2024-10-29 13:11:29 +08:00 |
|
SWivid
|
456456971b
|
fix. proper clip for long infer ref_audio if no silence found
|
2024-10-29 12:50:55 +08:00 |
|
jpgallegoar
|
6d044f10b0
|
Merge pull request #308 from lpscr/main
finetune quick fix for the vocab take the vocab from the project
|
2024-10-28 20:37:11 +01:00 |
|
unknown
|
2dddb10c36
|
fix vocab file take from the project
|
2024-10-28 21:09:45 +02:00 |
|
lpscr
|
ec3c35b3e9
|
Merge branch 'SWivid:main' into main
|
2024-10-28 14:45:23 +02:00 |
|
lpscr
|
5d8180ea58
|
gradio stream output ! (#304)
* add stream output
|
2024-10-28 19:41:10 +08:00 |
|
unknown
|
41eb33c5c6
|
add stream output
|
2024-10-28 12:51:51 +02:00 |
|
lpscr
|
ae4ef3f3a7
|
Merge branch 'SWivid:main' into main
|
2024-10-28 12:49:02 +02:00 |
|