F5-TTS

mirror of https://github.com/SWivid/F5-TTS.git synced 2026-01-02 08:11:02 -08:00

Author	SHA1	Message	Date
Yushen CHEN	b03e9b2952	Merge pull request #389 from kunci115/main Bug fix	2024-11-04 17:12:23 +08:00
Yushen CHEN	c1c20ed009	Update socket_server.py, to pass format check	2024-11-04 17:11:29 +08:00
Rino	24cfa9ecb9	Update README.md	2024-11-04 15:50:15 +07:00
Rino	c129dd7ba4	Rename socket.py to socket_server.py [bug fix] due to circular import, can't use socket as file name	2024-11-04 15:48:09 +07:00
Rino	a83e764110	Update socket.py [edit] adjusting mel_spec_type on load_model use case	2024-11-04 15:46:00 +07:00
SWivid	ac77a76cd3	add issue templates	2024-11-04 02:17:52 +08:00
SWivid	61ff2a62d9	formatting #363 , credit to @JarodMica, also dur_pred check fork repo	2024-11-03 16:37:47 +08:00
Yushen CHEN	1085b73f59	Merge pull request #354 from kunci115/main [add] socket stream	2024-11-03 16:24:57 +08:00
Rino	6e24f1ea78	Merge branch 'SWivid:main' into main	2024-11-03 11:40:25 +07:00
SWivid	ea90244d62	fix. add dtype check for asr pipeline addressing #356	2024-11-02 13:48:37 +08:00
SWivid	f7e248e2ce	formatting	2024-11-02 12:58:28 +08:00
Zhikang Niu	dc67a6819c	Merge pull request #367 from justinjohn0306/main Ensure tensors are moved to CPU before saving with torchaudio	2024-11-02 11:15:41 +08:00
Rino	0fe34a862c	Merge branch 'SWivid:main' into main	2024-11-02 01:54:18 +07:00
Justin John	183ad09084	Ensure tensors are moved to CPU before saving with torchaudio	2024-11-01 23:47:00 +05:30
SWivid	b0f482421b	fix-patch-2 for #361	2024-11-01 19:14:46 +08:00
Rino	e12fe350f5	Merge branch 'SWivid:main' into main	2024-11-01 17:44:04 +07:00
SWivid	c370c81897	Merge branch 'main' of github.com:SWivid/F5-TTS into main	2024-11-01 18:43:30 +08:00
SWivid	0622087b82	add backward compatibility addressing #361	2024-11-01 18:43:02 +08:00
Yushen CHEN	11d2886e47	Merge pull request #359 from lpscr/main small update gradio finetune	2024-11-01 18:22:46 +08:00
Yushen CHEN	b664bc7777	Update finetune_gradio.py	2024-11-01 18:20:39 +08:00
Yushen CHEN	552c0fd99c	Update prepare_csv_wavs.py	2024-11-01 18:17:57 +08:00
unknown	27d98a52cd	clear pipe	2024-11-01 12:08:47 +02:00
lpscr	9984a48041	Merge branch 'SWivid:main' into main	2024-11-01 12:06:52 +02:00
unknown	199c56c23c	clear pipe	2024-11-01 12:05:12 +02:00
unknown	f7a698bc2f	resample when need	2024-11-01 11:39:06 +02:00
unknown	5af195f1f9	only mono duraction fix value bfp16 to bf16	2024-11-01 11:18:05 +02:00
Rino	db902761c3	Update socket.py [remove] unused var	2024-11-01 15:14:52 +07:00
Rino	e8f14072ec	Merge branch 'SWivid:main' into main	2024-11-01 15:06:49 +07:00
SWivid	2a3deaab33	fix. update tgt_sr def for log_sample with new mel_spec module	2024-11-01 15:42:19 +08:00
SWivid	315230210d	minor fix	2024-11-01 15:11:48 +08:00
Yushen CHEN	305e3eab35	Merge pull request #345 from ZhikangNiu/main [WIP]Support Bigvgan vocoder	2024-11-01 14:37:30 +08:00
ZhikangNiu	18e1ab508f	refactor: del global params and set vocos as default vocoder, add dtype check	2024-11-01 14:17:22 +08:00
Rino	8713de5ffe	Merge branch 'SWivid:main' into main	2024-11-01 13:11:20 +07:00
SWivid	7a6fb0eb4e	fix address #348	2024-11-01 14:09:41 +08:00
Rino	561c67387d	Update socket.py [edit] socket.py calling vocab, ckpt, ref audio	2024-11-01 11:22:03 +07:00
Rino Alfian	1c8fe499ef	Update README.md socket client realtime stream	2024-11-01 10:13:36 +07:00
Rino Alfian	1e3fac8f5e	[add] socket.py to play stream socket mode	2024-11-01 10:10:31 +07:00
ZhikangNiu	b180961782	refactor: more details about bigvgan, clear function definition	2024-11-01 11:02:39 +08:00
ZhikangNiu	36a4aad668	change some infer function to support two vocoder	2024-10-31 22:44:45 +08:00
ZhikangNiu	712d52772e	update Bigvgan vocoder and F5-bigvgan version, trained on Emilia ZH&EN, 1.25m updates	2024-10-31 20:06:36 +08:00
Yushen CHEN	dee0420b59	Merge pull request #334 from lpscr/main small fix stuff	2024-10-31 11:44:58 +08:00
Yushen CHEN	6cbb548f9c	Update api.py	2024-10-31 11:39:23 +08:00
unknown	3dd59b8cdf	when ref_text empty automatic transcribing	2024-10-30 14:26:13 +02:00
unknown	02d59131c4	fix when none tts_api	2024-10-30 14:25:16 +02:00
Yushen CHEN	7dd3eecf2d	Merge pull request #333 from lpscr/main Support more formats.	2024-10-30 18:28:07 +08:00
unknown	9c4fc38fa4	fix	2024-10-30 12:23:02 +02:00
unknown	77620f602f	suport more format	2024-10-30 11:41:42 +02:00
Yushen CHEN	739c4a1823	Merge pull request #332 from lpscr/main gradio quick update suport for create metadata	2024-10-30 17:23:47 +08:00
unknown	6970556abf	fix logger in settings miss	2024-10-30 11:21:47 +02:00
unknown	513e6b466e	add note for more format	2024-10-30 11:14:39 +02:00

308 Commits