F5-TTS

mirror of https://github.com/SWivid/F5-TTS.git synced 2025-12-30 06:31:54 -08:00

Author	SHA1	Message	Date
Yushen CHEN	0f9f878be1	Merge pull request #196 from thunn/add_pre_commit_tooling add and run pre-commit with ruff	2024-10-21 13:19:15 +08:00
Tom Hunn	a4ca14b5f6	add and run pre-commit with ruff	2024-10-21 14:46:45 +10:00
SWivid	77e00db01b	Use main voice if can't find voice tag or specified voice.	2024-10-21 11:53:44 +08:00
SWivid	2c0924378d	add sanity check ensuring mono audio input for training	2024-10-21 04:14:52 +08:00
SWivid	5600d9079a	minor fix.	2024-10-21 03:40:20 +08:00
SWivid	d3badb95cf	fp16 inference only for cuda devices now	2024-10-21 03:34:28 +08:00
SWivid	bd16a8c281	minor fix for hf space	2024-10-21 02:52:11 +08:00
SWivid	03a20e0258	reorganize inference scripts with shared funcs	2024-10-21 02:21:13 +08:00
SWivid	b4f81425f3	disable fp16 for cpu device	2024-10-20 22:45:54 +08:00
SWivid	073092d0d3	Merge branch 'main' of github.com:SWivid/F5-TTS into main	2024-10-20 20:43:25 +08:00
SWivid	28b46d32d3	rewrite without einx & einops; clean up	2024-10-20 20:41:24 +08:00
chigkim	765a2ae390	Load model once in the beginning.	2024-10-20 08:01:22 -04:00
SWivid	554f3189e1	Use default fp16 inference	2024-10-20 16:32:18 +08:00
SWivid	69850fa236	fix. address #179	2024-10-20 12:43:01 +08:00
SWivid	aaf1fa7efa	Update README.md	2024-10-20 12:32:58 +08:00
Zhikang Niu	f618db7290	Update README.md	2024-10-20 10:27:03 +08:00
Zhikang Niu	532fbe8f02	Merge pull request #166 from cocktailpeanut/wandb_usability User-friendly wandb support	2024-10-20 10:25:17 +08:00
chigkim	8831701897	REorganized cli output to be less verbose.	2024-10-19 18:28:31 -04:00
SWivid	a016d6f89c	fix address #178	2024-10-19 21:59:52 +08:00
Yushen CHEN	84cb6e5f00	Merge pull request #173 from lpscr/main add new args in interface-cli.py for pass model and vocab	2024-10-19 17:03:10 +08:00
unknown	60f1b31446	update read me for new arg	2024-10-19 11:29:59 +03:00
unknown	5663bac2a8	add new arg for vocab_file and ckpt_file to easy load any model	2024-10-19 11:24:31 +03:00
Zhikang Niu	925ce4b0dd	Update README.md	2024-10-19 12:31:54 +08:00
jpgallegoar	501a566dcf	Merge pull request #170 from lpscr/main fix max_sample	2024-10-19 00:40:16 +02:00
unknown	b87e46095b	fix max_sample	2024-10-19 00:14:13 +03:00
cocktailpeanut	ae6e97b836	user-friendly wandb support	2024-10-18 14:59:59 -04:00
Yushen CHEN	182b0f08e4	Merge pull request #149 from lpscr/main fix problem error about miss parametre in finetune-cli.py	2024-10-18 13:04:40 +08:00
Yushen CHEN	852fb3245a	Merge pull request #152 from SWivid/hf-spaces-fix Fix for HF Spaces demo	2024-10-18 13:02:21 +08:00
Zhikang Niu	5fa56825be	Create Dockerfile	2024-10-18 11:23:01 +08:00
unknown	66062f9916	replace python with accelerate	2024-10-18 01:18:22 +03:00
mrfakename	32cdd210a2	Fix for HF Spaces demo	2024-10-17 14:52:42 -07:00
unknown	34ccbcb451	cache all error messages and add support to fix mac gpu issues	2024-10-18 00:42:16 +03:00
lpscr	549ee89b74	Merge branch 'SWivid:main' into main	2024-10-17 22:44:36 +03:00
unknown	3f3743eda4	add finetune miss	2024-10-17 22:43:18 +03:00
Yushen CHEN	18f526dd25	Merge pull request #148 from jpgallegoar/main Increase number of speech types.	2024-10-18 02:56:06 +08:00
Yushen CHEN	12304bfafa	Merge pull request #147 from cocktailpeanut/main allow multiple audio files for finetune UI	2024-10-18 02:55:42 +08:00
jpgallegoar	8b150b04a4	Increase number of speech types.	2024-10-17 19:29:36 +02:00
cocktailpeanut	a77c244a16	allow multiple files	2024-10-17 13:13:33 -04:00
SWivid	39a513e9dd	Update README.md	2024-10-18 01:08:05 +08:00
Yushen CHEN	7dbdedf4a5	Merge pull request #146 from chigkim/multivoice Multivoice CLI Similar to Gradio App	2024-10-18 01:03:17 +08:00
Yushen CHEN	37d333c528	Update README.md	2024-10-18 00:52:42 +08:00
Yushen CHEN	c0bd2e091a	Update README.md	2024-10-18 00:50:31 +08:00
Yushen CHEN	cca3a69d01	Merge pull request #125 from lpscr/main gradio_finetune	2024-10-18 00:45:49 +08:00
Chi Kim	cfa9382a57	Multivoice	2024-10-17 12:08:46 -04:00
unknown	68718023ea	add new tab to check if it is possible to train in this language	2024-10-17 15:51:19 +03:00
unknown	ed0d78e0bf	auto settings and reduse new tab	2024-10-17 14:40:39 +03:00
Zhikang Niu	1a09b80482	Update README.md	2024-10-17 12:25:13 +08:00
unknown	bf14be8ea3	remove path test	2024-10-16 23:55:12 +03:00
unknown	44216b443e	update	2024-10-16 23:31:39 +03:00
Yushen CHEN	147fc2cfc0	Merge pull request #127 from kunibald413/main add csv wavs data prep script	2024-10-17 00:15:34 +08:00

1 2 3

143 Commits