F5-TTS

mirror of https://github.com/SWivid/F5-TTS.git synced 2026-01-08 19:33:49 -08:00

Author	SHA1	Message	Date
Yushen CHEN	eb84cbab0f	Merge pull request #487 from mame82/issue-480 Fixes both issues from #480	2024-11-19 15:39:39 +08:00
mame82	7895721ff8	Fixes both issues from #480	2024-11-18 20:13:25 +01:00
SWivid	194bf1e853	formatting	2024-11-18 22:33:13 +08:00
Yushen CHEN	058b4461be	Merge pull request #485 from atlonxp/main Update dataset.py, change recursive __getitem__ to while loop; fix finetune_gradio.py #480	2024-11-18 22:31:36 +08:00
Yushen CHEN	5f7944a748	Update dataset.py, formatting	2024-11-18 22:28:03 +08:00
atlonxp	07b100e96f	Update get_random_sample_transcribe when absolute path is used in finetune_gradio.py Update `get_random_sample_transcribe` to handle absolute paths in `finetune_gradio.py` - Added support for processing absolute file paths during transcription. - Updated logic to correctly handle and return audio file paths with absolute URLs. - Example use case: URLs like `https://localhost:55555/speech/150/150896/150-150896-40.wav.wav`. - Correct one should be: `https://localhost:55555/speech/150/150896/150-150896-40.wav`.	2024-11-18 21:08:49 +07:00
atlonxp	14e923a427	Update dataset.py change recursive approach to while loop, avoiding potential memory leak.	2024-11-18 20:53:47 +07:00
Yushen CHEN	5cc02536a6	Merge pull request #473 from tjb4578/bugfix/socket-server-repeated-final-chunk Removed redundant final chunk logic from socket_server.py	2024-11-18 16:25:10 +08:00
SWivid	84db0029ce	fix. 476#issuecomment-2481877878	2024-11-18 13:16:25 +08:00
SWivid	cb8ce3306d	update. compatibility with mps device #477 thanks to @aboutmydreams	2024-11-17 18:57:28 +08:00
SWivid	0f80f25c5f	minor fix. speech_edit & eval_infer_batch	2024-11-17 06:25:07 +08:00
SWivid	4b9441f8e0	minor update	2024-11-17 05:54:57 +08:00
Yushen CHEN	333d99ab6c	Merge pull request #476 from lpscr/main add in gradio finetune 8bit value fix some stuff and add new transcripe into easy	2024-11-17 00:46:38 +08:00
unknown	96946f85fa	import transcribe from utils_infer	2024-11-16 18:39:52 +02:00
unknown	c4d7252cf8	remove the get_nested_value no use	2024-11-16 18:34:20 +02:00
unknown	bb4d538dc5	add transcribe function	2024-11-16 18:26:42 +02:00
unknown	e636d98090	add transcribe function	2024-11-16 18:18:42 +02:00
unknown	96e24d3605	add 8bit and fix some value	2024-11-16 18:17:51 +02:00
unknown	1bbec4ac7e	add 8bit and fix some value	2024-11-16 18:06:45 +02:00
Tyler Banks	829d265ee1	Removed redundant final chunk logic Handle socket server edge case	2024-11-15 23:33:04 -05:00
SWivid	2a844ae064	minor update patch-1	2024-11-15 19:15:34 +08:00
SWivid	6f13ad4349	minor update	2024-11-15 18:50:35 +08:00
Yushen CHEN	ecbf6447dd	Merge pull request #469 from JarodMica/main_repo_update Allow for local path specification of HF models/repos	2024-11-15 18:22:06 +08:00
Jarod Mica	334367d070	Add JA model to SHARED.md	2024-11-15 02:10:33 -08:00
Jarod Mica	929b5ae313	run pre-commit	2024-11-15 01:50:20 -08:00
Jarod Mica	d1d8139bab	Allow for local path specification of HF models/repos	2024-11-15 01:46:26 -08:00
Yushen CHEN	8f65f9f3e4	Update SHARED.md	2024-11-14 00:14:28 +08:00
SWivid	159797d7c4	formatting	2024-11-14 00:10:42 +08:00
Yushen CHEN	944d32862e	Merge pull request #462 from RASPIAUDIO/patch-1 Shared Model Cards for french from @RASPIAUDIO	2024-11-13 23:26:05 +08:00
RASPIAUDIO	da00b90aae	Update SHARED.md	2024-11-13 16:24:21 +01:00
RASPIAUDIO	e995c123e2	ressources added	2024-11-13 15:35:18 +01:00
RASPIAUDIO	4577644f5c	added french	2024-11-13 15:30:22 +01:00
SWivid	be53fb179c	fix. force reload from custom cache file to override infer_gradio init loaded value in case webpage refreshed	2024-11-11 14:46:50 +08:00
Yushen CHEN	da059e86b2	Update SHARED.md; typo	2024-11-11 11:20:11 +08:00
SWivid	a23ec25b39	v0.1.1	2024-11-11 11:19:04 +08:00
SWivid	59761ec9f8	Update. Cache last used custom model path #447	2024-11-11 11:13:11 +08:00
Yushen CHEN	e6e7bd01d3	Merge pull request #448 from peanutcocktail/main Set PYTORCH_ENABLE_MPS_FALLBACK=1 for MPS by default	2024-11-11 07:11:19 +08:00
cocktailpeanut	5635eac12d	PYTORCH_ENABLE_MPS_FALLBACK	2024-11-10 15:36:22 -05:00
SWivid	3fcdbc70b4	minor update	2024-11-09 04:54:06 +08:00
SWivid	dcd9a19889	v0.1.0. Add custom model support for local deploy; add share model cards, etc.	2024-11-09 04:35:44 +08:00
SWivid	23409af801	Update infer-gradio with ref_text auto-filling; minor fix	2024-11-09 01:14:54 +08:00
SWivid	c33a83c009	fix. better asr_transcript cache logic; global model select init.	2024-11-08 20:57:47 +08:00
Yushen CHEN	e78ae2ce92	Create config.yml	2024-11-07 14:23:52 +08:00
Yushen CHEN	da8853838f	Update question.yml	2024-11-07 13:23:35 +08:00
Yushen CHEN	8a7e8495ff	Merge pull request #401 from hcsolakoglu/fix-finetune-cli-bnb-optimizer Add --bnb_optimizer argument to CLI and pass it to Trainer initialization	2024-11-05 20:20:31 +08:00
Yushen CHEN	dbe35da754	Update finetune_cli.py; formatting	2024-11-05 20:19:53 +08:00
hcsolakoglu	2033993f57	Add --bnb_optimizer argument to CLI and pass it to Trainer initialization Add `--bnb_optimizer` argument to CLI and pass it to Trainer initialization. * Add `--bnb_optimizer` argument to `parse_args()` function in `src/f5_tts/train/finetune_cli.py`. * Pass `bnb_optimizer` argument to `Trainer` initialization in the `main()` function of `src/f5_tts/train/finetune_cli.py`.	2024-11-05 15:11:37 +03:00
SWivid	4a69e6bad2	formatting	2024-11-05 00:02:21 +08:00
Yushen CHEN	8718b1d821	Merge pull request #394 from lpscr/main Fix the glitch effect at the beginning audio	2024-11-05 00:01:48 +08:00
Yushen CHEN	2f914140f4	Update utils_infer.py	2024-11-05 00:00:29 +08:00

1 2 3 4 5 ...

359 Commits