F5-TTS

mirror of https://github.com/SWivid/F5-TTS.git synced 2026-07-28 21:35:22 -07:00

Author	SHA1	Message	Date
SWivid	f05ceda4cb	v1.0.2 fix: torch.utils.checkpoint.checkpoint add use_reentrant=False	2025-03-15 16:34:32 +08:00
Yushen CHEN and GitHub	2bd39dd813	Merge pull request #859 from ZhikangNiu/main fix #858 and pass use_reentrant explicitly in checkpoint_activation mode	2025-03-15 16:23:50 +08:00
ZhikangNiu	f017815083	fix #858 and pass use_reentrant explicitly in checkpoint_activation mode	2025-03-15 15:48:47 +08:00
Yushen CHEN and GitHub	297755fac3	v1.0.1 VRAM usage management #851	2025-03-14 17:31:44 +08:00
Yushen CHEN and GitHub	d05075205f	Merge pull request #851 from niknah/vram-usage VRAM usage on long texts gradually uses up memory.	2025-03-14 17:25:56 +08:00
Yushen CHEN and GitHub	8722cf0766	Update utils_infer.py	2025-03-14 17:23:20 +08:00
niknah	48d1a9312e	VRAM usage on long texts gradually uses up memory.	2025-03-14 16:53:58 +11:00
Yushen CHEN and GitHub	128f4e4bf3	Update publish-pypi.yaml 1.0.0	2025-03-13 00:08:36 +08:00
SWivid	2695e9305d	v1.0.0 release	2025-03-12 23:47:04 +08:00
SWivid	69909ac167	update README.md	2025-03-12 18:40:07 +08:00
SWivid	79bbde5d76	update README.md add a glance of few demo	2025-03-12 18:37:14 +08:00
SWivid	bf651d541e	update README.md for v1.0.0	2025-03-12 17:39:30 +08:00
SWivid	ca6e49adaa	1.0.0 F5-TTS v1 base model with better training and inference performance	2025-03-12 17:23:10 +08:00
SWivid	09b478b7d7	0.6.2 support socket_server.py with general text chunk 0.6.2	2025-02-25 04:47:40 +08:00
SWivid	a72f2f8efb	0.6.1 fix tqdm func check with difference call behavior from gr.Progress()	2025-02-22 08:33:10 +08:00
Yushen CHEN and GitHub	85e6c660b0	0.6.0 chunk stream support #803 from kunci115 chunk stream instead of the whole content process, to make it near realtime possibility	2025-02-21 21:45:07 +08:00
SWivid	c3d415e47a	merging into one infer_batch_process function	2025-02-21 21:41:19 +08:00
SWivid	7ee55d773c	formatting	2025-02-21 17:00:51 +08:00
kunci115	d68b1f304c	[add] new line after gc.collect()	2025-02-21 14:48:58 +07:00
kunci115	7c0eafe240	[add] client use on readme	2025-02-21 14:45:09 +07:00
rino	4ceba6dc24	This patch is to solve a problem where streaming will handle all of the client input [add] numpy tokenizer for stream chunk [add] infer_batch_process_stream in utils_infer [add] file writter after streaming [edit] adjustment for streaming server [edit] data handling processes and sends chunk by chunk [delete] threading on processing the inference, just for file writting	2025-02-21 14:35:01 +07:00
SWivid	d457c3e245	update readme. #784	2025-02-19 15:31:01 +08:00
SWivid	832ecf40b9	formatting, update readme	2025-02-19 08:35:13 +08:00
Yushen CHEN and GitHub	6e49f3200c	Merge pull request #797 from YoungPhlo/feat/browser-autolaunch feat: Add autolaunch option to Gradio interface	2025-02-19 08:21:41 +08:00
Phlo	fea67815ae	docs: Update README with autolaunch Gradio interface option	2025-02-18 12:50:26 -06:00
Phlo	3342859c04	feat: Add autolaunch option to Gradio interface	2025-02-18 12:29:21 -06:00
SWivid	5fa0479432	0.5.3 fix MPS device compatibility; update readme	2025-02-18 18:42:03 +08:00
Yushen CHEN and GitHub	e40d4462d2	Merge pull request #796 from YoungPhlo/fix/mps-fallback fix: typo in MPS PyTorch env variable	2025-02-18 18:15:16 +08:00
Phlo	f005f1565e	fix: typo in MPS PyTorch env variable	2025-02-18 03:28:44 -06:00
Yushen CHEN and GitHub	818d9b8476	Merge pull request #786 from fakerybakery/hf-demo-upd Add link back to GitHub repo, clarify local demo	2025-02-15 05:01:45 +08:00
mrfakename and GitHub	71ad071c1e	Update Gradio app	2025-02-14 12:44:52 -08:00
SWivid	0923b76d79	update README.md, add nvidia device gradio infer docker compose file example	2025-02-13 02:07:24 +08:00
SWivid	f062403353	0.5.2 Improve prepare_csv_wavs.py from @hcsolakoglu	2025-02-09 14:36:40 +08:00
Yushen CHEN and GitHub	5fbcbac6a3	Merge pull request #772 from hcsolakoglu/improve-prepare-csv-wavs Improve prepare_csv_wavs.py	2025-02-09 14:34:11 +08:00
Hasan Can Solakoğlu	eebe337625	Increase batch size for text conversion from 32 to 100	2025-02-07 22:40:16 +03:00
Hasan Can Solakoğlu	0291ac17d2	Fix code formatting	2025-02-07 22:37:00 +03:00
Hasan Can Solakoğlu	bec4ebcae5	Enhance CSV preparation script to preserve order of processed audio files in chunk submissions	2025-02-07 22:35:30 +03:00
Hasan Can Solakoğlu	a9d6509a06	Enhance CSV preparation script with customizable worker count and improved usage examples	2025-02-07 22:32:42 +03:00
Hasan Can Solakoğlu	e7496d0170	Enhance audio processing with concurrent execution and graceful shutdown handling	2025-02-07 22:13:13 +03:00
Hasan Can Solakoğlu	34d94af2a8	Enhance audio duration extraction with ffprobe fallback and error handling	2025-02-07 20:38:42 +03:00
SWivid	261b2774f2	0.5.1 Enhance DynamicBatchSampler to support epoch-based shuffling	2025-02-05 15:12:34 +08:00
Yushen CHEN and GitHub	906b1af925	Merge pull request #765 from hcsolakoglu/dynbatchsampler-epoch-shuffle Add Per-Epoch Batch Shuffling to DynamicBatchSampler	2025-02-05 15:10:08 +08:00
Can	33e865120c	Refactor imports and improve code formatting in dataset and trainer modules	2025-02-04 22:20:42 +03:00
Can	93ae7d3fc8	Enhance DynamicBatchSampler to support epoch-based shuffling	2025-02-04 20:21:59 +03:00
Hasan Can and GitHub	bebbfbb916	Fix for incorrect defaults in the finetune_gradio interface (#755) * Add missing components to setup_load_settings in finetune_gradio	2025-01-29 17:25:22 +08:00
unknown	f0996492a7	0.5.0 fix grad_accum bug from 0.4.0, #715 #728	2025-01-29 15:18:02 +08:00
unknown	0d95df4a4d	0.4.6 minor fixes for finetune-gradio -cli	2025-01-29 00:06:10 +08:00
Yushen CHEN and GitHub	738d502f3b	Merge pull request #751 from hcsolakoglu/fix-finetune-gradio-dropdown Small fix for the checkpoint dropdown menu in finetune gradio	2025-01-28 22:52:25 +08:00
Hasan Can Solakoğlu	f8cc2446c8	Fix for the checkpoint dropdown menu	2025-01-28 15:25:14 +03:00
unknown	607b92b391	0.4.5 fix extremely short case that lengths of text_seq > audio_seq, causing wrong cond_mask	2025-01-28 12:38:16 +08:00
