643 Commits

Author SHA1 Message Date
SWivid
09b478b7d7 0.6.2 support socket_server.py with general text chunk 0.6.2 2025-02-25 04:47:40 +08:00
SWivid
a72f2f8efb 0.6.1 fix tqdm func check with difference call behavior from gr.Progress() 2025-02-22 08:33:10 +08:00
Yushen CHEN
85e6c660b0 0.6.0 chunk stream support #803 from kunci115
chunk stream instead of the whole content process, to make it near realtime possibility
2025-02-21 21:45:07 +08:00
SWivid
c3d415e47a merging into one infer_batch_process function 2025-02-21 21:41:19 +08:00
SWivid
7ee55d773c formatting 2025-02-21 17:00:51 +08:00
kunci115
d68b1f304c [add] new line after gc.collect() 2025-02-21 14:48:58 +07:00
kunci115
7c0eafe240 [add] client use on readme 2025-02-21 14:45:09 +07:00
rino
4ceba6dc24 This patch is to solve a problem where streaming will handle all of the client input
[add] numpy tokenizer for stream chunk
[add] infer_batch_process_stream in utils_infer
[add] file writter after streaming

[edit] adjustment for streaming server
[edit] data handling processes and sends chunk by chunk
[delete] threading on processing the inference, just for file writting
2025-02-21 14:35:01 +07:00
SWivid
d457c3e245 update readme. #784 2025-02-19 15:31:01 +08:00
SWivid
832ecf40b9 formatting, update readme 2025-02-19 08:35:13 +08:00
Yushen CHEN
6e49f3200c Merge pull request #797 from YoungPhlo/feat/browser-autolaunch
feat: Add autolaunch option to Gradio interface
2025-02-19 08:21:41 +08:00
Phlo
fea67815ae docs: Update README with autolaunch Gradio interface option 2025-02-18 12:50:26 -06:00
Phlo
3342859c04 feat: Add autolaunch option to Gradio interface 2025-02-18 12:29:21 -06:00
SWivid
5fa0479432 0.5.3 fix MPS device compatibility; update readme 2025-02-18 18:42:03 +08:00
Yushen CHEN
e40d4462d2 Merge pull request #796 from YoungPhlo/fix/mps-fallback
fix: typo in MPS PyTorch env variable
2025-02-18 18:15:16 +08:00
Phlo
f005f1565e fix: typo in MPS PyTorch env variable 2025-02-18 03:28:44 -06:00
Yushen CHEN
818d9b8476 Merge pull request #786 from fakerybakery/hf-demo-upd
Add link back to GitHub repo, clarify local demo
2025-02-15 05:01:45 +08:00
mrfakename
71ad071c1e Update Gradio app 2025-02-14 12:44:52 -08:00
SWivid
0923b76d79 update README.md, add nvidia device gradio infer docker compose file example 2025-02-13 02:07:24 +08:00
SWivid
f062403353 0.5.2 Improve prepare_csv_wavs.py from @hcsolakoglu 2025-02-09 14:36:40 +08:00
Yushen CHEN
5fbcbac6a3 Merge pull request #772 from hcsolakoglu/improve-prepare-csv-wavs
Improve prepare_csv_wavs.py
2025-02-09 14:34:11 +08:00
Hasan Can Solakoğlu
eebe337625 Increase batch size for text conversion from 32 to 100 2025-02-07 22:40:16 +03:00
Hasan Can Solakoğlu
0291ac17d2 Fix code formatting 2025-02-07 22:37:00 +03:00
Hasan Can Solakoğlu
bec4ebcae5 Enhance CSV preparation script to preserve order of processed audio files in chunk submissions 2025-02-07 22:35:30 +03:00
Hasan Can Solakoğlu
a9d6509a06 Enhance CSV preparation script with customizable worker count and improved usage examples 2025-02-07 22:32:42 +03:00
Hasan Can Solakoğlu
e7496d0170 Enhance audio processing with concurrent execution and graceful shutdown handling 2025-02-07 22:13:13 +03:00
Hasan Can Solakoğlu
34d94af2a8 Enhance audio duration extraction with ffprobe fallback and error handling 2025-02-07 20:38:42 +03:00
SWivid
261b2774f2 0.5.1 Enhance DynamicBatchSampler to support epoch-based shuffling 2025-02-05 15:12:34 +08:00
Yushen CHEN
906b1af925 Merge pull request #765 from hcsolakoglu/dynbatchsampler-epoch-shuffle
Add Per-Epoch Batch Shuffling to DynamicBatchSampler
2025-02-05 15:10:08 +08:00
Can
33e865120c Refactor imports and improve code formatting in dataset and trainer modules 2025-02-04 22:20:42 +03:00
Can
93ae7d3fc8 Enhance DynamicBatchSampler to support epoch-based shuffling 2025-02-04 20:21:59 +03:00
Hasan Can
bebbfbb916 Fix for incorrect defaults in the finetune_gradio interface (#755)
* Add missing components to setup_load_settings in finetune_gradio
2025-01-29 17:25:22 +08:00
unknown
f0996492a7 0.5.0 fix grad_accum bug from 0.4.0, #715 #728 2025-01-29 15:18:02 +08:00
unknown
0d95df4a4d 0.4.6 minor fixes for finetune-gradio -cli 2025-01-29 00:06:10 +08:00
Yushen CHEN
738d502f3b Merge pull request #751 from hcsolakoglu/fix-finetune-gradio-dropdown
Small fix for the checkpoint dropdown menu in finetune gradio
2025-01-28 22:52:25 +08:00
Hasan Can Solakoğlu
f8cc2446c8 Fix for the checkpoint dropdown menu 2025-01-28 15:25:14 +03:00
unknown
607b92b391 0.4.5 fix extremely short case that lengths of text_seq > audio_seq, causing wrong cond_mask 2025-01-28 12:38:16 +08:00
unknown
ee2b77064e 0.4.4 fix hard coded stdout for finetune-gradio gui 2025-01-28 11:39:54 +08:00
unknown
d1f708d442 0.4.3 Bug-fixes for finetune-gradio component mismatch & checkpoint loading error loop 2025-01-27 21:22:29 +08:00
Yushen CHEN
1e7d6da992 Merge pull request #746 from mwzkhalil/patch-1
Update finetune_gradio.py, set weights_only=True
2025-01-27 21:12:14 +08:00
Yushen CHEN
c2cf31e0c5 Merge pull request #729 from hcsolakoglu/fix-ckpt-rotation
Exclude pretrained models from the checkpoint rotation logic
2025-01-27 19:57:05 +08:00
Yushen CHEN
46266f1d14 Merge pull request #741 from Chiyan200/main
Fix Settings Loader Issues: Resolve KeyErrors, Path Handling, and Component Assignment (#731)
2025-01-27 19:28:22 +08:00
mahwiz khalil
c54f4e7fc0 Update finetune_gradio.py
The safest approach here is to explicitly set weights_only=True to load only the model weights and avoid executing potentially unsafe code
2025-01-24 00:31:53 -08:00
Zhikang Niu
129014c5b4 Merge pull request #735 from DDXDB/main
Added intel XPU support
2025-01-22 12:52:00 +08:00
98440
6fb913fadf Merge remote-tracking branch 'origin/main'
# Conflicts:
#	src/f5_tts/api.py
#	src/f5_tts/infer/speech_edit.py
#	src/f5_tts/infer/utils_infer.py
#	src/f5_tts/socket_server.py
#	src/f5_tts/train/finetune_gradio.py
2025-01-22 03:36:55 +08:00
98440
964064094a Added intel XPU support 2025-01-22 03:36:10 +08:00
[Chiyan200]
24fe39dc3c Fix : Settings Loader Issues: Resolve KeyErrors, Path Handling, and Component Assignment (#731) 2025-01-22 00:07:34 +05:30
[Chiyan200]
a74d0d0f83 Fix: Robust settings loader to handle missing keys, incorrect file paths, and dynamic assignment
- Ensured default settings are properly merged with file-based settings to prevent KeyErrors.
- Added logic to handle _pinyin and _char suffixes in project names, ensuring correct file paths.
- Implemented tuple-based ordered mapping for consistent and error-free component assignment.
- Added safety check to verify the existence of setting.json before loading.
- Improved maintainability by centralizing default settings and enhancing error handling.
2025-01-21 23:17:59 +05:30
98440
81ce1d8670 Added intel XPU support 2025-01-20 00:47:57 +08:00
Hasan Can Solakoğlu
2d27d2c1b2 Exclude pretrained models from the checkpoint rotation logic 2025-01-17 19:35:19 +03:00