SWivid
f05ceda4cb
v1.0.2 fix: torch.utils.checkpoint.checkpoint add use_reentrant=False
2025-03-15 16:34:32 +08:00
Yushen CHEN
2bd39dd813
Merge pull request #859 from ZhikangNiu/main
...
fix #858 and pass use_reentrant explicitly in checkpoint_activation mode
2025-03-15 16:23:50 +08:00
ZhikangNiu
f017815083
fix #858 and pass use_reentrant explicitly in checkpoint_activation mode
2025-03-15 15:48:47 +08:00
Yushen CHEN
297755fac3
v1.0.1 VRAM usage management #851
2025-03-14 17:31:44 +08:00
Yushen CHEN
d05075205f
Merge pull request #851 from niknah/vram-usage
...
VRAM usage on long texts gradually uses up memory.
2025-03-14 17:25:56 +08:00
Yushen CHEN
8722cf0766
Update utils_infer.py
2025-03-14 17:23:20 +08:00
niknah
48d1a9312e
VRAM usage on long texts gradually uses up memory.
2025-03-14 16:53:58 +11:00
Yushen CHEN
128f4e4bf3
Update publish-pypi.yaml
1.0.0
2025-03-13 00:08:36 +08:00
SWivid
2695e9305d
v1.0.0 release
2025-03-12 23:47:04 +08:00
SWivid
69909ac167
update README.md
2025-03-12 18:40:07 +08:00
SWivid
79bbde5d76
update README.md add a glance of few demo
2025-03-12 18:37:14 +08:00
SWivid
bf651d541e
update README.md for v1.0.0
2025-03-12 17:39:30 +08:00
SWivid
ca6e49adaa
1.0.0 F5-TTS v1 base model with better training and inference performance
2025-03-12 17:23:10 +08:00
SWivid
09b478b7d7
0.6.2 support socket_server.py with general text chunk
0.6.2
2025-02-25 04:47:40 +08:00
SWivid
a72f2f8efb
0.6.1 fix tqdm func check with difference call behavior from gr.Progress()
2025-02-22 08:33:10 +08:00
Yushen CHEN
85e6c660b0
0.6.0 chunk stream support #803 from kunci115
...
chunk stream instead of the whole content process, to make it near realtime possibility
2025-02-21 21:45:07 +08:00
SWivid
c3d415e47a
merging into one infer_batch_process function
2025-02-21 21:41:19 +08:00
SWivid
7ee55d773c
formatting
2025-02-21 17:00:51 +08:00
kunci115
d68b1f304c
[add] new line after gc.collect()
2025-02-21 14:48:58 +07:00
kunci115
7c0eafe240
[add] client use on readme
2025-02-21 14:45:09 +07:00
rino
4ceba6dc24
This patch is to solve a problem where streaming will handle all of the client input
...
[add] numpy tokenizer for stream chunk
[add] infer_batch_process_stream in utils_infer
[add] file writter after streaming
[edit] adjustment for streaming server
[edit] data handling processes and sends chunk by chunk
[delete] threading on processing the inference, just for file writting
2025-02-21 14:35:01 +07:00
SWivid
d457c3e245
update readme. #784
2025-02-19 15:31:01 +08:00
SWivid
832ecf40b9
formatting, update readme
2025-02-19 08:35:13 +08:00
Yushen CHEN
6e49f3200c
Merge pull request #797 from YoungPhlo/feat/browser-autolaunch
...
feat: Add autolaunch option to Gradio interface
2025-02-19 08:21:41 +08:00
Phlo
fea67815ae
docs: Update README with autolaunch Gradio interface option
2025-02-18 12:50:26 -06:00
Phlo
3342859c04
feat: Add autolaunch option to Gradio interface
2025-02-18 12:29:21 -06:00
SWivid
5fa0479432
0.5.3 fix MPS device compatibility; update readme
2025-02-18 18:42:03 +08:00
Yushen CHEN
e40d4462d2
Merge pull request #796 from YoungPhlo/fix/mps-fallback
...
fix: typo in MPS PyTorch env variable
2025-02-18 18:15:16 +08:00
Phlo
f005f1565e
fix: typo in MPS PyTorch env variable
2025-02-18 03:28:44 -06:00
Yushen CHEN
818d9b8476
Merge pull request #786 from fakerybakery/hf-demo-upd
...
Add link back to GitHub repo, clarify local demo
2025-02-15 05:01:45 +08:00
mrfakename
71ad071c1e
Update Gradio app
2025-02-14 12:44:52 -08:00
SWivid
0923b76d79
update README.md, add nvidia device gradio infer docker compose file example
2025-02-13 02:07:24 +08:00
SWivid
f062403353
0.5.2 Improve prepare_csv_wavs.py from @hcsolakoglu
2025-02-09 14:36:40 +08:00
Yushen CHEN
5fbcbac6a3
Merge pull request #772 from hcsolakoglu/improve-prepare-csv-wavs
...
Improve prepare_csv_wavs.py
2025-02-09 14:34:11 +08:00
Hasan Can Solakoğlu
eebe337625
Increase batch size for text conversion from 32 to 100
2025-02-07 22:40:16 +03:00
Hasan Can Solakoğlu
0291ac17d2
Fix code formatting
2025-02-07 22:37:00 +03:00
Hasan Can Solakoğlu
bec4ebcae5
Enhance CSV preparation script to preserve order of processed audio files in chunk submissions
2025-02-07 22:35:30 +03:00
Hasan Can Solakoğlu
a9d6509a06
Enhance CSV preparation script with customizable worker count and improved usage examples
2025-02-07 22:32:42 +03:00
Hasan Can Solakoğlu
e7496d0170
Enhance audio processing with concurrent execution and graceful shutdown handling
2025-02-07 22:13:13 +03:00
Hasan Can Solakoğlu
34d94af2a8
Enhance audio duration extraction with ffprobe fallback and error handling
2025-02-07 20:38:42 +03:00
SWivid
261b2774f2
0.5.1 Enhance DynamicBatchSampler to support epoch-based shuffling
2025-02-05 15:12:34 +08:00
Yushen CHEN
906b1af925
Merge pull request #765 from hcsolakoglu/dynbatchsampler-epoch-shuffle
...
Add Per-Epoch Batch Shuffling to DynamicBatchSampler
2025-02-05 15:10:08 +08:00
Can
33e865120c
Refactor imports and improve code formatting in dataset and trainer modules
2025-02-04 22:20:42 +03:00
Can
93ae7d3fc8
Enhance DynamicBatchSampler to support epoch-based shuffling
2025-02-04 20:21:59 +03:00
Hasan Can
bebbfbb916
Fix for incorrect defaults in the finetune_gradio interface ( #755 )
...
* Add missing components to setup_load_settings in finetune_gradio
2025-01-29 17:25:22 +08:00
unknown
f0996492a7
0.5.0 fix grad_accum bug from 0.4.0, #715 #728
2025-01-29 15:18:02 +08:00
unknown
0d95df4a4d
0.4.6 minor fixes for finetune-gradio -cli
2025-01-29 00:06:10 +08:00
Yushen CHEN
738d502f3b
Merge pull request #751 from hcsolakoglu/fix-finetune-gradio-dropdown
...
Small fix for the checkpoint dropdown menu in finetune gradio
2025-01-28 22:52:25 +08:00
Hasan Can Solakoğlu
f8cc2446c8
Fix for the checkpoint dropdown menu
2025-01-28 15:25:14 +03:00
unknown
607b92b391
0.4.5 fix extremely short case that lengths of text_seq > audio_seq, causing wrong cond_mask
2025-01-28 12:38:16 +08:00