Commit Graph

28 Commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Tom Hunn | a4ca14b5f6 | add and run pre-commit with ruff | 2024-10-21 14:46:45 +10:00 |
| SWivid | 2c0924378d | add sanity check ensuring mono audio input for training | 2024-10-21 04:14:52 +08:00 |
| SWivid | 5600d9079a | minor fix. | 2024-10-21 03:40:20 +08:00 |
| SWivid | d3badb95cf | fp16 inference only for cuda devices now | 2024-10-21 03:34:28 +08:00 |
| SWivid | 03a20e0258 | reorganize inference scripts with shared funcs | 2024-10-21 02:21:13 +08:00 |
| SWivid | b4f81425f3 | disable fp16 for cpu device | 2024-10-20 22:45:54 +08:00 |
| SWivid | 28b46d32d3 | rewrite without einx & einops; clean up | 2024-10-20 20:41:24 +08:00 |
| SWivid | 554f3189e1 | Use default fp16 inference | 2024-10-20 16:32:18 +08:00 |
| SWivid | 69850fa236 | fix. address #179 | 2024-10-20 12:43:01 +08:00 |
| cocktailpeanut | ae6e97b836 | user-friendly wandb support | 2024-10-18 14:59:59 -04:00 |
| Jarod Mica | 2a521e9050 | Merge branch 'SWivid:main' into main | 2024-10-15 22:58:31 -07:00 |
| SWivid | bc6331529a | split pkgs only for eval usage address #97; clean-up | 2024-10-15 21:14:44 +08:00 |
| Jarod Mica | 31e5051d51 | default to weights_only=True for safer loading | 2024-10-15 00:37:46 -07:00 |
| mrfakename | 923e95cadb | Fix unexpected indent issue | 2024-10-14 21:07:39 -07:00 |
| Yushen CHEN | 3acf3e2a9b | Update dataset.py, fix typo | 2024-10-15 12:05:15 +08:00 |
| Yushen CHEN | 4cdcccf7a3 | Update dataset.py | 2024-10-15 12:03:16 +08:00 |
| Jarod Mica | 6fda7e5f6f | Update to make passing in custom paths easier for finetuning/training | 2024-10-14 20:13:07 -07:00 |
| SWivid | 9d2b8cb3da | fix inference-cli; clean-up | 2024-10-14 23:40:31 +08:00 |
| SWivid | e938b40bee | add more detailed instruct. on inference. address #49 #50 | 2024-10-14 10:15:40 +08:00 |
| SWivid | 615d183a0d | add code-switch friendly synth. and a smoother silence remover | 2024-10-14 00:29:30 +08:00 |
| SWivid | 46d391a876 | fix replacement of ckpt keys when do finetune training | 2024-10-13 17:20:18 +08:00 |
| SWivid | 0d7b47bc3b | enable correct ckpt loading for finetune | 2024-10-13 14:41:08 +08:00 |
| SWivid | 83fbd34dc8 | convert all input audio to mono | 2024-10-13 13:39:16 +08:00 |
| SWivid | 9395289d7a | add ckpt load opt. for .safetensor | 2024-10-13 10:55:18 +08:00 |
| Zhikang Niu | edc189fa96 | Update trainer.py | 2024-10-13 10:04:13 +08:00 |
| SWivid | a621c223ec | add speech edit test script | 2024-10-11 00:41:23 +08:00 |
| SWivid | 39ce201c4e | disable mask for single infer to save mem; add custom trans for vocab to address oov | 2024-10-10 17:05:39 +08:00 |
| SWivid | 074881635d | basic | 2024-10-08 21:56:51 +08:00 |