I hit the following error when synthesizing speech with the ModelScope Cantonese model. Has anyone run into this?

Traceback (most recent call last):
  File "kantts/bin/text_to_wav.py", line 234, in <module>
    args.lang,
  File "kantts/bin/text_to_wav.py", line 161, in text_to_wav
    am_infer(symbols_file, am_ckpt, output_dir, se_file)
  File "/root/KAN-TTS/kantts/bin/infer_sambert.py", line 222, in am_infer
    line[1], fsnet, ling_unit, device, se=se
  File "/root/KAN-TTS/kantts/bin/infer_sambert.py", line 87, in am_synthesis
    [inputs_sy, inputs_tone, inputs_syllable, inputs_ws], dim=-1
RuntimeError: stack expects each tensor to be equal size, but got [5] at entry 0 and [21] at entry 1
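
The message says that the sequences being stacked at line 87 do not all have the same length: entry 0 (`inputs_sy`) has 5 elements and entry 1 (`inputs_tone`) has 21. A minimal sketch of a check that could be run just before the stack call to see which sequence is off. The helper name `find_length_mismatch` is hypothetical and not part of KAN-TTS, and plain Python lists stand in for the real tensors here (`len()` on a 1-D tensor also returns its length):

```python
def find_length_mismatch(named_seqs):
    """Return {name: length} if the sequences differ in length, else {}."""
    lengths = {name: len(seq) for name, seq in named_seqs.items()}
    if len(set(lengths.values())) > 1:
        return lengths
    return {}


# Stand-in data using the two sizes reported in the error message (5 vs 21).
# In infer_sambert.py these would be the sequences built before the stack call;
# the sizes of inputs_syllable and inputs_ws are not known from the traceback.
example = {
    "inputs_sy": [0] * 5,
    "inputs_tone": [0] * 21,
}
mismatch = find_length_mismatch(example)
print(mismatch)
```

If the lengths differ like this, the input symbol sequence was probably not split into the same number of units for each feature, so the text front-end output for the Cantonese model is worth inspecting first. That is a guess from the shapes alone, not a confirmed diagnosis.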

小小爱吃香菜 2024-01-22 17:17:00
0 answers
