Abstract: Current sEMG-based speech generation methods primarily rely on large-scale datasets from single participants, which imposes a burden on users. Moreover, previous research methods often ...
Small and fast: only 123M parameters. High-quality voice cloning: state-of-the-art performance in speaker similarity, intelligibility, and naturalness. Multi-lingual: support Chinese and English.
Abstract: Recent advances in automatic speech recognition (ASR) have led to substantial improvements in system accuracy and robustness, particularly in converting speech signals into text sequences.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results