Summary:Natural voice interaction is an important form of human-machine interfacing. However, as high-precision ASR models are computationally intensive, these computation projects are usually run on the cloud, worsening the experience. This challenge aims to port high-precision streaming ASR to Sophgo’s SG2002 processor. You may choose between Chinese and English models. The goal is to achieve minimum word error rate whilst running on limited RAM (256MBytes) and a rate of RTF<1. You may reference Kaldi, Wenet, and other open source speech recognition projects for your port.