speech recognition - Trying to find better models for CMU Sphinx - OGeek|极客中国-技术改变生活,极客改变未来

I'm writing a program to transcribe audio using CMU Sphinx. I'm not happy with the quality and I thought maybe I could find a better model. But I don't really understand the difference between the models available. There are the models that are in the sphinx4-data jar and then I found this page, https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/US%20English/, but I don't fully understand what the differences are. And I'm not even sure what files to use.

There is the Accoustic Model, Dictionary and Language Model.
I'd like my program to be as general as possible, i.e., to be able to transcribe any speech (English, to start with). What are the best models to use?

question from:https://stackoverflow.com/questions/65904192/trying-to-find-better-models-for-cmu-sphinx

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Reply

Categories

speech recognition - Trying to find better models for CMU Sphinx

speech recognition - Trying to find better models for CMU Sphinx

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags