a speaker-independent large vocabulary continuous speech recognizer.
Sphinx is a speaker-independent large vocabulary continuous speech recognizer. It is also a collection of free and open source tools and resources that allows researchers and developers to build speech recognition systems. The packages that the CMU Sphinx Group is releasing are a set of reasonably mature, world-class speech components that provide a basic level of technology to anyone interested in creating speech-using applications without the once-prohibitive initial investment cost in research and development; the same components are open to peer review by all researchers in the field, and are used for linguistic research as well. Sphinx-3 is CMU's state-of-the-art large vocabulary speech recognition system. It uses Hidden Markov Models (HMM) with continuous output probability density functions (PDF). It supports several modes of operation. The more accurate mode, known as the "flat decoder", is descended from the original Sphinx-3 release (still available for reference purposes at https://cmusphinx.svn.sourceforge.net/svnroot/cmusphinx/trunk/archive_s3/s3). The faster mode, known as the "tree decoder", was developed separately. The two decoders were merged in Sphinx 3.5, though the flat decoder was not fully functional until Sphinx 3.7. Further documentation can be found in the release documentation, or at the online documentation.
Documentationhttp://www.speech.cs.cmu.edu/cmusphinx/moinmoin/ and http://www.speech.cs.cmu.edu/sphinxman/
released on 1 January 2009
17 February 2009
17 February 2009
Leaders and contributors
Resources and communication
|Bug Tracking||Bug Tracking||http://sourceforge.net/tracker/?group_id=1904&atid=101904|
This entry (in part or in whole) was last reviewed on 15 September 2016.