CMUSphinx3
Sphinx is a speaker-independent large vocabulary continuous speech recognizer. It is also a collection of free and open source tools and resources that allows researchers and developers to build speech recognition systems.
The packages released by the CMU Sphinx Group are a set of reasonably mature, world-class speech components. They provide a basic level of technology to anyone interested in building speech-enabled applications, without the once-prohibitive initial investment in research and development. The same components are open to peer review by all researchers in the field and are also used for linguistic research.
Sphinx-3 is CMU's state-of-the-art large vocabulary speech recognition system. It uses Hidden Markov Models (HMMs) with continuous output probability density functions (PDFs). It supports several modes of operation. The more accurate mode, known as the "flat decoder", is descended from the original Sphinx-3 release (still available for reference purposes at https://cmusphinx.svn.sourceforge.net/svnroot/cmusphinx/trunk/archive_s3/s3). The faster mode, known as the "tree decoder", was developed separately. The two decoders were merged in Sphinx 3.5, though the flat decoder was not fully functional until Sphinx 3.7. Further documentation is available in the release documentation and in the online documentation.
Last updated 17 Feb, 2009
About
Leadership
- Evandro Gouvêa - Maintainer
- Mosur Ravishankar - Maintainer
- Arthur Chan - Maintainer
Related Projects
CMUSphinx- PocketSphinx, CMUSphinx- Training, CMUSphinx- base
Versions
0.8
- Released: 17 Feb, 2009
- Code Maturity: Beta
- Source Archive: http://downloads.sourceforge.net/cmusphinx/sphi...
- Licenses: BSD_2Clause, X11
- Interfaces: Command Line
User Community and Support
http://www.speech.cs.cmu.edu/cmusphinx/moinmoin/ and http://www.speech.cs.cmu.edu/sphinxman/
