CMUSphinx3

Sphinx is a speaker-independent large vocabulary continuous speech recognizer. It is also a collection of free and open source tools and resources that allows researchers and developers to build speech recognition systems.

The packages released by the CMU Sphinx Group are a set of reasonably mature speech components. They provide a basic level of technology to anyone interested in creating speech-enabled applications, without the once-prohibitive initial investment in research and development. The same components are open to peer review by all researchers in the field and are also used for linguistic research.

Sphinx-3 is CMU's state-of-the-art large vocabulary speech recognition system. It uses hidden Markov models (HMMs) with continuous output probability density functions (PDFs) and supports two modes of operation. The more accurate mode, known as the "flat decoder", is descended from the original Sphinx-3 release (still available for reference purposes at https://cmusphinx.svn.sourceforge.net/svnroot/cmusphinx/trunk/archive_s3/s3). The faster mode, known as the "tree decoder", was developed separately. The two decoders were merged in Sphinx 3.5, though the flat decoder was not fully functional until Sphinx 3.7. Further details are available in the release documentation and in the online documentation.
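To make the phrase "HMMs with continuous output probability density functions" concrete, the following is a minimal illustrative sketch in Python. It is not Sphinx-3 code and does not use any Sphinx API. It assumes a toy model in which each state emits a one-dimensional observation from a single Gaussian (real continuous-density recognizers typically use mixtures of multivariate Gaussians), and it scores an observation sequence with the standard forward algorithm in the log domain. All names and parameter values here are hypothetical.

```python
import math


def log_gaussian(x, mean, var):
    """Log of the 1-D Gaussian density N(x; mean, var)."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of log values."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def forward_log_likelihood(obs, log_init, log_trans, means, variances):
    """Log-likelihood of obs under an HMM with Gaussian state outputs.

    log_init[s]     : log probability of starting in state s
    log_trans[p][s] : log probability of moving from state p to state s
    means[s], variances[s] : parameters of state s's output density
    """
    n = len(log_init)
    # Initialisation: start in each state and emit the first observation.
    alpha = [
        log_init[s] + log_gaussian(obs[0], means[s], variances[s])
        for s in range(n)
    ]
    # Recursion: sum over predecessor states, then emit the next observation.
    for x in obs[1:]:
        alpha = [
            log_sum_exp([alpha[p] + log_trans[p][s] for p in range(n)])
            + log_gaussian(x, means[s], variances[s])
            for s in range(n)
        ]
    # Termination: sum over all final states.
    return log_sum_exp(alpha)


# Hypothetical two-state toy model (values chosen only for illustration).
LOG_INIT = [math.log(0.6), math.log(0.4)]
LOG_TRANS = [
    [math.log(0.7), math.log(0.3)],
    [math.log(0.2), math.log(0.8)],
]
MEANS = [0.0, 3.0]
VARIANCES = [1.0, 1.0]

if __name__ == "__main__":
    score = forward_log_likelihood(
        [0.1, 0.2, 2.9], LOG_INIT, LOG_TRANS, MEANS, VARIANCES
    )
    print("log-likelihood:", score)
```

Working in the log domain avoids numerical underflow on long observation sequences, which is why the sketch sums probabilities through `log_sum_exp` rather than multiplying raw densities.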

Last updated 17 Feb, 2009


User level: Advanced

License(s):

BSD_2Clause
X11


About

Related Projects

CMUSphinx- PocketSphinx, CMUSphinx- Training, CMUSphinx- base

Versions

0.8

User Community and Support

http://www.speech.cs.cmu.edu/cmusphinx/moinmoin/ and http://www.speech.cs.cmu.edu/sphinxman/


Please send comments on these web pages to bug-directory@fsf.org and other questions to info@fsf.org.

Copyright © 2000 - 2010 Free Software Foundation, Inc., 51 Franklin Street, 5th Floor, Boston, MA 02110-1301, USA

The copyright licensing notice below applies to this text. Any software described in this text has its own copyright notice and license, which can usually be found in the distribution itself.

Permission is granted to copy, distribute, and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, with no Front-Cover Texts, and with no Back-Cover Texts.