Difference between revisions of "Spchcat"

From Free Software Directory
Jump to: navigation, search
(Add entry for spchcat)
 
(approved new entry, licensing check)
 
Line 8: Line 8:
 
spchcat is a command-line tool that reads in audio from .WAV files, a microphone, or system audio inputs and converts any speech found into text. It runs locally on your machine, with no web API calls or network activity. It is built on top of Coqui's speech to text library, TensorFlow, KenLM, and data from Mozilla's Common Voice project.
 
spchcat is a command-line tool that reads in audio from .WAV files, a microphone, or system audio inputs and converts any speech found into text. It runs locally on your machine, with no web API calls or network activity. It is built on top of Coqui's speech to text library, TensorFlow, KenLM, and data from Mozilla's Common Voice project.
 
|Homepage URL=https://github.com/petewarden/spchcat
 
|Homepage URL=https://github.com/petewarden/spchcat
 +
|Version download=https://github.com/petewarden/spchcat/archive/refs/tags/v0.0.2-rpi-alpha.tar.gz
 
}}
 
}}
 
{{Project license
 
{{Project license
 
|License=MPL-2.0
 
|License=MPL-2.0
|License copyright=https://github.com/petewarden/spchcat/blob/main/LICENSE
+
|License copyright=Pete Warden
|License verified by=mmcmahon
+
|License verified by=mmcmahon, craigt
 
|License verified date=2022-01-11
 
|License verified date=2022-01-11
 +
|License note=https://github.com/petewarden/spchcat/blob/main/LICENSE
 
}}
 
}}
 
{{Software category
 
{{Software category

Latest revision as of 15:50, 14 January 2022


[edit]

spchcat

https://github.com/petewarden/spchcat
Speech recognition tool to convert audio to text transcripts

From GitHub:

Speech recognition tool to convert audio to text transcripts for GNU/Linux.

spchcat is a command-line tool that reads in audio from .WAV files, a microphone, or system audio inputs and converts any speech found into text. It runs locally on your machine, with no web API calls or network activity. It is built on top of Coqui's speech to text library, TensorFlow, KenLM, and data from Mozilla's Common Voice project.





Licensing

License

Verified by

Verified on

Notes

License

MPL-2.0

Verified by

mmcmahon, craigt

Verified on

11 January 2022




Leaders and contributors

Resources and communication

Software prerequisites

KindDescription
Required to usehttps://directory.fsf.org/wiki/STT




Entry












Property "Submitted by" (as page type) with input value "{{{Submitted by}}}" contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process. "{{{Submitted date}}}" contains an extrinsic dash or other characters that are invalid for a date interpretation.








Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.

The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.