TextAnalyzer
TextAnalyzer
http://martin.ankerl.com/2007/01/09/textanalyzer-automatically-extract-characteristic-words/
Is a text analyzer tool that finds out words that are characteristic for a given input file.
TextAnalzyer is a text analyzer tool that finds out words that are characteristic for a given input file. It is independent from any language, and even seems to work well with HTML files. First you have to train the program with sample text. (e.g. all of Grimm's fairy tales). Using this index you can analyze a single text file to find out the characteristic words. For the story of "Little Red Riding Hood" you get "hood, grandma, riding, hunter, red" as the most characteristic words of the text.
Licensing
License
Verified by
Verified on
Notes
Leaders and contributors
Contact(s) | Role |
---|---|
Martin Ankerl | Maintainer |
Resources and communication
Audience | Resource type | URI |
---|---|---|
Bug Tracking,Developer | mailto:martin.ankerl@gmail.com |
Software prerequisites
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.
The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.