Jsoup

From Free Software Directory
 
Jump to: navigation, search


[edit]

Jsoup

http://jsoup.org
Java HTML parser that makes sense of real-world HTML soup

Jsoup is a Java library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jquery-like methods.

jsoup implements the WHATWG HTML specification (http://whatwg.org/html), and parses HTML to the same DOM as modern browsers do.

  • parse HTML from a URL, file, or string * find and extract

data, using DOM traversal or CSS selectors * manipulate the HTML elements, attributes, and text * clean user-submitted content against a safe white-list, to prevent XSS * output tidy HTML

jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree.





Licensing

License

Verified by

Verified on

Notes

License

Other

Verified by

Debian: Emmanuel Bourg <ebourg@apache.org>

Verified on

21 October 2014

Notes

License: mit




Leaders and contributors

Contact(s)Role
Jonathan Hedley contact


Resources and communication

AudienceResource typeURI
Downloadhttp://jsoup.org
Debian (Ref)https://tracker.debian.org/pkg/jsoup


Software prerequisites




Entry








"contact" is not in the list (Maintainer, Contributor, Developer, Sponsor, Unknown) of allowed values for the "Role" property.


"Debian (Ref)" is not in the list (General, Help, Bug Tracking, Support, Developer) of allowed values for the "Resource audience" property.








Date 2015-07-17
Source Debian
Source link http://packages.debian.org/sid/jsoup

[[Category:]]



Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.

The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.