Web-Scraper

From Free Software Directory
Revision as of 13:52, 18 August 2015 by WikiSysop (talk | contribs)


Web-scraper

https://metacpan.org/release/Web-Scraper
Web Scraping Toolkit using HTML and CSS Selectors or XPath expressions

Web::Scraper is a web scraper toolkit, inspired by Ruby's equivalent, Scrapi. It provides a DSL-ish interface for traversing HTML documents and returning a neatly arranged Perl data structure.

The scraper and process blocks define which segments of a document to extract. The toolkit understands HTML and CSS selectors as well as XPath expressions.
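
As a rough illustration of the scraper and process blocks, here is a minimal sketch that extracts data from an inline HTML string, once with an XPath expression and once with a CSS selector. The sample HTML, the result keys (title, langs) and the variable names are invented for this example; only scraper, process and scrape come from Web::Scraper itself.

```perl
use strict;
use warnings;
use Web::Scraper;

# Inline sample document, invented for this sketch.
my $html = <<'HTML';
<html>
  <head><title>Sample page</title></head>
  <body>
    <ul id="langs">
      <li class="lang">Perl</li>
      <li class="lang">Ruby</li>
    </ul>
  </body>
</html>
HTML

# Declare what to extract: each process call pairs a selector
# with a key in the resulting data structure.
my $page = scraper {
    # XPath expression: text content of the <title> element.
    process '//title', title => 'TEXT';

    # CSS selector: the "[]" suffix collects every match into an array.
    process 'ul#langs li.lang', 'langs[]' => 'TEXT';
};

# scrape is given an HTML string here; it can also take a URI object.
my $result = $page->scrape($html);

print "Title: $result->{title}\n";
print "Language: $_\n" for @{ $result->{langs} };
```

The result is an ordinary hash reference, so it can be walked or serialized like any other Perl data structure. To scrape a live page instead of a string, a URI object can be passed to scrape in place of $html.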





Licensing

License: Other
Verified by: Debian: gregor herrmann <gregoa@debian.org>
Verified on: 22 October 2014
Notes: License: artistic or gpl-1+




Leaders and contributors

Contact(s)            Role
Tatsuhiko Miyagawa    contact


Resources and communication

Audience    Resource type    URI
Perl        (Ref)            https://metacpan.org/release/Web-Scraper
Debian      (Ref) (R)        https://tracker.debian.org/pkg/libweb-scraper-perl


Software prerequisites




Entry




Date 2015-07-17
Source Debian
Source link http://packages.debian.org/sid/libweb-scraper-perl












Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.

The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.