Difference between revisions of "Larbin"
(Created page with "{{Entry |Name=Larbin |Short description=Web crawler |Full description=Larbin is an HTTP Web crawler that can fetch more than 5 million pages a day on a standard PC (pentium II 30...") |
(No version changes) |
||
Line 3: | Line 3: | ||
|Short description=Web crawler | |Short description=Web crawler | ||
|Full description=Larbin is an HTTP Web crawler that can fetch more than 5 million pages a day on a standard PC (pentium II 300, 128 Mo SDRAM and a 10 Mbit ethernet card, with a good network). Larbin uses standard libraries, plus adns. The program is multithreaded but prefers using select instead of a lot of threads (for efficiency purposes). The advantage of Larbin over wget or ht://dig is that it is much faster (because it opens a lot of connexions at a time) and very easy to customize). Common uses include: a crawler for a standard search engine, a crawler for a specialized search engine (xml, images, mp3...), and to provide statistics about servers or page contents). | |Full description=Larbin is an HTTP Web crawler that can fetch more than 5 million pages a day on a standard PC (pentium II 300, 128 Mo SDRAM and a 10 Mbit ethernet card, with a good network). Larbin uses standard libraries, plus adns. The program is multithreaded but prefers using select instead of a lot of threads (for efficiency purposes). The advantage of Larbin over wget or ht://dig is that it is much faster (because it opens a lot of connexions at a time) and very easy to customize). Common uses include: a crawler for a standard search engine, a crawler for a specialized search engine (xml, images, mp3...), and to provide statistics about servers or page contents). | ||
+ | |Homepage URL=http://larbin.sourceforge.net/index-eng.html | ||
|User level=none | |User level=none | ||
|Computer languages=C++ | |Computer languages=C++ | ||
|Documentation note=User guide included | |Documentation note=User guide included | ||
|Related projects=ht:_Dig,Wget | |Related projects=ht:_Dig,Wget | ||
|Keywords=HTTP,Web,crawler,Larbin | |Keywords=HTTP,Web,crawler,Larbin | ||
|Version identifier=2.6.3 | |Version identifier=2.6.3 | ||
|Version date=2002-07-16 | |Version date=2002-07-16 | ||
|Version status=stable | |Version status=stable | ||
|Version download=http://prdownloads.sourceforge.net/larbin/larbin-2.6.3.tar.gz?download | |Version download=http://prdownloads.sourceforge.net/larbin/larbin-2.6.3.tar.gz?download | ||
+ | |Version comment=2.6.3 stable released 2002-07-16 | ||
+ | |Last review by=Alejandroindependiente | ||
+ | |Last review date=2008/06/24 | ||
+ | |Submitted by=Database conversion | ||
+ | |Submitted date=2011-04-01 | ||
+ | |Status= | ||
+ | |Is GNU=No | ||
+ | |License verified date=2001-04-04 | ||
+ | }} | ||
+ | {{Project license | ||
+ | |License=GPLv2 | ||
+ | |License verified by=Janet Casey | ||
|License verified date=2001-04-04 | |License verified date=2001-04-04 | ||
}} | }} | ||
{{Person | {{Person | ||
+ | |Real name=Sebastien Ailleret | ||
|Role=Maintainer | |Role=Maintainer | ||
|Email=sebastien@ailleret.com | |Email=sebastien@ailleret.com | ||
|Resource URL= | |Resource URL= | ||
Line 44: | Line 43: | ||
|Use=internet-application | |Use=internet-application | ||
}} | }} | ||
− | {{ | + | {{Featured}} |
− | }} |
Revision as of 22:25, 19 January 2017
Larbin
http://larbin.sourceforge.net/index-eng.html
Web crawler
Larbin is an HTTP web crawler that can fetch more than 5 million pages a day on a standard PC (Pentium II 300, 128 MB SDRAM, and a 10 Mbit Ethernet card, with a good network connection). Larbin uses standard libraries plus adns. The program is multithreaded but prefers select to a large number of threads, for efficiency. Its advantages over wget or ht://Dig are that it is much faster, because it opens many connections at a time, and that it is very easy to customize. Common uses include crawling for a standard search engine, crawling for a specialized search engine (XML, images, MP3, ...), and gathering statistics about servers or page contents.
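The select-based approach mentioned above can be illustrated with a minimal sketch. This is not Larbin's code (Larbin is written in C++); it is a Python illustration of the general technique, in which one thread waits on many open connections at once instead of dedicating a thread to each. The function names read_all_with_select and demo are hypothetical, and local socket pairs stand in for real HTTP connections so the example runs without network access.

```python
import select
import socket


def read_all_with_select(socks, timeout=1.0):
    """Read from many sockets in one thread until each peer closes.

    Returns a dict mapping each socket's index to the bytes it received.
    """
    pending = {s: i for i, s in enumerate(socks)}
    received = {i: b"" for i in range(len(socks))}
    while pending:
        # Block until at least one socket is readable (or the timeout expires).
        readable, _, _ = select.select(list(pending), [], [], timeout)
        if not readable:
            break  # nothing arrived in time; give up on the rest
        for s in readable:
            chunk = s.recv(4096)
            if chunk:
                received[pending[s]] += chunk
            else:
                # An empty read means the peer closed the connection.
                del pending[s]
    return received


def demo():
    # Three local socket pairs stand in for three HTTP connections (assumption).
    pairs = [socket.socketpair() for _ in range(3)]
    for i, (_, server_side) in enumerate(pairs):
        server_side.sendall(f"page {i}".encode())
        server_side.close()
    clients = [c for c, _ in pairs]
    try:
        return read_all_with_select(clients)
    finally:
        for c in clients:
            c.close()


if __name__ == "__main__":
    print(demo())
```

A real crawler would also need non-blocking connects, write readiness for sending requests, and asynchronous DNS resolution (the role adns plays for Larbin); the sketch covers only the read side.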
Licensing
License | Verified by | Verified on | Notes |
---|---|---|---|
GPLv2 | Janet Casey | 2001-04-04 | |
Leaders and contributors
Contact(s) | Role |
---|---|
Sebastien Ailleret | Maintainer |
Resources and communication
Audience | Resource type | URI |
---|---|---|
Bug Tracking, Developer, Support | | mailto:sebastien@ailleret.com |
Software prerequisites
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.
The copyright and license notices on this page only apply to the text on this page. Any software, copyright licenses, or other similar notices described in this text have their own copyright notice and license, which can usually be found in the distribution or license text itself.