Categories
Larbin
Larbin is an HTTP Web crawler that can fetch more than 5 million pages a day on a standard PC (pentium II 300, 128 Mo SDRAM and a 10 Mbit ethernet card, with a good network). Larbin uses standard libraries, plus adns. The program is multithreaded but prefers using select instead of a lot of threads (for efficiency purposes). The advantage of Larbin over wget or ht://dig is that it is much faster (because it opens a lot of connexions at a time) and very easy to customize).
Common uses include: a crawler for a standard search engine, a crawler for a specialized search engine (xml, images, mp3...), and to provide statistics about servers or page contents).
Last updated 24 Jun, 2008
Versions
2.6.3
2.6.3 stable released 2002-07-16
- Released: 16 Jul, 2002
- Code Maturity: Stable
- Source Archive: http://prdownloads.sourceforge.net/larbin/larbi...
- Licenses: GPLv2
- Interfaces: Web




