Beautiful Soup
Beautiful Soup is a Python HTML/XML parser designed for quick-turnaround projects like screen-scraping. Three features make it powerful:
Beautiful Soup parses anything you give it and does the tree traversal stuff for you. You can tell it "Find all the links", or "Find all the links of class externalLink", or "Find all the links whose URLs match 'foo.com'", or "Find the table heading that's got bold text, then give me that text."
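The four requests quoted above might look roughly like the sketch below. It assumes the later Beautiful Soup 4 package (imported as bs4) rather than the 3.1.0.1 release listed on this page, and it uses Python's built-in "html.parser" backend. The sample markup and all variable names are invented for illustration.

```python
import re

from bs4 import BeautifulSoup  # assumption: Beautiful Soup 4, not the 3.x release listed here

# Hypothetical sample markup, made up for this example.
HTML = """
<html><body>
<a href="http://foo.com/a" class="externalLink">Foo A</a>
<a href="/local">Local</a>
<a href="http://bar.org/" class="externalLink">Bar</a>
<table>
<tr><th>Plain</th><th><b>Bold heading</b></th></tr>
</table>
</body></html>
"""

soup = BeautifulSoup(HTML, "html.parser")

# "Find all the links"
all_links = soup.find_all("a")

# "Find all the links of class externalLink"
external_links = soup.find_all("a", class_="externalLink")

# "Find all the links whose URLs match 'foo.com'"
foo_links = soup.find_all("a", href=re.compile(r"foo\.com"))

# "Find the table heading that's got bold text, then give me that text"
bold_heading = next(th for th in soup.find_all("th") if th.find("b") is not None)
bold_text = bold_heading.b.get_text()

print(len(all_links), len(external_links))
print([a["href"] for a in foo_links])
print(bold_text)
```

Each query is a single call on the parsed tree, which is the "tree traversal" the description refers to. The regular expression is only one way to match URLs; a plain string or a function could be passed as the href filter instead.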
Valuable data that was once locked up in poorly designed websites is now within your reach. Projects that would have taken hours take only minutes with Beautiful Soup.
Last updated 9 Jun, 2009
Versions
3.1.0.1
- Released: 6 Jan, 2009
- Code Maturity: Stable
- Source Archive: http://www.crummy.com/software/BeautifulSoup/do...
- Licenses: BSD_3Clause
- Interfaces: Command Line
User Community and Support
http://www.crummy.com/software/BeautifulSoup/documentation.html




