diff options
author | Isaac Yu <isaacyu1@isaacyu1.com> | 2020-05-22 21:36:13 +0700 |
---|---|---|
committer | Willy Sudiarto Raharjo <willysr@slackbuilds.org> | 2020-05-22 23:36:13 +0700 |
commit | 8a62cd46201c9cdce19915fcacb6541e8e326ae9 (patch) | |
tree | 00f1de983c5b0e66f6607a078f1fcc896309c9cb /python/BeautifulSoup4/README | |
parent | a289f8b6c7d2a91bafc485fc50ed9cb6f1c899b0 (diff) | |
download | slackbuilds-8a62cd46201c9cdce19915fcacb6541e8e326ae9.tar.gz slackbuilds-8a62cd46201c9cdce19915fcacb6541e8e326ae9.tar.xz |
python/BeautifulSoup4: Updated for v 4.9.1, moved from libraries.
Signed-off-by: Willy Sudiarto Raharjo <willysr@slackbuilds.org>
Diffstat (limited to 'python/BeautifulSoup4/README')
-rw-r--r-- | python/BeautifulSoup4/README | 32 |
1 files changed, 32 insertions, 0 deletions
diff --git a/python/BeautifulSoup4/README b/python/BeautifulSoup4/README new file mode 100644 index 0000000000..e0e102b270 --- /dev/null +++ b/python/BeautifulSoup4/README @@ -0,0 +1,32 @@ +Beautiful Soup is a Python HTML/XML parser designed for quick +turnaround projects like screen-scraping. Three features make it +powerful: + +1. Beautiful Soup won't choke if you give it bad markup. It yields a +parse tree that makes approximately as much sense as your original +document. This is usually good enough to collect the data you need +and run away. + +2. Beautiful Soup provides a few simple methods and Pythonic idioms for +navigating, searching, and modifying a parse tree: a toolkit for +dissecting a document and extracting what you need. You don't have to +create a custom parser for each application. + +3. Beautiful Soup automatically converts incoming documents to Unicode +and outgoing documents to UTF-8. You don't have to think about +encodings, unless the document doesn't specify an encoding and +Beautiful Soup can't autodetect one. Then you just have to specify +the original encoding. + +Beautiful Soup parses anything you give it, and does the tree traversal +stuff for you. You can tell it "Find all the links", or "Find all the +links of class externalLink", or "Find all the links whose urls match +"foo.com", or "Find the table heading that's got bold text, then give +me that text." + +Valuable data that was once locked up in poorly-designed websites is +now within your reach. Projects that would have taken hours take only +minutes with Beautiful Soup. + +If python3-soupsieve is installed, then this will also build for +Python 3. |