aboutsummaryrefslogblamecommitdiff
path: root/www/py-w3lib/pkg-descr
blob: 7deeb4b4eb6574ba0ff49ef5b10ff0dc93b2d721 (plain) (tree)
1
2
3
4
5
6
7
8
9
10
11
12
13
14













                                                           
This is a Python library of web-related functions, such as:

  - remove comments, or tags from HTML snippets
  - extract base url from HTML snippets
  - translate entites on HTML strings
  - encoding mulitpart/form-data
  - convert raw HTTP headers to dicts and vice-versa
  - construct HTTP auth header
  - converting HTML pages to unicode
  - RFC-compliant url joining
  - sanitize urls (like browsers do)
  - extract arguments from urls

WWW: http://github.com/scrapy/w3lib