aboutsummaryrefslogtreecommitdiff
path: root/textproc/py-html-text/pkg-descr
blob: 3ded2dd0baf6891aee62dc74bf9affd877c8ad32 (plain) (blame)
1
2
3
4
5
6
7
Extract text from HTML.

html_text is a library for extracting text from HTML, with a few handy
features:
- It removes leading and trailing whitespace
- It handles HTML entities
- It uses lxml for parsing