Extract text from HTML.
html_text is a library for extracting text from HTML, with a few handy
features:
- It removes leading and trailing whitespace
- It handles HTML entities
- It uses lxml for parsing
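
The first two features above can be illustrated with a minimal sketch. It is not html_text's own code: it swaps in the standard library's html.parser in place of lxml so that it runs without extra dependencies, and the names `_TextCollector` and `extract_text_sketch` are hypothetical, chosen only for this illustration.

```python
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Hypothetical helper: gathers the text nodes of an HTML document."""

    def __init__(self):
        # convert_charrefs=True makes the parser decode entities
        # (for example &amp; becomes &) before handle_data is called.
        super().__init__(convert_charrefs=True)
        self._chunks = []

    def handle_data(self, data):
        # Called for every run of text found between tags.
        self._chunks.append(data)

    def text(self):
        # Join the collected text and drop leading/trailing whitespace.
        return "".join(self._chunks).strip()


def extract_text_sketch(html):
    """Return the text of `html` with entities decoded and edges stripped."""
    parser = _TextCollector()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return parser.text()


print(extract_text_sketch("  <p>Fish &amp; chips</p>\n"))  # prints "Fish & chips"
```

The sketch keeps inner whitespace untouched and makes no attempt to separate block-level elements; it only shows entity decoding and edge trimming.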