Extract text contained within an HTML document