From caa2748cb1c774d18a1664c3434cbc7c862bb46f Mon Sep 17 00:00:00 2001
From: crayzeewulf
Date: Thu, 21 Mar 2013 14:15:52 -0700
Subject: Corrected the sample output of clean_html()

The output of clean_html() does not include html and body tags. The example
output in the documentation was corrected.
---
 doc/lxmlhtml.txt | 29 ++++++++++++-----------------
 1 file changed, 12 insertions(+), 17 deletions(-)

diff --git a/doc/lxmlhtml.txt b/doc/lxmlhtml.txt
index 776a4ae3..940e65bb 100644
--- a/doc/lxmlhtml.txt
+++ b/doc/lxmlhtml.txt
@@ -515,24 +515,19 @@ To remove the all suspicious content from this unparsed document, use the
 .. sourcecode:: pycon
 
     >>> from lxml.html.clean import clean_html
-    >>> print clean_html(html)
-
-

- - a link - another link -

a paragraph

secret EVIL!

- of EVIL! - Password: - annoying EVIL! - spam spam SPAM! -

- - +

+ + a link + another link +

a paragraph

secret EVIL!

+ of EVIL! + + + Password: + annoying EVIL!spam spam SPAM! +

 The ``Cleaner`` class supports several keyword arguments to control exactly which content is removed:
-- 
cgit v1.2.1