path: root/src/lxml
Commit message [Author, Age, Files, Lines]
* simplify constructor argument [Stefan Behnel, 2015-02-07, 1, -3/+2]
|
* fix crash when cleaning up XMLSchemas created from non-root elements: pointer to fake-root document was actually stored in the schema, which lead to double-free errors [Stefan Behnel, 2015-02-07, 2, -32/+49]
|
* fix LP#1369362: do not try to remove pseudo-attributes from processing instructions in HTML cleaner [Stefan Behnel, 2014-12-14, 1, -2/+2]
|   --HG-- extra : transplant_source : %FA%01c%AF%5E%13%95%25%1E%01%19%ADA%C6%2A%5D%12%C4n%5E
|
* clarify docs: lxml.html.fragment_fromstring(create_parent=True) allows multiple elements as parse result [Stefan Behnel, 2014-12-06, 1, -4/+6]
|
* clarify docs: ElementTree.iter() does not visit siblings of the root element [Stefan Behnel, 2014-12-06, 1, -1/+3]
|
* include CDATA sections when serialising tail text of top-level node [Stefan Behnel, 2014-11-23, 2, -3/+2]
|
* accept CDATA wrapped content for tail text [Stefan Behnel, 2014-11-23, 2, -19/+40]
|
* do not depend on formatting of "file:" URL in test (differs across libxml2 versions) [Stefan Behnel, 2014-10-19, 1, -1/+4]
|
* fix test [Stefan Behnel, 2014-10-09, 1, -2/+2]
|
* clean up some test code [Stefan Behnel, 2014-10-09, 2, -3/+5]
|
* init superclass in ElementTreeContentHandler [Stefan Behnel, 2014-10-09, 1, -0/+1]
|
* clean up duplicated test methods [Stefan Behnel, 2014-10-09, 2, -41/+29]
|   --HG-- extra : transplant_source : 7%08%A1%E2%24u%BE%09%3D%CD%89P%D5h%D9%E8%E4%21%F9W
|
* do not reject namespaced tags in htmlfile, clean up htmlfile tests a little [Stefan Behnel, 2014-09-25, 2, -20/+36]
|
* minor docstring cleanup [Stefan Behnel, 2014-09-16, 1, -3/+2]
|
* dried htmlfile [Burak Arslan, 2014-09-15, 1, -31/+8]
|
* add testcases for etree.htmlfile [Burak Arslan, 2014-09-15, 1, -0/+24]
|
* modify write_declaration to fail for every document type except xml [Burak Arslan, 2014-09-15, 1, -2/+2]
|
* don't write namespaces in html mode [Burak Arslan, 2014-09-15, 1, -3/+10]
|
* write_declaration now throws an exception for html files. [Burak Arslan, 2014-09-15, 1, -0/+3]
|
* implement a etree.htmlfile that passes OUTPUT_METHOD_HTML to _IncrementalFileWriter [Burak Arslan, 2014-09-15, 1, -0/+37]
|
* add a method argument to _IncrementalFileWriter [Burak Arslan, 2014-09-15, 1, -3/+6]
|
* fix tests in Py3 [Stefan Behnel, 2014-09-05, 1, -15/+13]
|
* allow disabling output buffering in xmlfile() and explicit flushing [Stefan Behnel, 2014-09-05, 2, -9/+93]
|
* lxml.html.document_fromstring ensure_head_body [jab, 2014-09-04, 2, -1/+22]
|
|   When using lxml.html.document_fromstring to process html outside your
|   control, you can't be sure it will have a head element or body element.
|   Allowing document_fromstring to accept an ensure_head_body option saves
|   you from having to write code like:
|
|       doc = document_fromstring(html)
|       try:
|           doc.head
|       except IndexError:
|           doc.insert(0, Element('head'))
|       # now we can safely reference doc.head
|
|   You can instead just write:
|
|       doc = document_fromstring(html, ensure_head_body=True)
|
* merge lxml-3.3 branch into master [Stefan Behnel, 2014-08-28, 3, -29/+113]
|\
| * refactor some duplicated code [Stefan Behnel, 2014-08-28, 1, -18/+16]
| |
| * prevent tree cycle creation when adding siblings [Stefan Behnel, 2014-08-28, 2, -14/+56]
| |
| * fix crash when deallocating sibling Element proxies that do not have a parent [Stefan Behnel, 2014-08-28, 2, -3/+47]
| |
* | refactor tag processing code in iterlinks() [Stefan Behnel, 2014-08-23, 1, -22/+22]
| |
* | include links in meta refresh tags in iterlinks [jab, 2014-08-22, 2, -0/+21]
| |
* | merge lxml-3.3 branch into master [Stefan Behnel, 2014-08-09, 2, -3/+63]
|\ \
| |/
| * fix LP#1354652: crash when traversing internally loaded documents in XSLT extension functions [Stefan Behnel, 2014-08-09, 2, -3/+63]
| |
* | make legacy import absolute (otherwise fails in Py3) [Stefan Behnel, 2014-08-05, 1, -1/+1]
| |
* | Add CDATA support in ElementBuilder. [Ionel Cristian Mărieș, 2014-06-16, 2, -1/+17]
| |
* | use per-document hash tables for XML IDs and allow disabling them completely with collect_ids=False [Stefan Behnel, 2014-05-28, 6, -31/+162]
| |
* | minor doc fixes [Stefan Behnel, 2014-05-25, 1, -3/+3]
| |
* | only apply decoding error change to XML parsing (not HTML for now) [Stefan Behnel, 2014-05-24, 1, -1/+1]
| |
* | raise a parser error even in recovery mode when encountering undecodable input to avoid having to deal with mixed-encoding trees [Stefan Behnel, 2014-05-24, 2, -3/+26]
| |
* | minor code cleanup [Stefan Behnel, 2014-05-24, 1, -2/+3]
| |
* | remove unused import [Stefan Behnel, 2014-05-24, 1, -2/+4]
| |
* | simplify CSS link parsing code in lxml.html [Stefan Behnel, 2014-04-25, 1, -6/+5]
| |
* | simplify regex usage in lxml.html [Stefan Behnel, 2014-04-25, 1, -5/+5]
| |
* | merge lxml-3.3 branch into master [Stefan Behnel, 2014-04-17, 2, -5/+13]
|\ \
| |/
| * strip control characters before looking for evil text content in Cleaner [Stefan Behnel, 2014-04-17, 2, -5/+13]
| |
| * add test for high XML line numbers [Stefan Behnel, 2014-03-18, 1, -0/+16]
| |   --HG-- extra : transplant_source : %0EX%40u%B3%C1m%13%B3%82%11L%2C%98X%E7%D1%E1%13%1A
| |
| * use XML_PARSE_BIG_LINES parser option if available (libxml2 2.9.0+) [Stefan Behnel, 2014-03-18, 2, -1/+6]
| |   --HG-- extra : transplant_source : %96%B9%CB%8E%81h%96%1D%F6%F0E%CF%26%AD%17%9FgP%83%C7
| |
| * add parse options added in recent libxml2 versions [Stefan Behnel, 2014-03-18, 1, -0/+6]
| |   --HG-- extra : transplant_source : %28%B1%7E%87%D1%E4%98%85%94xao%06%929%A6%C0%5E%1B%A1
| |
| * fix HTML wrapping for bytes strings in Py3 [Stefan Behnel, 2014-03-16, 1, -1/+3]
| |   --HG-- extra : transplant_source : %FA%A2p%CD%F9b%DD%AC%01%5E%CAt1%14%9C%97%C12b%5E
| |
| * PyPy fix [Stefan Behnel, 2014-03-10, 1, -1/+2]
| |
* | add a method tree.getelementpath(element) that generates a structural ElementPath expression for an Element [Stefan Behnel, 2014-04-06, 2, -1/+135]