summaryrefslogtreecommitdiff
path: root/src/lxml/parser.pxi
Commit message (Collapse)AuthorAgeFilesLines
* use per-document hash tables for XML IDs and allow disabling them completely ↵Stefan Behnel2014-05-281-19/+62
| | | | with collect_ids=False
* minor doc fixesStefan Behnel2014-05-251-3/+3
|
* only apply decoding error change to XML parsing (not HTML for now)Stefan Behnel2014-05-241-1/+1
|
* raise a parser error even in recovery mode when encountering undecodable ↵Stefan Behnel2014-05-241-3/+12
| | | | input to avoid having to deal with mixed-encoding trees
* minor code cleanupStefan Behnel2014-05-241-2/+3
|
* remove legacy code for now unsupported libxml2/libxslt versionsStefan Behnel2014-03-221-7/+1
| | | | | --HG-- extra : amend_source : 5f766bb41c74b8ea7bba7f71905fb18cb90a19f2
* use XML_PARSE_BIG_LINES parser option if available (libxml2 2.9.0+)Stefan Behnel2014-03-181-1/+2
|
* remove some legacy codeStefan Behnel2014-03-101-6/+1
|
* improve docstring description of "remove_blank_text" parser optionStefan Behnel2014-02-281-2/+2
|
* undo doc freeing change: crashes when doc has already been used elsewhereStefan Behnel2014-02-251-2/+0
|
* safety fix: free parsed document if it's left in the parser context for some ↵Stefan Behnel2014-02-251-0/+2
| | | | reason
* fix corner case where name of HTML root node was not put into parser dictStefan Behnel2014-01-311-2/+2
|
* fix up tag dict usage also for the feed parserStefan Behnel2014-01-311-2/+37
|
* implement iterparse() parsing of BOM prefixed filesStefan Behnel2014-01-291-0/+21
|
* fix several error/exception handling cases throughout the code baseStefan Behnel2014-01-171-15/+23
|
* _ParserContext.__dealloc__() doesn't need to disconnect its XMLSchema ↵Stefan Behnel2014-01-171-3/+0
| | | | validator since the validators __dealloc__() does it anyway
* fix GC crashesStefan Behnel2014-01-171-0/+1
|
* provide Py_UNICODE parsing fallback even in Py3.3+ (might be useful for ↵Stefan Behnel2014-01-161-23/+23
| | | | | | | Windows systems) --HG-- extra : amend_source : 2278c552b8be65e7eaf1075ec72f359cab4bd2ac
* make class lookups work in iterparse, pull parsers and target parserStefan Behnel2014-01-091-2/+2
|
* try fixing build with MSVCStefan Behnel2014-01-081-2/+2
|
* safety fixesStefan Behnel2014-01-021-3/+3
|
* optimise Unicode string parsing for PEP393 (Py3.3+)Stefan Behnel2014-01-021-8/+35
|
* remove redundant input normalisation code from parser entry functions and ↵Stefan Behnel2014-01-021-28/+15
| | | | fix Unicode HTML string input parsing also in the case of an unknown native encoding
* implement recover mode for pull parsers and iterparseStefan Behnel2013-12-141-0/+6
|
* minor code cleanupStefan Behnel2013-10-031-1/+1
|
* fix event collection with custom targets, write some tests for it and add an ↵Stefan Behnel2013-09-131-7/+8
| | | | example to the documentation
* fix DTD loading and base URL setting for iterparse() and pull parsersStefan Behnel2013-09-121-13/+27
|
* extend pull parser docstringsStefan Behnel2013-09-091-2/+24
|
* adapt the PullParser interfaces (mostly) to the one in ElementTree (Py3.4)Stefan Behnel2013-09-091-3/+20
|
* refactor iterparse() event collection into a general parser feature to ↵Stefan Behnel2013-09-081-32/+61
| | | | support arbitrary parser targets
* free GIL in resolver code when libxml2 potentially does I/OStefan Behnel2013-04-281-4/+14
|
* always restore parser context options after calling into libxml2's parserStefan Behnel2013-04-281-1/+6
|
* safely report IOErrors even in the face of unexpectedly encoded file namesStefan Behnel2013-04-271-1/+9
| | | | | --HG-- extra : rebase_source : e19f5f1b11ba54126f147b2a67110b2d9b66754d
* remove _BaseParser from lxml.etree module dict and clean up some codeStefan Behnel2013-03-231-3/+2
|
* remove some useless explicit usages of C-API calls from the code baseStefan Behnel2013-02-171-6/+6
|
* clean up string parsing code a bit and improve unicode XML declaration errorStefan Behnel2013-02-171-9/+10
|
* Fix undefined C symbol in Python runtimes compiled without threading supportStefan Behnel2013-01-241-1/+1
|
* fix another load of C compiler warnings about xmlChar* unsignedness etc.Stefan Behnel2012-08-111-1/+1
|
* deleted dead codeStefan Behnel2012-08-111-10/+0
|
* fix parsing when long Unicode strings are passed into the feed() method: ↵Stefan Behnel2012-08-091-10/+3
| | | | part of the string was dropped
* fixed libxml2 API usage by appropriately using 'const' and 'xmlChar*'Stefan Behnel2012-07-301-24/+23
|
* use absolute cimports to prevent build interference with already installed ↵Stefan Behnel2012-07-301-2/+2
| | | | lxml packages
* code cleanup: replace call to PyErr_NoMemory() by explicit exceptionStefan Behnel2012-04-071-7/+7
|
* fix some compiler warningsStefan Behnel2012-04-071-1/+1
|
* minor code cleanupStefan Behnel2012-04-051-2/+2
|
* instead of a global setup, use execution local error callbacks for XMLSchema ↵Stefan Behnel2012-04-051-10/+39
| | | | and parsing
* use 'cstring_h.*' instead of 'string' for string.h functionsTay Ray Chuan2012-01-101-2/+2
|
* reuse cython definition for INT_MAXTay Ray Chuan2012-01-081-8/+8
|
* reuse cython definitions for stdio.hTay Ray Chuan2012-01-081-3/+3
| | | | For existing cimports from libc.stdio, apply the stdio.* convention.
* reuse cython definitions for string.hTay Ray Chuan2012-01-081-2/+2
|