summaryrefslogtreecommitdiff
path: root/HTMLparser.c
Commit message (Collapse)AuthorAgeFilesLines
* Fix a bunch of scan 'dead increments' and cleanupDaniel Veillard2009-09-051-1/+1
| | | | | | | | * HTMLparser.c c14n.c debugXML.c entities.c nanohttp.c parser.c testC14N.c uri.c xmlcatalog.c xmllint.c xmlregexp.c xpath.c: fix unused variables, or unneeded increments as well as a couple of space issues * runtest.c: check for NULL before calling unlink()
* 444994 HTML chunked failure for attribute with <>Daniel Veillard2009-08-251-73/+85
| | | | | | * HTMLparser.c: fix htmlParseLookupSequence to not save ctxt->checkIndex when the current buffer ends within an attribute value, as this information would be missed in next pass.
* Non ASCII character may be split at buffer endAdiel Mittmann2009-08-251-3/+9
| | | | | * HTMLparser.c: make sure when we call xmlParserInputGrow in htmlCurrentChar, to reset the current pointer
* 572129 speed up parasing of large HTML text nodesMarkus Kull2009-08-241-2/+77
| | | | | * HTMLparser.c: use a different lookup function htmlParseLookupChars() to avoid the quadratic behaviour
* Remove a pedantic warningDaniel Veillard2009-08-241-1/+2
|
* Fix HTML parsing with 0 character in CDATADaniel Veillard2009-08-241-0/+7
| | | | | * HTMLparser.c: 0 before the end of the input need some special case handling, raise the error and return a space instead
* 541335 HTML avoid creating 2 head or 2 body elementDaniel Veillard2009-08-241-7/+19
| | | | | | | * HTMLparser.c: check when we see an head or a body tag and avoid autogenerating them * include/libxml/parser.h: the values for ctxt->html change depending on the head or body tags being seen
* 541237 error correcting missing end tags in HTMLDaniel Veillard2009-08-241-1/+1
| | | | * HTMLparser.c: make sure /p closes the FONTSTYLE list of elements
* Fix a small problem on previous HTML parser patchDaniel Veillard2009-08-221-3/+4
|
* 592430 - HTML parser runs into endless loopDaniel Veillard2009-08-221-125/+139
| | | | | | * HTMLparser.c: fix the problem with detection erroring absolutely, and properly popping up the stack when in EOF, also passes XML_PARSE_HUGE when decoding options.
* 588441 allow '.' in HTML Names even if invalidDaniel Veillard2009-08-131-2/+3
| | | | * HTMLparser.c: just allow '.' in htmlParseHTMLName list of characters
* 579317 Try to find the HTML encoding informationDaniel Veillard2009-08-121-2/+74
| | | | | | * HTMLparser.c: if we hit an encoding error before parsing a potential <meta> with the info look in the input buffer to see if we can find it instead of forcing a blind switch to ISO-8859-1
* 576368 – htmlChunkParser with special attributesJiri Netolicky2009-08-071-17/+36
| | | | | * HTMLparser.c: htmlChunkParsing failed when the chunk ends inside element after some attribute which has a '>' char in its value.
* * HTMLparser.c: make sure we keep line numbers fixes #580705Daniel Veillard2009-05-151-0/+1
| | | | | based Aaron Patterson patch Daniel
* * HTMLparser.c: a broken HTML table attributes initialization,Roland Steiner2009-05-121-1/+1
| | | | | fixes #581803, by Roland Steiner <rolandsteiner@google.com> Daniel
* preparing the release of 2.7.2 fix the Solaris portability issueLIBXML2.7.2Daniel Veillard2008-10-031-3/+22
| | | | | | | | | | | | * configure.in doc/* NEWS: preparing the release of 2.7.2 * dict.c: fix the Solaris portability issue * parser.c: additional cleanup on #554660 fix * test/ent13 result/ent13* result/noent/ent13*: added the example in the regression test suite. * HTMLparser.c: handle leading BOM in htmlParseElement() Daniel svn path=/trunk/; revision=3799
* fix an HTML parsing error on large data sections reported by Mike Day addDaniel Veillard2008-09-251-0/+7
| | | | | | | | | | | * HTMLparser.c: fix an HTML parsing error on large data sections reported by Mike Day * test/HTML/utf8bug.html result/HTML/utf8bug.html.err result/HTML/utf8bug.html.sax result/HTML/utf8bug.html: add the reproducer to the test suite daniel svn path=/trunk/; revision=3797
* patch from Robert Schwebel , allows to compile the example if configuredDaniel Veillard2008-08-291-1/+5
| | | | | | | | | | | | * doc/examples/reader3.c: patch from Robert Schwebel , allows to compile the example if configured without output support fixes #545582 * Makefile.am: add testrecurse to the make check tests * HTMLparser.c: if the parser got a encoding argument it should be used over what the meta specifies, patch fixing #536346 Daniel svn path=/trunk/; revision=3785
* more progresses against the official regression tests small cleanup forDaniel Veillard2008-07-311-0/+1
| | | | | | | | | | | | | | | | | * runxmlconf.c: more progresses against the official regression tests * runsuite.c: small cleanup for non-leak reports * include/libxml/tree.h: parsing flags and other properties are now added to the document node, this is generally useful and allow to make Name and NmToken validations based on the parser flags, more specifically the 5th edition of XML or not * HTMLparser.c tree.c: small side effects for the previous changes * parser.c SAX2.c valid.c: the bulk of teh changes are here, the parser and validation behaviour can be affected, parsing flags need to be copied, lot of changes. Also fixing various validation problems in the regression tests. Daniel svn path=/trunk/; revision=3762
* applied patch from Ashwin fixing a number of realloc problems improveDaniel Veillard2008-04-241-1/+15
| | | | | | | | * uri.c: applied patch from Ashwin fixing a number of realloc problems * HTMLparser.c: improve handling for misplaced html/head/body Daniel svn path=/trunk/; revision=3740
* apparently it's okay to forget the semicolumn after entity refs in HTML,Daniel Veillard2008-04-031-4/+4
| | | | | | | | | * HTMLparser.c: apparently it's okay to forget the semicolumn after entity refs in HTML, fixing char refs parsing accordingly based on T. Manske patch, this should fix #517653 Daniel svn path=/trunk/; revision=3726
* patch from Arnold Hendriks improving parsing of html within html bogusDaniel Veillard2008-03-121-15/+16
| | | | | | | | * HTMLparser.c: patch from Arnold Hendriks improving parsing of html within html bogus data, still not a complete fix though Daniel svn path=/trunk/; revision=3704
* avoid stopping parsing when encountering out of range characters in anDaniel Veillard2008-01-111-2/+7
| | | | | | | | | * HTMLparser.c: avoid stopping parsing when encountering out of range characters in an HTML file, report and continue processing instead, should fix #472696 Daniel svn path=/trunk/; revision=3675
* fix definition for <embed> to avoid error when saving back, patch fromDaniel Veillard2008-01-111-1/+1
| | | | | | | | * HTMLparser.c: fix definition for <embed> to avoid error when saving back, patch from Stefan Behnel fixing 495213 Daniel svn path=/trunk/; revision=3671
* fixed bug #381877, avoid reading over the end of stream when generating anDaniel Veillard2007-06-121-3/+7
| | | | | | | | * HTMLparser.c: fixed bug #381877, avoid reading over the end of stream when generating an UTF-8 encoding error. Daniel svn path=/trunk/; revision=3627
* applied patch from Michael Day to add support for <embed> DanielDaniel Veillard2007-05-021-2/+6
| | | | | | | * HTMLparser.c: applied patch from Michael Day to add support for <embed> Daniel svn path=/trunk/; revision=3611
* Dohh !Daniel Veillard2007-04-271-1/+2
| | | | | | Daniel svn path=/trunk/; revision=3610
* Jean-Daniel Dupas pointed a couple of problems in htmlCreateDocParserCtxt.Daniel Veillard2007-04-261-2/+2
| | | | | | | | * HTMLparser.c: Jean-Daniel Dupas pointed a couple of problems in htmlCreateDocParserCtxt. Daniel svn path=/trunk/; revision=3609
* change the way script/style are parsed to not try to detect comments,Daniel Veillard2007-04-161-17/+1
| | | | | | | | | * HTMLparser.c: change the way script/style are parsed to not try to detect comments, reported by Mike Day * result/HTML/doc3.*: affects the result of that test Daniel svn path=/trunk/; revision=3598
* fixed memory access error on parsing of meta data which had errors (bugWilliam M. Brack2007-03-211-6/+6
| | | | | | | | * HTMLparser.c: fixed memory access error on parsing of meta data which had errors (bug #382206). Also cleaned up a few warnings by adding some additional DECL macros. svn path=/trunk/; revision=3593
* applied patch from Steven Rainwater to fix UTF8ToHtml behaviour on codeDaniel Veillard2006-11-231-6/+8
| | | | | | | * HTMLparser.c: applied patch from Steven Rainwater to fix UTF8ToHtml behaviour on code points which are not mappable to predefined HTML entities, fixes #377544 Daniel
* change htmlCtxtReset() following Michael Day bug report and suggestion.Daniel Veillard2006-11-081-1/+1
| | | | | | * HTMLparser.c: change htmlCtxtReset() following Michael Day bug report and suggestion. Daniel
* applied a reworked version of Usamah Malik patch to avoid growing theDaniel Veillard2006-10-271-0/+58
| | | | | | | * HTMLparser.c: applied a reworked version of Usamah Malik patch to avoid growing the parser stack in some autoclose cases, should fix #361221 Daniel
* fix one problem found in htmlCtxtUseOptions() and pointed in #340591Daniel Veillard2006-10-171-0/+1
| | | | | | * HTMLparser.c: fix one problem found in htmlCtxtUseOptions() and pointed in #340591 Daniel
* fixed teh 2 stupid bugs affecting htmlReadDoc() and htmlReadIO() thisDaniel Veillard2006-10-171-3/+4
| | | | | | * HTMLparser.c: fixed teh 2 stupid bugs affecting htmlReadDoc() and htmlReadIO() this should fix #340322 Daniel
* fixing HTML minimized attribute values to be generated internally if notDaniel Veillard2006-10-171-6/+5
| | | | | | | | | * HTMLparser.c: fixing HTML minimized attribute values to be generated internally if not present, fixes bug #332124 * result/HTML/doc2.htm.sax result/HTML/doc3.htm.sax result/HTML/wired.html.sax: this affects the SAX event strem for a few test cases Daniel
* fixing HTML entities in attributes parsing bug #362552 added to theDaniel Veillard2006-10-171-1/+1
| | | | | | | * HTMLparser.c: fixing HTML entities in attributes parsing bug #362552 * result/HTML/entities2.html* test/HTML/entities2.html: added to the regression suite Daniel
* fix #348252 if the document clains to be in a different encoding in theDaniel Veillard2006-10-161-1/+12
| | | | | | | * HTMLparser.c: fix #348252 if the document clains to be in a different encoding in the meta tag and it's obviously wrong, don't screw up the end of the content. Daniel
* fix a chunking and script bug #347708 DanielDaniel Veillard2006-10-161-4/+13
| | | | | * HTMLparser.c: fix a chunking and script bug #347708 Daniel
* remove a warning check with uppercase for AIX iconv() should fix #352644Daniel Veillard2006-10-161-1/+1
| | | | | | | * HTMLparser.c: remove a warning * encoding.c: check with uppercase for AIX iconv() should fix #352644 * doc/examples/Makefile.am: partially handle one bug report Daniel
* added --html --memory to test htmlReadMemory to test #321632 added variousDaniel Veillard2006-10-131-0/+7
| | | | | | | | | | | * xmllint.c: added --html --memory to test htmlReadMemory to test #321632 * HTMLparser.c: added various initialization calls which may help #321632 but not conclusive * testapi.c tree.c include/libxml/tree.h: fixed compilation with --with-minimum --with-sax1 and --with-minimum --with-schemas fixing #326442 Daniel
* exports htmlNewParserCtxt() as Michael Day pointed out this is needed toDaniel Veillard2006-09-211-2/+2
| | | | | | * HTMLparser.c include/libxml/HTMLparser.h: exports htmlNewParserCtxt() as Michael Day pointed out this is needed to use htmlCtxtRead*() Daniel
* applied const'ification of strings patch from Matthias Clasen DanielDaniel Veillard2006-07-031-90/+91
| | | | | | * HTMLparser.c: applied const'ification of strings patch from Matthias Clasen Daniel
* a bunch of small cleanups based on coverity reports. DanielDaniel Veillard2006-03-091-3/+6
| | | | | | * HTMLparser.c parser.c parserInternals.c pattern.c uri.c: a bunch of small cleanups based on coverity reports. Daniel
* try to fix xmlParseInNodeContext when operating on an HTML document.Daniel Veillard2006-01-181-2/+13
| | | | | | * HTMLparser.c libxml.h parser.c: try to fix xmlParseInNodeContext when operating on an HTML document. Daniel
* fixed a number of warnings shown by HP-UX compiler and reported by RickDaniel Veillard2005-12-101-4/+4
| | | | | | | * HTMLparser.c configure.in parserInternals.c runsuite.c runtest.c testapi.c xmlschemas.c xmlschemastypes.c xmlstring.c: fixed a number of warnings shown by HP-UX compiler and reported by Rick Jones Daniel
* script HTML parser error fix, corrects bug #319715 added test from MichaelDaniel Veillard2005-10-251-0/+1
| | | | | | | * HTMLparser.c: script HTML parser error fix, corrects bug #319715 * result/HTML/53867* test/HTML/53867.html: added test from Michael Day to the regression suite Daniel
* typo fix from Michael Day DanielDaniel Veillard2005-10-251-1/+1
| | | | | * HTMLparser.c: typo fix from Michael Day Daniel
* Applied the last patch from Gary Coady for #304637 changing the behaviourDaniel Veillard2005-09-011-3/+12
| | | | | | | * HTMLparser.c: Applied the last patch from Gary Coady for #304637 changing the behaviour when text nodes are found in body * result/HTML/*: this changes the output of some tests Daniel
* added a parser XML_PARSE_COMPACT option to allocate small text nodes (lessDaniel Veillard2005-08-251-0/+4
| | | | | | | | | | | * HTMLparser.c parser.c SAX2.c debugXML.c tree.c valid.c xmlreader.c xmllint.c include/libxml/HTMLparser.h include/libxml/parser.h: added a parser XML_PARSE_COMPACT option to allocate small text nodes (less than 8 bytes on 32bits, less than 16bytes on 64bits) directly within the node, various changes to cope with this. * result/XPath/tests/* result/XPath/xptr/* result/xmlid/*: this slightly change the output Daniel