Log

Author Commit Date CI Message
Nick Wellnhofer b52a3044 2024-10-24T18:18:47 parser: Use counted_by attribute if supported We only have a single struct with a flexible array member.
Nick Wellnhofer 944e5fe8 2024-10-23T16:46:03 nanohttp: Fix another stdout file descriptor
Nick Wellnhofer 607ada90 2024-10-23T14:19:01 nanohttp: Fix stdout file descriptor Fixes #813.
Nick Wellnhofer b7c0f9d2 2024-10-19T14:26:39 string: Fix va_copy fallback Fix va_copy fallback reworked in 5cffba83. Should fix #812.
Nick Wellnhofer a870088f 2024-10-14T19:58:23 xpath: Hide internal sort functions
Yegor Yefremov 51394929 2024-10-15T11:11:38 python/tests: fix typos Typos were found with codespell.
Nick Wellnhofer f9a6469a 2024-10-14T16:14:55 Update NEWS
Satadru Pramanik c7b27866 2024-10-12T11:55:50 Avoid Python 'licence' distribution option is deprecated; use 'license' error
Nick Wellnhofer bf3619c3 2024-10-10T12:14:47 fuzz: Don't unlink DTD when replacing nodes OP_XML_REPLACE_NODE needs the same check as OP_XML_UNLINK_NODE.
Nick Wellnhofer 24a6149f 2024-09-15T19:18:40 html: Make sure that character data mode is reset
Nick Wellnhofer 9f04cce6 2024-09-11T17:43:07 html: Remove unused or useless return codes htmlParseStartTag should always succeed (except for malloc failures).
Nick Wellnhofer e179f3ec 2024-09-11T17:29:59 html: Stop reporting syntax errors It doesn't make much sense to keep the old syntax error handling which doesn't conform to HTML5. Handling HTML5 parser errors is rather involved and not essential for parsers.
Nick Wellnhofer c32397d5 2024-09-12T22:39:05 html: Improve character class macros
Nick Wellnhofer e8406554 2024-09-12T15:21:03 html: Rewrite parsing of most data
Nick Wellnhofer f77ec16d 2024-09-12T01:45:34 html: Optimize htmlParseCharData
Nick Wellnhofer a4c16a14 2024-09-27T23:49:02 xmllint: Improve --memory and --testIO options Support --memory and --testIO in SAX mode. Keep memory-mapped file across repetitions. Options `--sax --memory --noout --repeat` can now be used to benchmark the core parser without building a DOM tree or repeatedly reading files from disk.
Nick Wellnhofer 440bd64c 2024-09-12T04:01:38 html: Optimize htmlParseHTMLName
Nick Wellnhofer c34d0ae9 2024-09-12T23:50:20 html: Deprecate htmlIsBooleanAttr
Nick Wellnhofer 3ac214f0 2024-09-27T22:54:14 xmllint: Support --html --sax
Nick Wellnhofer 6040785a 2024-09-12T23:12:01 html: Deprecate AutoClose API
Nick Wellnhofer 188cad68 2024-09-12T02:51:20 html: Remove obsolete content model
Nick Wellnhofer 225ed707 2024-09-26T22:38:24 html: Accelerate htmlParseCharData
Nick Wellnhofer 0144f662 2024-09-12T02:30:10 html: Remove obsolete code
Nick Wellnhofer 0ce7bfe5 2024-09-12T01:44:18 html: Try to avoid passing XML options to HTML parser
Nick Wellnhofer 74dfc49b 2024-09-26T21:24:00 parser: Clarify logic in xmlParseStartTag2
Nick Wellnhofer 76cc6394 2024-09-12T01:43:42 test: Fix XML_PARSE_HTML constant
Nick Wellnhofer 575be6c1 2024-09-12T01:40:07 html: Fix line numbers with CRs
Nick Wellnhofer 20799979 2024-09-26T17:09:40 html: Handle numeric character references directly
Nick Wellnhofer be874d78 2024-09-11T19:47:07 html: Ignore unexpected DOCTYPE declarations
Nick Wellnhofer 462bf0b7 2024-09-11T19:06:06 html: Rework options Introduce htmlCtxtSetOptions, see similar changes made to XML parser. Add HTML_PARSE_HUGE alias. Support HTML_PARSE_BIG_LINES.
Nick Wellnhofer 0bc4608c 2024-09-15T20:28:49 html: Use hash table to check for duplicate attributes
Nick Wellnhofer 16de1346 2024-09-11T19:05:38 parser: Make new options actually work
Nick Wellnhofer 42c3823d 2024-09-11T19:05:09 html: Update comment
Nick Wellnhofer c6af1017 2024-09-08T20:45:48 html: Test tokenizer against html5lib test suite
Nick Wellnhofer 27752f75 2024-09-11T15:06:55 html: Fix EOF handling in start tags
Nick Wellnhofer b19d3539 2024-09-11T15:03:49 html: Fix EOF handling in comments
Nick Wellnhofer 17e56ac5 2024-09-11T14:24:58 html: Fix parsing of end tags
Nick Wellnhofer 24a09033 2024-09-09T02:53:14 html: Fix bogus end tags
Nick Wellnhofer bca64854 2024-09-09T02:30:18 html: Allow U+000C FORM FEED as whitespace
Nick Wellnhofer 6edf1a64 2024-09-09T02:09:20 html: Fix DOCTYPE parsing
Nick Wellnhofer 9678163f 2024-09-09T02:01:19 html: Don't check for valid XML characters
Nick Wellnhofer a6955c13 2024-09-08T23:19:49 html: Parse numeric character references according to HTML5
Nick Wellnhofer 4eeac309 2024-09-08T22:20:20 html: Start to fix EOF and U+0000 handling
Nick Wellnhofer e062a4a9 2024-09-08T20:40:36 html: Add HTML5 parser option This option passes tokenizer output directly to the SAX callbacks, making it possible to test the tokenizer against the html5lib test suite. This will produce unbalanced calls to the startElement and endElement callbacks, but it's the only way to support a SAX like interface for HTML5. It can be used for filtering or rewriting HTML5, for example. A HTML5 tree builder could then be implemented on top of the SAX callbacks.
Nick Wellnhofer 17da54c5 2024-09-08T19:16:12 html: Normalize newlines
Nick Wellnhofer 341dc78f 2024-09-08T19:11:14 html: Deduplicate code in htmlCurrentChar
Nick Wellnhofer 3adb396d 2024-09-07T15:18:13 html: Parse bogus comments instead of ignoring them Also treat XML processing instructions as bogus comments.
Nick Wellnhofer 84440175 2024-09-07T14:21:12 html: Add missing calls to htmlCheckParagraph()
Nick Wellnhofer 86d6b9b0 2024-09-07T04:18:06 html: Deduplicate some code
Nick Wellnhofer 0d324bde 2024-09-07T03:45:09 html: Simplify node info accounting
Nick Wellnhofer ccb61f59 2024-09-07T03:15:50 html: Remove duplicate calls to htmlAutoClose
Nick Wellnhofer e1834745 2024-09-07T00:54:25 html: Add character data tests
Nick Wellnhofer f9ed30e9 2024-09-06T17:49:04 html: HTML5 character data states
Nick Wellnhofer 59511792 2024-09-03T15:52:44 html: Parse named character references according to HTML5
Nick Wellnhofer d5cd0f07 2022-07-15T17:00:36 html: Prefer SKIP(1) over NEXT in HTML parser Use SKIP(1) where it's safe to avoid a function call.
Nick Wellnhofer dc2d4983 2023-05-04T17:47:38 html: Rework htmlLookupSequence Rename to htmlLookupString and use strstr for increased performance.
Nick Wellnhofer 637215a4 2023-05-04T17:16:51 html: Always terminate doctype declarations on '>' Align with HTML5 spec. This allows to remove the old quote handling in htmlLookupSequence.
Nick Wellnhofer 72e29f9a 2023-05-04T17:03:22 html: Fix quadratic behavior in push parser Fix quadratic behavior related to unquoted attribute values. We really have to replicate parts of the HTML5 state machine to find the end of tags relibably. Fixes #533.
Nick Wellnhofer a80f8b64 2023-05-04T15:59:31 html: Allow attributes in end tags Attribute are syntactically allowed in HTML5 end tags but otherwise ignored.
Nick Wellnhofer f2272c23 2023-05-04T15:33:27 html: Handle unexpected-solidus-in-tag according to HTML5
Nick Wellnhofer 939b53ee 2023-05-04T15:25:24 html: Stop skipping tag content Tag and attributes names should always be parsed succesfully now.
Nick Wellnhofer dcb2abb2 2023-05-04T15:16:29 html: Parse tag and attribute names according to HTML5 HTML5 allows bascially all characters in tag and attribute names.
Nick Wellnhofer d67833a3 2024-09-26T19:21:24 xmllint: Use proper type to store seconds since epoch Should avoid year 2038 problem. Fixes #801.
correctmost 81d38ed0 2024-09-25T07:52:10 meson: Fix duplicate listing of libxml2.devhelp2 The duplication caused a warning when uninstalling.
Nick Wellnhofer b1c5aa65 2024-09-19T12:50:59 xpath: Deprecate xmlXPathNAN and xmlXPath*INF Users should simply use the C99 macros.
Nick Wellnhofer 55ddccb6 2024-09-14T00:03:56 io: Make sure not to pass partial UTF-8 to write callback We cannot split UTF-8 at arbitrary boundaries.
Nick Wellnhofer c46b89e2 2024-09-13T21:06:36 xpath: Deprecate xmlXPathEvalExpr Also check the argument instead of crashing if there's no context.
Nick Wellnhofer 03f1bdd2 2024-09-13T20:59:47 xpath: Make recursion check work with xmlXPathCompile The check for maximum recursion depth required a parser context with an xmlXPathContext which xmlXPathCompile didn't provide. All other code should already set up or require an xmlXPathContext.
Nick Wellnhofer dae160c6 2024-09-13T12:08:20 encoding: Fix table entry for "UTF16"
Nick Wellnhofer 5e787401 2024-09-10T17:12:25 save: Make xmlEscapeTab signed Fixes issues in platforms where char is unsigned. Fixes #797.
Nick Wellnhofer 6e503eb7 2024-09-10T03:32:37 encoding: Handle more ICU error codes U_ILLEGAL_ESCAPE_SEQUENCE and U_UNSUPPORTED_ESCAPE_SEQUENCE can occur with ISO-2022.
Nick Wellnhofer 55d36c59 2024-09-10T03:11:18 encoding: Fix error code in xmlUconvConvert Broke in 46ec621e.
Nick Wellnhofer de10d4cd 2024-09-04T16:19:42 include: Check whether _MSC_VER is defined Should fix #795.
Nick Wellnhofer bd9eed46 2024-09-02T18:37:41 parser: Make unsupported encodings an error in declarations This was changed in 45157261, but in encoding declarations, unsupported encodings should raise a fatal error. Fixes #794.
Nick Wellnhofer 40abebbc 2024-08-29T01:02:33 python: Fix SAX driver with character streams This apparently broke with Python 3.5 which introduced character streams. Fixes #790.
Nick Wellnhofer 8ae06d52 2024-08-29T00:07:27 SAX2: Don't merge CDATA sections The Document Object Model (DOM) Level 3 Core Specification says: > Adjacent CDATASection nodes are not merged by use of the normalize > method of the Node interface. Fixes #412.
Nick Wellnhofer dde62ae5 2024-08-28T23:58:20 parser: Align push parsing of CDATA sections with pull parser Remove special handling of CDATA sections in push parser. This makes sure that only a single callback is generated for large sections. Fixes #22 and needed for #412.
Nick Wellnhofer 4d10e53a 2024-08-28T22:47:20 parser: Make sure to set and increment input id Revert part of commits 410931e3 and b9d2f3c9.
Nick Wellnhofer 6d365ca0 2024-08-28T22:09:30 doc: XML_PARSE_NO_XXE is available since 2.13.0
Nick Wellnhofer 8ad618d2 2024-08-28T22:03:30 doc: Document all xmllint options Remove --pushsmall. Fixes #785.
triallax 67ff748c 2024-08-26T23:53:29 io: don't set the executable bit when creating files Issue seems to have been introduced in 0bef93bf24def68c448af0e71844b942e0ed93ec.
Nick Wellnhofer 0bb0012e 2024-08-19T21:15:28 catalog: Set xmlCatalogInitialized after setting default catalog
Nick Wellnhofer 8625db28 2024-08-19T20:02:47 xmlcatalog: Improved fix for #699
Nick Wellnhofer 4b007878 2024-08-19T20:44:07 Revert "catalog: Fetch XML catalog before dumping" This reverts commit 723b4de04015c5acccd3cda5dd60db7d00702064.
Nick Wellnhofer 57b92cab 2024-08-19T20:35:54 catalog: Fix regression in xmlCatalogAdd Don't load system catalog.
Nick Wellnhofer 2abb9033 2024-08-19T17:58:47 tests: Add fallback for glob() Fix build on platforms without glob().
Nick Wellnhofer 567f612d 2024-08-19T17:43:32 build: Check for declaration of glob() function Don't rely on presence of glob.h.
Nick Wellnhofer 2191ccdf 2024-08-18T12:16:34 autotools: Fix EXTRA_DIST Fixes #786.
makise-homura 5c608609 2024-08-16T22:35:54 Eliminate false positive -Warray-bounds on some compilers This affects GCC 7.3, MinGW 7.3, and LCC 1.25 and 1.26. In some sense it reverts ad93f087.
makise-homura a3043b47 2024-08-14T23:40:16 threads: define _WIN32_WINNT as 0x0600 to use InitOnceExecuteOnce()
makise-homura f59da1a3 2024-08-14T23:22:59 hash: Suppress GCC 7.3 and MINGW maybe-uninitialized warning
makise-homura 103aadbc 2024-08-14T23:15:30 parser: Suppress EDG maybe-uninitialized warning
Nick Wellnhofer f2c48847 2024-08-13T14:38:07 io: Add missing calls to xmlInitParser This is required after c9a46a91. Should fix #782.
Nick Wellnhofer 0c56eb82 2024-08-12T21:38:50 tree: Restore return value of xmlNodeListGetString with NULL list When passing a NULL list to xmlNodeListGetString or xmlNodeListGetRawString, return NULL instead of "" to match the old behavior. Fixes #783.
Taylor R Campbell b45a0f0e 2024-08-06T22:34:29 nanohttp: Avoid ctype(3) misuse. fix https://gitlab.gnome.org/GNOME/libxml2/-/issues/780
Nick Wellnhofer 1d009fe3 2024-08-05T15:14:21 parser: Report at least one fatal error
Nick Wellnhofer bfed6e6a 2024-08-05T14:58:37 parser: Fix error handling after reaching limit Mark document as non-wellformed and stop parser even if error limit was reached. Regressed in abd74186.
Nick Wellnhofer 6e1e22dc 2024-08-04T11:45:05 malloc-fail: Fix null-deref in xmlPatPushState
Nick Wellnhofer a530ff12 2024-07-29T14:18:57 io: Always consume encoding handler when creating output buffers Also free encoding handler in error case. Remove xmlAllocOutputBufferInternal which was identical to xmlAllocOutputBuffer.
Nick Wellnhofer 36ea881b 2024-07-26T18:07:27 malloc-fail: Fix memory leak in xmlOutputBufferCreateFilename Close encoding handler on error.