Log

Author Commit Date CI Message
Nick Wellnhofer f77ec16d 2024-09-12T01:45:34 html: Optimize htmlParseCharData
Nick Wellnhofer 440bd64c 2024-09-12T04:01:38 html: Optimize htmlParseHTMLName
Nick Wellnhofer 6040785a 2024-09-12T23:12:01 html: Deprecate AutoClose API
Nick Wellnhofer 76cc6394 2024-09-12T01:43:42 test: Fix XML_PARSE_HTML constant
Nick Wellnhofer 575be6c1 2024-09-12T01:40:07 html: Fix line numbers with CRs
Nick Wellnhofer 188cad68 2024-09-12T02:51:20 html: Remove obsolete content model
Nick Wellnhofer be874d78 2024-09-11T19:47:07 html: Ignore unexpected DOCTYPE declarations
Nick Wellnhofer 462bf0b7 2024-09-11T19:06:06 html: Rework options Introduce htmlCtxtSetOptions, see similar changes made to XML parser. Add HTML_PARSE_HUGE alias. Support HTML_PARSE_BIG_LINES.
Nick Wellnhofer c34d0ae9 2024-09-12T23:50:20 html: Deprecate htmlIsBooleanAttr
Nick Wellnhofer 0144f662 2024-09-12T02:30:10 html: Remove obsolete code
Nick Wellnhofer 16de1346 2024-09-11T19:05:38 parser: Make new options actually work
Nick Wellnhofer 42c3823d 2024-09-11T19:05:09 html: Update comment
Nick Wellnhofer 0ce7bfe5 2024-09-12T01:44:18 html: Try to avoid passing XML options to HTML parser
Nick Wellnhofer 9f04cce6 2024-09-11T17:43:07 html: Remove unused or useless return codes htmlParseStartTag should always succeed (except for malloc failures).
Nick Wellnhofer e179f3ec 2024-09-11T17:29:59 html: Stop reporting syntax errors It doesn't make much sense to keep the old syntax error handling which doesn't conform to HTML5. Handling HTML5 parser errors is rather involved and not essential for parsers.
Nick Wellnhofer c6af1017 2024-09-08T20:45:48 html: Test tokenizer against html5lib test suite
Nick Wellnhofer 0d324bde 2024-09-07T03:45:09 html: Simplify node info accounting
Nick Wellnhofer ccb61f59 2024-09-07T03:15:50 html: Remove duplicate calls to htmlAutoClose
Nick Wellnhofer 27752f75 2024-09-11T15:06:55 html: Fix EOF handling in start tags
Nick Wellnhofer e1834745 2024-09-07T00:54:25 html: Add character data tests
Nick Wellnhofer f9ed30e9 2024-09-06T17:49:04 html: HTML5 character data states
Nick Wellnhofer b19d3539 2024-09-11T15:03:49 html: Fix EOF handling in comments
Nick Wellnhofer 59511792 2024-09-03T15:52:44 html: Parse named character references according to HTML5
Nick Wellnhofer d5cd0f07 2022-07-15T17:00:36 html: Prefer SKIP(1) over NEXT in HTML parser Use SKIP(1) where it's safe to avoid a function call.
Nick Wellnhofer 17e56ac5 2024-09-11T14:24:58 html: Fix parsing of end tags
Nick Wellnhofer dc2d4983 2023-05-04T17:47:38 html: Rework htmlLookupSequence Rename to htmlLookupString and use strstr for increased performance.
Nick Wellnhofer 637215a4 2023-05-04T17:16:51 html: Always terminate doctype declarations on '>' Align with HTML5 spec. This allows to remove the old quote handling in htmlLookupSequence.
Nick Wellnhofer 24a09033 2024-09-09T02:53:14 html: Fix bogus end tags
Nick Wellnhofer 72e29f9a 2023-05-04T17:03:22 html: Fix quadratic behavior in push parser Fix quadratic behavior related to unquoted attribute values. We really have to replicate parts of the HTML5 state machine to find the end of tags relibably. Fixes #533.
Nick Wellnhofer a80f8b64 2023-05-04T15:59:31 html: Allow attributes in end tags Attribute are syntactically allowed in HTML5 end tags but otherwise ignored.
Nick Wellnhofer bca64854 2024-09-09T02:30:18 html: Allow U+000C FORM FEED as whitespace
Nick Wellnhofer f2272c23 2023-05-04T15:33:27 html: Handle unexpected-solidus-in-tag according to HTML5
Nick Wellnhofer 939b53ee 2023-05-04T15:25:24 html: Stop skipping tag content Tag and attributes names should always be parsed succesfully now.
Nick Wellnhofer 6edf1a64 2024-09-09T02:09:20 html: Fix DOCTYPE parsing
Nick Wellnhofer dcb2abb2 2023-05-04T15:16:29 html: Parse tag and attribute names according to HTML5 HTML5 allows bascially all characters in tag and attribute names.
Nick Wellnhofer 9678163f 2024-09-09T02:01:19 html: Don't check for valid XML characters
Nick Wellnhofer a6955c13 2024-09-08T23:19:49 html: Parse numeric character references according to HTML5
Nick Wellnhofer 4eeac309 2024-09-08T22:20:20 html: Start to fix EOF and U+0000 handling
Nick Wellnhofer e062a4a9 2024-09-08T20:40:36 html: Add HTML5 parser option This option passes tokenizer output directly to the SAX callbacks, making it possible to test the tokenizer against the html5lib test suite. This will produce unbalanced calls to the startElement and endElement callbacks, but it's the only way to support a SAX like interface for HTML5. It can be used for filtering or rewriting HTML5, for example. A HTML5 tree builder could then be implemented on top of the SAX callbacks.
Nick Wellnhofer 17da54c5 2024-09-08T19:16:12 html: Normalize newlines
Nick Wellnhofer 341dc78f 2024-09-08T19:11:14 html: Deduplicate code in htmlCurrentChar
Nick Wellnhofer 3adb396d 2024-09-07T15:18:13 html: Parse bogus comments instead of ignoring them Also treat XML processing instructions as bogus comments.
Nick Wellnhofer 84440175 2024-09-07T14:21:12 html: Add missing calls to htmlCheckParagraph()
Nick Wellnhofer 86d6b9b0 2024-09-07T04:18:06 html: Deduplicate some code
Nick Wellnhofer d67833a3 2024-09-26T19:21:24 xmllint: Use proper type to store seconds since epoch Should avoid year 2038 problem. Fixes #801.
correctmost 81d38ed0 2024-09-25T07:52:10 meson: Fix duplicate listing of libxml2.devhelp2 The duplication caused a warning when uninstalling.
Nick Wellnhofer b1c5aa65 2024-09-19T12:50:59 xpath: Deprecate xmlXPathNAN and xmlXPath*INF Users should simply use the C99 macros.
Nick Wellnhofer 55ddccb6 2024-09-14T00:03:56 io: Make sure not to pass partial UTF-8 to write callback We cannot split UTF-8 at arbitrary boundaries.
Nick Wellnhofer c46b89e2 2024-09-13T21:06:36 xpath: Deprecate xmlXPathEvalExpr Also check the argument instead of crashing if there's no context.
Nick Wellnhofer 03f1bdd2 2024-09-13T20:59:47 xpath: Make recursion check work with xmlXPathCompile The check for maximum recursion depth required a parser context with an xmlXPathContext which xmlXPathCompile didn't provide. All other code should already set up or require an xmlXPathContext.
Nick Wellnhofer dae160c6 2024-09-13T12:08:20 encoding: Fix table entry for "UTF16"
Nick Wellnhofer 5e787401 2024-09-10T17:12:25 save: Make xmlEscapeTab signed Fixes issues in platforms where char is unsigned. Fixes #797.
Nick Wellnhofer 6e503eb7 2024-09-10T03:32:37 encoding: Handle more ICU error codes U_ILLEGAL_ESCAPE_SEQUENCE and U_UNSUPPORTED_ESCAPE_SEQUENCE can occur with ISO-2022.
Nick Wellnhofer 55d36c59 2024-09-10T03:11:18 encoding: Fix error code in xmlUconvConvert Broke in 46ec621e.
Nick Wellnhofer de10d4cd 2024-09-04T16:19:42 include: Check whether _MSC_VER is defined Should fix #795.
Nick Wellnhofer bd9eed46 2024-09-02T18:37:41 parser: Make unsupported encodings an error in declarations This was changed in 45157261, but in encoding declarations, unsupported encodings should raise a fatal error. Fixes #794.
Nick Wellnhofer 40abebbc 2024-08-29T01:02:33 python: Fix SAX driver with character streams This apparently broke with Python 3.5 which introduced character streams. Fixes #790.
Nick Wellnhofer 8ae06d52 2024-08-29T00:07:27 SAX2: Don't merge CDATA sections The Document Object Model (DOM) Level 3 Core Specification says: > Adjacent CDATASection nodes are not merged by use of the normalize > method of the Node interface. Fixes #412.
Nick Wellnhofer dde62ae5 2024-08-28T23:58:20 parser: Align push parsing of CDATA sections with pull parser Remove special handling of CDATA sections in push parser. This makes sure that only a single callback is generated for large sections. Fixes #22 and needed for #412.
Nick Wellnhofer 4d10e53a 2024-08-28T22:47:20 parser: Make sure to set and increment input id Revert part of commits 410931e3 and b9d2f3c9.
Nick Wellnhofer 6d365ca0 2024-08-28T22:09:30 doc: XML_PARSE_NO_XXE is available since 2.13.0
Nick Wellnhofer 8ad618d2 2024-08-28T22:03:30 doc: Document all xmllint options Remove --pushsmall. Fixes #785.
triallax 67ff748c 2024-08-26T23:53:29 io: don't set the executable bit when creating files Issue seems to have been introduced in 0bef93bf24def68c448af0e71844b942e0ed93ec.
Nick Wellnhofer 0bb0012e 2024-08-19T21:15:28 catalog: Set xmlCatalogInitialized after setting default catalog
Nick Wellnhofer 8625db28 2024-08-19T20:02:47 xmlcatalog: Improved fix for #699
Nick Wellnhofer 4b007878 2024-08-19T20:44:07 Revert "catalog: Fetch XML catalog before dumping" This reverts commit 723b4de04015c5acccd3cda5dd60db7d00702064.
Nick Wellnhofer 57b92cab 2024-08-19T20:35:54 catalog: Fix regression in xmlCatalogAdd Don't load system catalog.
Nick Wellnhofer 2abb9033 2024-08-19T17:58:47 tests: Add fallback for glob() Fix build on platforms without glob().
Nick Wellnhofer 567f612d 2024-08-19T17:43:32 build: Check for declaration of glob() function Don't rely on presence of glob.h.
Nick Wellnhofer 2191ccdf 2024-08-18T12:16:34 autotools: Fix EXTRA_DIST Fixes #786.
makise-homura 5c608609 2024-08-16T22:35:54 Eliminate false positive -Warray-bounds on some compilers This affects GCC 7.3, MinGW 7.3, and LCC 1.25 and 1.26. In some sense it reverts ad93f087.
makise-homura a3043b47 2024-08-14T23:40:16 threads: define _WIN32_WINNT as 0x0600 to use InitOnceExecuteOnce()
makise-homura f59da1a3 2024-08-14T23:22:59 hash: Suppress GCC 7.3 and MINGW maybe-uninitialized warning
makise-homura 103aadbc 2024-08-14T23:15:30 parser: Suppress EDG maybe-uninitialized warning
Nick Wellnhofer f2c48847 2024-08-13T14:38:07 io: Add missing calls to xmlInitParser This is required after c9a46a91. Should fix #782.
Nick Wellnhofer 0c56eb82 2024-08-12T21:38:50 tree: Restore return value of xmlNodeListGetString with NULL list When passing a NULL list to xmlNodeListGetString or xmlNodeListGetRawString, return NULL instead of "" to match the old behavior. Fixes #783.
Taylor R Campbell b45a0f0e 2024-08-06T22:34:29 nanohttp: Avoid ctype(3) misuse. fix https://gitlab.gnome.org/GNOME/libxml2/-/issues/780
Nick Wellnhofer 1d009fe3 2024-08-05T15:14:21 parser: Report at least one fatal error
Nick Wellnhofer bfed6e6a 2024-08-05T14:58:37 parser: Fix error handling after reaching limit Mark document as non-wellformed and stop parser even if error limit was reached. Regressed in abd74186.
Nick Wellnhofer 6e1e22dc 2024-08-04T11:45:05 malloc-fail: Fix null-deref in xmlPatPushState
Nick Wellnhofer a530ff12 2024-07-29T14:18:57 io: Always consume encoding handler when creating output buffers Also free encoding handler in error case. Remove xmlAllocOutputBufferInternal which was identical to xmlAllocOutputBuffer.
Nick Wellnhofer 36ea881b 2024-07-26T18:07:27 malloc-fail: Fix memory leak in xmlOutputBufferCreateFilename Close encoding handler on error.
Nick Wellnhofer 02fcb1ef 2024-07-25T17:07:18 parser: Make xmlParseChunk return an error if parser was stopped This regressed after enhancing the disableSAX member in 2.13. Should fix #777.
Nick Wellnhofer bc14d70f 2024-07-25T00:26:48 xmlsave: Improve "unsupported encoding" error message Incomplete support of XML_SAVE_* error codes was removed. Error handling still needs work. xmlOutputBufferCreateFilename should return an error code.
Nick Wellnhofer 1a893230 2024-07-06T01:03:46 [CVE-2024-40896] Fix XXE protection in downstream code Some users set an entity's children manually in the getEntity SAX callback to restrict entity expansion. This stopped working after renaming the "checked" member of xmlEntity, making at least one downstream project and its dependants susceptible to XXE attacks. See #761.
Nick Wellnhofer 6cc2387e 2024-07-24T11:42:39 shell: Only use readline on terminals Should fix xmllint shell tests.
Nick Wellnhofer d04e152d 2024-07-24T11:40:01 shell: Remove access(2) checks The checks didn't handle stdout. Since unistd.h wasn't included, W_OK wasn't defined and the checks wouldn't run anyway.
Nick Wellnhofer aa6ca0b1 2024-07-23T03:29:41 module: Deprecate module API This was only used by libxslt which switched to a private implementation.
Nick Wellnhofer ec4340b8 2024-07-23T01:32:43 Update NEWS
Nick Wellnhofer e1291059 2024-07-22T22:22:48 build: Don't check for pthread.h The Autotools build still checks for pthread.h (and zlib.h and lzma.h) but the result isn't needed in config.h.
Nick Wellnhofer cc03c069 2024-07-22T22:15:09 build: Don't check for standard POSIX headers These headers should be available since early POSIX days and are required no matter what.
Nick Wellnhofer d7dc2eaf 2024-07-22T21:39:54 build: Don't check for dlfcn.h and dl.h It's enough to check for symbols.
Nick Wellnhofer 7c10393f 2024-07-22T17:25:15 build: Fix config.h macros We have to emulate AC_CHECK_DECLS. Instead of leaving the macro undefined if a symbol wasn't found, it has to be set to 0.
Nick Wellnhofer 095b3c7f 2024-07-22T15:37:13 cmake: Implement READLINE and HISTORY options
Nick Wellnhofer 0172ffa9 2024-07-22T15:21:36 build: Only check for required headers
Nick Wellnhofer 3ef66611 2024-07-22T14:58:16 build: Rework mmap checks Switch to AC_CHECK_DECLS/check_symbol_exists. Don't check for sys/mman.h separately. Don't check for munmap.
Nick Wellnhofer e1657f3f 2024-07-22T14:33:14 build: Use AC_CHECK_DECLS/check_symbol_exists for getentropy This assumes that getentropy is declared in sys/random.h. Should fix issues on iOS. See #774.
Nick Wellnhofer 278fcf13 2024-07-22T13:02:07 buf: Limit xmlBuffer size to INT_MAX Some API functions like xmlBufUse only support ints.
Nick Wellnhofer 6a3c0b0d 2024-07-22T12:53:00 parser: Increase XML_MAX_DICTIONARY_LIMIT This limit is somewhat arbitrary and can be reached when fuzzing documents up to 1 MB. Increase limit to 100 MB and disable limit if XML_PARSE_HUGE is set.
Nick Wellnhofer d2755cdb 2024-07-22T12:35:29 buf: Fix memory leak if malloc fails before xmlBufBackToBuffer Short-lived regression.