|
b52a3044
|
2024-10-24T18:18:47
|
|
parser: Use counted_by attribute if supported
We only have a single struct with a flexible array member.
|
|
944e5fe8
|
2024-10-23T16:46:03
|
|
nanohttp: Fix another stdout file descriptor
|
|
607ada90
|
2024-10-23T14:19:01
|
|
nanohttp: Fix stdout file descriptor
Fixes #813.
|
|
b7c0f9d2
|
2024-10-19T14:26:39
|
|
string: Fix va_copy fallback
Fix va_copy fallback reworked in 5cffba83.
Should fix #812.
|
|
a870088f
|
2024-10-14T19:58:23
|
|
xpath: Hide internal sort functions
|
|
51394929
|
2024-10-15T11:11:38
|
|
python/tests: fix typos
Typos were found with codespell.
|
|
f9a6469a
|
2024-10-14T16:14:55
|
|
Update NEWS
|
|
c7b27866
|
2024-10-12T11:55:50
|
|
Avoid Python 'licence' distribution option is deprecated; use 'license' error
|
|
bf3619c3
|
2024-10-10T12:14:47
|
|
fuzz: Don't unlink DTD when replacing nodes
OP_XML_REPLACE_NODE needs the same check as OP_XML_UNLINK_NODE.
|
|
24a6149f
|
2024-09-15T19:18:40
|
|
html: Make sure that character data mode is reset
|
|
9f04cce6
|
2024-09-11T17:43:07
|
|
html: Remove unused or useless return codes
htmlParseStartTag should always succeed (except for malloc failures).
|
|
e179f3ec
|
2024-09-11T17:29:59
|
|
html: Stop reporting syntax errors
It doesn't make much sense to keep the old syntax error handling which
doesn't conform to HTML5.
Handling HTML5 parser errors is rather involved and not essential for
parsers.
|
|
c32397d5
|
2024-09-12T22:39:05
|
|
html: Improve character class macros
|
|
e8406554
|
2024-09-12T15:21:03
|
|
html: Rewrite parsing of most data
|
|
f77ec16d
|
2024-09-12T01:45:34
|
|
html: Optimize htmlParseCharData
|
|
a4c16a14
|
2024-09-27T23:49:02
|
|
xmllint: Improve --memory and --testIO options
Support --memory and --testIO in SAX mode.
Keep memory-mapped file across repetitions.
Options `--sax --memory --noout --repeat` can now be used to benchmark
the core parser without building a DOM tree or repeatedly reading files
from disk.
|
|
440bd64c
|
2024-09-12T04:01:38
|
|
html: Optimize htmlParseHTMLName
|
|
c34d0ae9
|
2024-09-12T23:50:20
|
|
html: Deprecate htmlIsBooleanAttr
|
|
3ac214f0
|
2024-09-27T22:54:14
|
|
xmllint: Support --html --sax
|
|
6040785a
|
2024-09-12T23:12:01
|
|
html: Deprecate AutoClose API
|
|
188cad68
|
2024-09-12T02:51:20
|
|
html: Remove obsolete content model
|
|
225ed707
|
2024-09-26T22:38:24
|
|
html: Accelerate htmlParseCharData
|
|
0144f662
|
2024-09-12T02:30:10
|
|
html: Remove obsolete code
|
|
0ce7bfe5
|
2024-09-12T01:44:18
|
|
html: Try to avoid passing XML options to HTML parser
|
|
74dfc49b
|
2024-09-26T21:24:00
|
|
parser: Clarify logic in xmlParseStartTag2
|
|
76cc6394
|
2024-09-12T01:43:42
|
|
test: Fix XML_PARSE_HTML constant
|
|
575be6c1
|
2024-09-12T01:40:07
|
|
html: Fix line numbers with CRs
|
|
20799979
|
2024-09-26T17:09:40
|
|
html: Handle numeric character references directly
|
|
be874d78
|
2024-09-11T19:47:07
|
|
html: Ignore unexpected DOCTYPE declarations
|
|
462bf0b7
|
2024-09-11T19:06:06
|
|
html: Rework options
Introduce htmlCtxtSetOptions, see similar changes made to XML parser.
Add HTML_PARSE_HUGE alias. Support HTML_PARSE_BIG_LINES.
|
|
0bc4608c
|
2024-09-15T20:28:49
|
|
html: Use hash table to check for duplicate attributes
|
|
16de1346
|
2024-09-11T19:05:38
|
|
parser: Make new options actually work
|
|
42c3823d
|
2024-09-11T19:05:09
|
|
html: Update comment
|
|
c6af1017
|
2024-09-08T20:45:48
|
|
html: Test tokenizer against html5lib test suite
|
|
27752f75
|
2024-09-11T15:06:55
|
|
html: Fix EOF handling in start tags
|
|
b19d3539
|
2024-09-11T15:03:49
|
|
html: Fix EOF handling in comments
|
|
17e56ac5
|
2024-09-11T14:24:58
|
|
html: Fix parsing of end tags
|
|
24a09033
|
2024-09-09T02:53:14
|
|
html: Fix bogus end tags
|
|
bca64854
|
2024-09-09T02:30:18
|
|
html: Allow U+000C FORM FEED as whitespace
|
|
6edf1a64
|
2024-09-09T02:09:20
|
|
html: Fix DOCTYPE parsing
|
|
9678163f
|
2024-09-09T02:01:19
|
|
html: Don't check for valid XML characters
|
|
a6955c13
|
2024-09-08T23:19:49
|
|
html: Parse numeric character references according to HTML5
|
|
4eeac309
|
2024-09-08T22:20:20
|
|
html: Start to fix EOF and U+0000 handling
|
|
e062a4a9
|
2024-09-08T20:40:36
|
|
html: Add HTML5 parser option
This option passes tokenizer output directly to the SAX callbacks,
making it possible to test the tokenizer against the html5lib test
suite.
This will produce unbalanced calls to the startElement and endElement
callbacks, but it's the only way to support a SAX like interface for
HTML5. It can be used for filtering or rewriting HTML5, for example.
A HTML5 tree builder could then be implemented on top of the SAX
callbacks.
|
|
17da54c5
|
2024-09-08T19:16:12
|
|
html: Normalize newlines
|
|
341dc78f
|
2024-09-08T19:11:14
|
|
html: Deduplicate code in htmlCurrentChar
|
|
3adb396d
|
2024-09-07T15:18:13
|
|
html: Parse bogus comments instead of ignoring them
Also treat XML processing instructions as bogus comments.
|
|
84440175
|
2024-09-07T14:21:12
|
|
html: Add missing calls to htmlCheckParagraph()
|
|
86d6b9b0
|
2024-09-07T04:18:06
|
|
html: Deduplicate some code
|
|
0d324bde
|
2024-09-07T03:45:09
|
|
html: Simplify node info accounting
|
|
ccb61f59
|
2024-09-07T03:15:50
|
|
html: Remove duplicate calls to htmlAutoClose
|
|
e1834745
|
2024-09-07T00:54:25
|
|
html: Add character data tests
|
|
f9ed30e9
|
2024-09-06T17:49:04
|
|
html: HTML5 character data states
|
|
59511792
|
2024-09-03T15:52:44
|
|
html: Parse named character references according to HTML5
|
|
d5cd0f07
|
2022-07-15T17:00:36
|
|
html: Prefer SKIP(1) over NEXT in HTML parser
Use SKIP(1) where it's safe to avoid a function call.
|
|
dc2d4983
|
2023-05-04T17:47:38
|
|
html: Rework htmlLookupSequence
Rename to htmlLookupString and use strstr for increased performance.
|
|
637215a4
|
2023-05-04T17:16:51
|
|
html: Always terminate doctype declarations on '>'
Align with HTML5 spec. This allows to remove the old quote handling in
htmlLookupSequence.
|
|
72e29f9a
|
2023-05-04T17:03:22
|
|
html: Fix quadratic behavior in push parser
Fix quadratic behavior related to unquoted attribute values. We really
have to replicate parts of the HTML5 state machine to find the end of
tags relibably.
Fixes #533.
|
|
a80f8b64
|
2023-05-04T15:59:31
|
|
html: Allow attributes in end tags
Attribute are syntactically allowed in HTML5 end tags but otherwise
ignored.
|
|
f2272c23
|
2023-05-04T15:33:27
|
|
html: Handle unexpected-solidus-in-tag according to HTML5
|
|
939b53ee
|
2023-05-04T15:25:24
|
|
html: Stop skipping tag content
Tag and attributes names should always be parsed succesfully now.
|
|
dcb2abb2
|
2023-05-04T15:16:29
|
|
html: Parse tag and attribute names according to HTML5
HTML5 allows bascially all characters in tag and attribute names.
|
|
d67833a3
|
2024-09-26T19:21:24
|
|
xmllint: Use proper type to store seconds since epoch
Should avoid year 2038 problem.
Fixes #801.
|
|
81d38ed0
|
2024-09-25T07:52:10
|
|
meson: Fix duplicate listing of libxml2.devhelp2
The duplication caused a warning when uninstalling.
|
|
b1c5aa65
|
2024-09-19T12:50:59
|
|
xpath: Deprecate xmlXPathNAN and xmlXPath*INF
Users should simply use the C99 macros.
|
|
55ddccb6
|
2024-09-14T00:03:56
|
|
io: Make sure not to pass partial UTF-8 to write callback
We cannot split UTF-8 at arbitrary boundaries.
|
|
c46b89e2
|
2024-09-13T21:06:36
|
|
xpath: Deprecate xmlXPathEvalExpr
Also check the argument instead of crashing if there's no context.
|
|
03f1bdd2
|
2024-09-13T20:59:47
|
|
xpath: Make recursion check work with xmlXPathCompile
The check for maximum recursion depth required a parser context with an
xmlXPathContext which xmlXPathCompile didn't provide.
All other code should already set up or require an xmlXPathContext.
|
|
dae160c6
|
2024-09-13T12:08:20
|
|
encoding: Fix table entry for "UTF16"
|
|
5e787401
|
2024-09-10T17:12:25
|
|
save: Make xmlEscapeTab signed
Fixes issues in platforms where char is unsigned.
Fixes #797.
|
|
6e503eb7
|
2024-09-10T03:32:37
|
|
encoding: Handle more ICU error codes
U_ILLEGAL_ESCAPE_SEQUENCE and U_UNSUPPORTED_ESCAPE_SEQUENCE can occur
with ISO-2022.
|
|
55d36c59
|
2024-09-10T03:11:18
|
|
encoding: Fix error code in xmlUconvConvert
Broke in 46ec621e.
|
|
de10d4cd
|
2024-09-04T16:19:42
|
|
include: Check whether _MSC_VER is defined
Should fix #795.
|
|
bd9eed46
|
2024-09-02T18:37:41
|
|
parser: Make unsupported encodings an error in declarations
This was changed in 45157261, but in encoding declarations, unsupported
encodings should raise a fatal error.
Fixes #794.
|
|
40abebbc
|
2024-08-29T01:02:33
|
|
python: Fix SAX driver with character streams
This apparently broke with Python 3.5 which introduced character
streams.
Fixes #790.
|
|
8ae06d52
|
2024-08-29T00:07:27
|
|
SAX2: Don't merge CDATA sections
The Document Object Model (DOM) Level 3 Core Specification says:
> Adjacent CDATASection nodes are not merged by use of the normalize
> method of the Node interface.
Fixes #412.
|
|
dde62ae5
|
2024-08-28T23:58:20
|
|
parser: Align push parsing of CDATA sections with pull parser
Remove special handling of CDATA sections in push parser. This makes
sure that only a single callback is generated for large sections.
Fixes #22 and needed for #412.
|
|
4d10e53a
|
2024-08-28T22:47:20
|
|
parser: Make sure to set and increment input id
Revert part of commits 410931e3 and b9d2f3c9.
|
|
6d365ca0
|
2024-08-28T22:09:30
|
|
doc: XML_PARSE_NO_XXE is available since 2.13.0
|
|
8ad618d2
|
2024-08-28T22:03:30
|
|
doc: Document all xmllint options
Remove --pushsmall.
Fixes #785.
|
|
67ff748c
|
2024-08-26T23:53:29
|
|
io: don't set the executable bit when creating files
Issue seems to have been introduced in
0bef93bf24def68c448af0e71844b942e0ed93ec.
|
|
0bb0012e
|
2024-08-19T21:15:28
|
|
catalog: Set xmlCatalogInitialized after setting default catalog
|
|
8625db28
|
2024-08-19T20:02:47
|
|
xmlcatalog: Improved fix for #699
|
|
4b007878
|
2024-08-19T20:44:07
|
|
Revert "catalog: Fetch XML catalog before dumping"
This reverts commit 723b4de04015c5acccd3cda5dd60db7d00702064.
|
|
57b92cab
|
2024-08-19T20:35:54
|
|
catalog: Fix regression in xmlCatalogAdd
Don't load system catalog.
|
|
2abb9033
|
2024-08-19T17:58:47
|
|
tests: Add fallback for glob()
Fix build on platforms without glob().
|
|
567f612d
|
2024-08-19T17:43:32
|
|
build: Check for declaration of glob() function
Don't rely on presence of glob.h.
|
|
2191ccdf
|
2024-08-18T12:16:34
|
|
autotools: Fix EXTRA_DIST
Fixes #786.
|
|
5c608609
|
2024-08-16T22:35:54
|
|
Eliminate false positive -Warray-bounds on some compilers
This affects GCC 7.3, MinGW 7.3, and LCC 1.25 and 1.26.
In some sense it reverts ad93f087.
|
|
a3043b47
|
2024-08-14T23:40:16
|
|
threads: define _WIN32_WINNT as 0x0600 to use InitOnceExecuteOnce()
|
|
f59da1a3
|
2024-08-14T23:22:59
|
|
hash: Suppress GCC 7.3 and MINGW maybe-uninitialized warning
|
|
103aadbc
|
2024-08-14T23:15:30
|
|
parser: Suppress EDG maybe-uninitialized warning
|
|
f2c48847
|
2024-08-13T14:38:07
|
|
io: Add missing calls to xmlInitParser
This is required after c9a46a91.
Should fix #782.
|
|
0c56eb82
|
2024-08-12T21:38:50
|
|
tree: Restore return value of xmlNodeListGetString with NULL list
When passing a NULL list to xmlNodeListGetString or
xmlNodeListGetRawString, return NULL instead of "" to match the old
behavior.
Fixes #783.
|
|
b45a0f0e
|
2024-08-06T22:34:29
|
|
nanohttp: Avoid ctype(3) misuse.
fix https://gitlab.gnome.org/GNOME/libxml2/-/issues/780
|
|
1d009fe3
|
2024-08-05T15:14:21
|
|
parser: Report at least one fatal error
|
|
bfed6e6a
|
2024-08-05T14:58:37
|
|
parser: Fix error handling after reaching limit
Mark document as non-wellformed and stop parser even if error limit was
reached.
Regressed in abd74186.
|
|
6e1e22dc
|
2024-08-04T11:45:05
|
|
malloc-fail: Fix null-deref in xmlPatPushState
|
|
a530ff12
|
2024-07-29T14:18:57
|
|
io: Always consume encoding handler when creating output buffers
Also free encoding handler in error case.
Remove xmlAllocOutputBufferInternal which was identical to
xmlAllocOutputBuffer.
|
|
36ea881b
|
2024-07-26T18:07:27
|
|
malloc-fail: Fix memory leak in xmlOutputBufferCreateFilename
Close encoding handler on error.
|