|
764b8086
|
2024-11-13T20:22:32
|
|
tests: Fix sanitizer version check on old Apple clang
See #669.
|
|
b57e022d
|
2024-11-13T19:08:47
|
|
build: Check for icu-uc instead of icu-i18n
This should be the ICU component we actually need.
|
|
aaecdc92
|
2024-11-12T16:42:36
|
|
parser: Assign value without if-statement
This avoids an if-statement that effectively does nothing. For example,
the binary generated by GCC with -O2 optimization settings does not
contain that if-statement; the code just uses the hprefix->name field
directly.
No functional changes intended.
Signed-off-by: Ruslan Garipov <ruslanngaripov@gmail.com>
|
|
1e4d8c55
|
2024-11-06T16:42:05
|
|
xmlIO: Fix reading from non-regular files like pipes
Commit 7e14c05d removed unnecessary copying of uncompressed input
through zlib or xzlib. This broke input from non-regular files like
pipes which can't be reopened. Try to detect such files by checking
whether they're seekable and always pipe them through zlib or xzlib.
Also remove seemingly unnecessary calls to gzread and gzrewind to
support unseekable files.
Fixes https://gitlab.gnome.org/GNOME/libxslt/-/issues/124.
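
One common way to test whether a file descriptor is seekable on POSIX systems is to call lseek and check for failure. The following is a minimal sketch under that assumption, not necessarily the check used in xmlIO; the helper name fdIsSeekable is hypothetical.

```c
#include <assert.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper (not a libxml2 function): returns 1 if fd
 * supports seeking and 0 otherwise. For pipes, FIFOs and sockets,
 * lseek fails with ESPIPE.
 */
int
fdIsSeekable(int fd) {
    assert(fd >= 0);
    /* Asking for the current offset does not move the file position. */
    return lseek(fd, 0, SEEK_CUR) != (off_t) -1;
}
```

A caller could use the result to decide whether the input can be reopened or rewound, or whether it has to be consumed in a single pass.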
|
|
45914614
|
2024-11-05T12:05:14
|
|
xpath: Fix parsing of non-ASCII names
Fix a long-standing issue where QNames starting with a non-ASCII
character would be rejected. This became more visible after "streaming"
XPath evaluation was disabled since the latter handled non-ASCII names
correctly.
Fixes #818.
|
|
9201173c
|
2024-11-05T11:41:28
|
|
xmlreader: Fix return value of xmlTextReaderReadString
Return NULL if the node has no children or the children were already
deleted to match the 2.12 behavior.
Fixes #817.
|
|
869e3fd4
|
2024-11-01T16:52:31
|
|
parser: Fix loading of parameter entities in external DTDs
Regressed with commit 12f0bb94.
Fixes #816.
|
|
36117723
|
2024-10-31T17:38:37
|
|
Update README
|
|
467f4445
|
2024-10-30T14:03:39
|
|
SAX2: Add NULL check for ctxt->myDoc
|
|
efb57ddb
|
2024-10-30T14:02:36
|
|
parser: Fix downstream code that swaps DTDs
Downstream code like the nginx xslt module can change the document's DTD
pointers in a SAX callback. If an entity from a separate DTD is parsed
lazily, its content must not reference the current document.
Regressed with commit d025cfbb.
Fixes #815.
|
|
0ec5687e
|
2024-10-28T20:41:56
|
|
parser: Rework xmlCtxtGrowAttrs
Remove unneeded argument.
Check for integer overflow. We probably hit the buffer size limit in
xmlParserGrow before, but better be safe.
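
As an illustration of this kind of overflow check, here is a minimal sketch of a guarded capacity doubling. It assumes an int capacity and a doubling growth policy; the helper growCapacity is hypothetical and is not the code of xmlCtxtGrowAttrs.

```c
#include <assert.h>
#include <limits.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical helper: computes a doubled capacity for an array of
 * elemSize-byte elements. Returns 0 and stores the new capacity on
 * success, or -1 if doubling would overflow int or if the resulting
 * byte size would not fit in size_t.
 */
int
growCapacity(int cur, size_t elemSize, int *newCap) {
    assert(cur > 0 && elemSize > 0 && newCap != NULL);

    if (cur > INT_MAX / 2)
        return -1;                      /* cur * 2 would overflow int */
    if ((size_t) cur * 2 > SIZE_MAX / elemSize)
        return -1;                      /* byte count would overflow */

    *newCap = cur * 2;
    return 0;
}
```

Checking before multiplying matters here because signed integer overflow is undefined behavior in C, so the result cannot be validated after the fact.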
|
|
ffb058f4
|
2024-10-28T20:12:52
|
|
parser: Fix detection of duplicate attributes
We really need a second scan if more than one namespace clash was
detected.
|
|
89b9f457
|
2024-10-25T18:02:58
|
|
entities: Allow control chars when serializing HTML
|
|
b52a3044
|
2024-10-24T18:18:47
|
|
parser: Use counted_by attribute if supported
We only have a single struct with a flexible array member.
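
For readers unfamiliar with the attribute, the sketch below shows how counted_by can be applied conditionally to a flexible array member. The macro COUNTED_BY, the struct intVec and the function intVecNew are hypothetical names chosen for illustration; they are not the macro or struct used in libxml2.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Use the attribute only if the compiler reports support for it. */
#if defined(__has_attribute)
  #if __has_attribute(counted_by)
    #define COUNTED_BY(member) __attribute__((counted_by(member)))
  #endif
#endif
#ifndef COUNTED_BY
  #define COUNTED_BY(member)
#endif

/* Hypothetical struct: size holds the element count of items. */
struct intVec {
    int size;
    int items[] COUNTED_BY(size);
};

struct intVec *
intVecNew(int size) {
    struct intVec *v;
    int i;

    assert(size >= 0);
    v = malloc(sizeof(*v) + (size_t) size * sizeof(int));
    if (v == NULL)
        return NULL;
    v->size = size;     /* set the count before touching the array */
    for (i = 0; i < size; i++)
        v->items[i] = 0;
    return v;
}
```

On compilers without the attribute the macro expands to nothing, so the declaration stays a plain flexible array member.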
|
|
944e5fe8
|
2024-10-23T16:46:03
|
|
nanohttp: Fix another stdout file descriptor
|
|
607ada90
|
2024-10-23T14:19:01
|
|
nanohttp: Fix stdout file descriptor
Fixes #813.
|
|
b7c0f9d2
|
2024-10-19T14:26:39
|
|
string: Fix va_copy fallback
Fix va_copy fallback reworked in 5cffba83.
Should fix #812.
|
|
a870088f
|
2024-10-14T19:58:23
|
|
xpath: Hide internal sort functions
|
|
51394929
|
2024-10-15T11:11:38
|
|
python/tests: fix typos
Typos were found with codespell.
|
|
f9a6469a
|
2024-10-14T16:14:55
|
|
Update NEWS
|
|
c7b27866
|
2024-10-12T11:55:50
|
|
python: Avoid "'licence' distribution option is deprecated; use 'license'" error
|
|
bf3619c3
|
2024-10-10T12:14:47
|
|
fuzz: Don't unlink DTD when replacing nodes
OP_XML_REPLACE_NODE needs the same check as OP_XML_UNLINK_NODE.
|
|
f77ec16d
|
2024-09-12T01:45:34
|
|
html: Optimize htmlParseCharData
|
|
440bd64c
|
2024-09-12T04:01:38
|
|
html: Optimize htmlParseHTMLName
|
|
c34d0ae9
|
2024-09-12T23:50:20
|
|
html: Deprecate htmlIsBooleanAttr
|
|
6040785a
|
2024-09-12T23:12:01
|
|
html: Deprecate AutoClose API
|
|
188cad68
|
2024-09-12T02:51:20
|
|
html: Remove obsolete content model
|
|
0144f662
|
2024-09-12T02:30:10
|
|
html: Remove obsolete code
|
|
0ce7bfe5
|
2024-09-12T01:44:18
|
|
html: Try to avoid passing XML options to HTML parser
|
|
76cc6394
|
2024-09-12T01:43:42
|
|
test: Fix XML_PARSE_HTML constant
|
|
575be6c1
|
2024-09-12T01:40:07
|
|
html: Fix line numbers with CRs
|
|
be874d78
|
2024-09-11T19:47:07
|
|
html: Ignore unexpected DOCTYPE declarations
|
|
462bf0b7
|
2024-09-11T19:06:06
|
|
html: Rework options
Introduce htmlCtxtSetOptions, see similar changes made to XML parser.
Add HTML_PARSE_HUGE alias. Support HTML_PARSE_BIG_LINES.
|
|
16de1346
|
2024-09-11T19:05:38
|
|
parser: Make new options actually work
|
|
42c3823d
|
2024-09-11T19:05:09
|
|
html: Update comment
|
|
9f04cce6
|
2024-09-11T17:43:07
|
|
html: Remove unused or useless return codes
htmlParseStartTag should always succeed (except for malloc failures).
|
|
a4c16a14
|
2024-09-27T23:49:02
|
|
xmllint: Improve --memory and --testIO options
Support --memory and --testIO in SAX mode.
Keep memory-mapped file across repetitions.
Options `--sax --memory --noout --repeat` can now be used to benchmark
the core parser without building a DOM tree or repeatedly reading files
from disk.
|
|
e179f3ec
|
2024-09-11T17:29:59
|
|
html: Stop reporting syntax errors
It doesn't make much sense to keep the old syntax error handling which
doesn't conform to HTML5.
Handling HTML5 parser errors is rather involved and not essential for
parsers.
|
|
3ac214f0
|
2024-09-27T22:54:14
|
|
xmllint: Support --html --sax
|
|
225ed707
|
2024-09-26T22:38:24
|
|
html: Accelerate htmlParseCharData
|
|
74dfc49b
|
2024-09-26T21:24:00
|
|
parser: Clarify logic in xmlParseStartTag2
|
|
20799979
|
2024-09-26T17:09:40
|
|
html: Handle numeric character references directly
|
|
0bc4608c
|
2024-09-15T20:28:49
|
|
html: Use hash table to check for duplicate attributes
|
|
24a6149f
|
2024-09-15T19:18:40
|
|
html: Make sure that character data mode is reset
|
|
c32397d5
|
2024-09-12T22:39:05
|
|
html: Improve character class macros
|
|
e8406554
|
2024-09-12T15:21:03
|
|
html: Rewrite parsing of most data
|
|
ccb61f59
|
2024-09-07T03:15:50
|
|
html: Remove duplicate calls to htmlAutoClose
|
|
e1834745
|
2024-09-07T00:54:25
|
|
html: Add character data tests
|
|
f9ed30e9
|
2024-09-06T17:49:04
|
|
html: HTML5 character data states
|
|
59511792
|
2024-09-03T15:52:44
|
|
html: Parse named character references according to HTML5
|
|
d5cd0f07
|
2022-07-15T17:00:36
|
|
html: Prefer SKIP(1) over NEXT in HTML parser
Use SKIP(1) where it's safe to avoid a function call.
|
|
dc2d4983
|
2023-05-04T17:47:38
|
|
html: Rework htmlLookupSequence
Rename to htmlLookupString and use strstr for increased performance.
|
|
637215a4
|
2023-05-04T17:16:51
|
|
html: Always terminate doctype declarations on '>'
Align with HTML5 spec. This allows removing the old quote handling in
htmlLookupSequence.
|
|
72e29f9a
|
2023-05-04T17:03:22
|
|
html: Fix quadratic behavior in push parser
Fix quadratic behavior related to unquoted attribute values. We really
have to replicate parts of the HTML5 state machine to find the end of
tags reliably.
Fixes #533.
|
|
a80f8b64
|
2023-05-04T15:59:31
|
|
html: Allow attributes in end tags
Attributes are syntactically allowed in HTML5 end tags but otherwise
ignored.
|
|
f2272c23
|
2023-05-04T15:33:27
|
|
html: Handle unexpected-solidus-in-tag according to HTML5
|
|
939b53ee
|
2023-05-04T15:25:24
|
|
html: Stop skipping tag content
Tag and attribute names should always be parsed successfully now.
|
|
dcb2abb2
|
2023-05-04T15:16:29
|
|
html: Parse tag and attribute names according to HTML5
HTML5 allows basically all characters in tag and attribute names.
|
|
c6af1017
|
2024-09-08T20:45:48
|
|
html: Test tokenizer against html5lib test suite
|
|
27752f75
|
2024-09-11T15:06:55
|
|
html: Fix EOF handling in start tags
|
|
b19d3539
|
2024-09-11T15:03:49
|
|
html: Fix EOF handling in comments
|
|
17e56ac5
|
2024-09-11T14:24:58
|
|
html: Fix parsing of end tags
|
|
24a09033
|
2024-09-09T02:53:14
|
|
html: Fix bogus end tags
|
|
bca64854
|
2024-09-09T02:30:18
|
|
html: Allow U+000C FORM FEED as whitespace
|
|
6edf1a64
|
2024-09-09T02:09:20
|
|
html: Fix DOCTYPE parsing
|
|
9678163f
|
2024-09-09T02:01:19
|
|
html: Don't check for valid XML characters
|
|
a6955c13
|
2024-09-08T23:19:49
|
|
html: Parse numeric character references according to HTML5
|
|
4eeac309
|
2024-09-08T22:20:20
|
|
html: Start to fix EOF and U+0000 handling
|
|
e062a4a9
|
2024-09-08T20:40:36
|
|
html: Add HTML5 parser option
This option passes tokenizer output directly to the SAX callbacks,
making it possible to test the tokenizer against the html5lib test
suite.
This will produce unbalanced calls to the startElement and endElement
callbacks, but it's the only way to support a SAX-like interface for
HTML5. It can be used for filtering or rewriting HTML5, for example.
An HTML5 tree builder could then be implemented on top of the SAX
callbacks.
|
|
17da54c5
|
2024-09-08T19:16:12
|
|
html: Normalize newlines
|
|
341dc78f
|
2024-09-08T19:11:14
|
|
html: Deduplicate code in htmlCurrentChar
|
|
3adb396d
|
2024-09-07T15:18:13
|
|
html: Parse bogus comments instead of ignoring them
Also treat XML processing instructions as bogus comments.
|
|
84440175
|
2024-09-07T14:21:12
|
|
html: Add missing calls to htmlCheckParagraph()
|
|
86d6b9b0
|
2024-09-07T04:18:06
|
|
html: Deduplicate some code
|
|
0d324bde
|
2024-09-07T03:45:09
|
|
html: Simplify node info accounting
|
|
d67833a3
|
2024-09-26T19:21:24
|
|
xmllint: Use proper type to store seconds since epoch
Should avoid year 2038 problem.
Fixes #801.
|
|
81d38ed0
|
2024-09-25T07:52:10
|
|
meson: Fix duplicate listing of libxml2.devhelp2
The duplication caused a warning when uninstalling.
|
|
b1c5aa65
|
2024-09-19T12:50:59
|
|
xpath: Deprecate xmlXPathNAN and xmlXPath*INF
Users should simply use the C99 macros.
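
The C99 macros in question are NAN and INFINITY from <math.h>, together with the isnan and isinf classification macros. A minimal sketch of code using them in place of the deprecated globals follows; the helper names numberOrNaN and classifyNumber are hypothetical.

```c
#include <math.h>

/* Hypothetical helper: returns value if valid is nonzero, NaN otherwise. */
double
numberOrNaN(int valid, double value) {
    return valid ? value : NAN;
}

/*
 * Hypothetical helper: returns 0 for NaN, 1 for positive infinity,
 * -1 for negative infinity and 2 for any finite number.
 */
int
classifyNumber(double d) {
    if (isnan(d))
        return 0;
    if (isinf(d))
        return d > 0 ? 1 : -1;
    return 2;
}
```

Negative infinity is written as -INFINITY; there is no separate macro for it.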
|
|
55ddccb6
|
2024-09-14T00:03:56
|
|
io: Make sure not to pass partial UTF-8 to write callback
We cannot split UTF-8 at arbitrary boundaries.
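
To illustrate the boundary problem, the sketch below computes how many leading bytes of a buffer form only complete UTF-8 characters, so that a trailing partial sequence can be held back until more data arrives. It assumes the input is otherwise valid UTF-8; the helper utf8CompletePrefix is hypothetical and is not the code used in the I/O layer.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical helper: given len bytes of UTF-8 that may be cut off in
 * the middle of a character, returns the number of leading bytes that
 * contain only complete characters.
 */
size_t
utf8CompletePrefix(const unsigned char *buf, size_t len) {
    size_t i = len;
    size_t start, need;
    unsigned char lead;

    assert(buf != NULL || len == 0);

    /* Step back over at most three continuation bytes (10xxxxxx). */
    while (i > 0 && len - i < 3 && (buf[i - 1] & 0xC0) == 0x80)
        i--;
    if (i == 0)
        return len;         /* no lead byte in sight, leave data as is */

    start = i - 1;
    lead = buf[start];
    if (lead < 0x80)
        need = 1;
    else if ((lead & 0xE0) == 0xC0)
        need = 2;
    else if ((lead & 0xF0) == 0xE0)
        need = 3;
    else if ((lead & 0xF8) == 0xF0)
        need = 4;
    else
        return len;         /* not a lead byte, leave data as is */

    /* Keep the last character only if all of its bytes are present. */
    return (len - start >= need) ? len : start;
}
```

A writer could pass only the returned number of bytes to its callback and keep the remainder for the next call.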
|
|
c46b89e2
|
2024-09-13T21:06:36
|
|
xpath: Deprecate xmlXPathEvalExpr
Also check the argument instead of crashing if there's no context.
|
|
03f1bdd2
|
2024-09-13T20:59:47
|
|
xpath: Make recursion check work with xmlXPathCompile
The check for maximum recursion depth required a parser context with an
xmlXPathContext which xmlXPathCompile didn't provide.
All other code should already set up or require an xmlXPathContext.
|
|
dae160c6
|
2024-09-13T12:08:20
|
|
encoding: Fix table entry for "UTF16"
|
|
5e787401
|
2024-09-10T17:12:25
|
|
save: Make xmlEscapeTab signed
Fixes issues on platforms where char is unsigned.
Fixes #797.
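
The general pitfall can be sketched as follows: a lookup table that stores negative sentinel values must be declared signed char, because plain char is unsigned on some platforms and would turn -1 into 255. The table contents and the names escapeIndex and escapeClass below are hypothetical and do not reproduce xmlEscapeTab.

```c
/*
 * Hypothetical table: 0 means "copy the character unchanged", a
 * positive value selects a named replacement, and -1 marks a
 * character that needs a numeric character reference.
 */
static const signed char escapeIndex[128] = {
    ['<'] = 1,
    ['>'] = 2,
    ['&'] = 3,
    ['\r'] = -1
};

/* Returns the table entry for c, or 0 for values outside the table. */
int
escapeClass(int c) {
    if (c < 0 || c >= 128)
        return 0;
    return escapeIndex[c];
}
```

With plain char as the element type, the comparison against -1 would silently fail on platforms where char is unsigned.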
|
|
6e503eb7
|
2024-09-10T03:32:37
|
|
encoding: Handle more ICU error codes
U_ILLEGAL_ESCAPE_SEQUENCE and U_UNSUPPORTED_ESCAPE_SEQUENCE can occur
with ISO-2022.
|
|
55d36c59
|
2024-09-10T03:11:18
|
|
encoding: Fix error code in xmlUconvConvert
Broke in 46ec621e.
|
|
de10d4cd
|
2024-09-04T16:19:42
|
|
include: Check whether _MSC_VER is defined
Should fix #795.
|
|
bd9eed46
|
2024-09-02T18:37:41
|
|
parser: Make unsupported encodings an error in declarations
This was changed in 45157261, but in encoding declarations, unsupported
encodings should raise a fatal error.
Fixes #794.
|
|
40abebbc
|
2024-08-29T01:02:33
|
|
python: Fix SAX driver with character streams
This apparently broke with Python 3.5 which introduced character
streams.
Fixes #790.
|
|
8ae06d52
|
2024-08-29T00:07:27
|
|
SAX2: Don't merge CDATA sections
The Document Object Model (DOM) Level 3 Core Specification says:
> Adjacent CDATASection nodes are not merged by use of the normalize
> method of the Node interface.
Fixes #412.
|
|
dde62ae5
|
2024-08-28T23:58:20
|
|
parser: Align push parsing of CDATA sections with pull parser
Remove special handling of CDATA sections in push parser. This makes
sure that only a single callback is generated for large sections.
Fixes #22 and needed for #412.
|
|
4d10e53a
|
2024-08-28T22:47:20
|
|
parser: Make sure to set and increment input id
Revert part of commits 410931e3 and b9d2f3c9.
|
|
6d365ca0
|
2024-08-28T22:09:30
|
|
doc: XML_PARSE_NO_XXE is available since 2.13.0
|
|
8ad618d2
|
2024-08-28T22:03:30
|
|
doc: Document all xmllint options
Remove --pushsmall.
Fixes #785.
|
|
67ff748c
|
2024-08-26T23:53:29
|
|
io: don't set the executable bit when creating files
Issue seems to have been introduced in
0bef93bf24def68c448af0e71844b942e0ed93ec.
|
|
0bb0012e
|
2024-08-19T21:15:28
|
|
catalog: Set xmlCatalogInitialized after setting default catalog
|
|
8625db28
|
2024-08-19T20:02:47
|
|
xmlcatalog: Improved fix for #699
|
|
4b007878
|
2024-08-19T20:44:07
|
|
Revert "catalog: Fetch XML catalog before dumping"
This reverts commit 723b4de04015c5acccd3cda5dd60db7d00702064.
|
|
57b92cab
|
2024-08-19T20:35:54
|
|
catalog: Fix regression in xmlCatalogAdd
Don't load system catalog.
|
|
2abb9033
|
2024-08-19T17:58:47
|
|
tests: Add fallback for glob()
Fix build on platforms without glob().
|
|
567f612d
|
2024-08-19T17:43:32
|
|
build: Check for declaration of glob() function
Don't rely on presence of glob.h.
|