kc3-lang/md4c/test

Log

Commit Date Message
481fbfbd 2024-02-25 20:51:06 Check for hard breaks more carefully to avoid false positives... ... caused by trailing tab characters. Fixes #250.
64f36805 2024-02-25 16:24:50 Fix handling of tabs when removing trailing whitespace. Especially in connection with ATX headers.
3848bfb6 2024-02-21 08:51:05 Make strikethrough spans follow same flanking rules... ... as other emphasis spans. Fixes #242.
aa53f82c 2024-02-07 11:44:39 Introduce an overall limit to link ref. def. instantiations. This is to prevent time and output size explosion in case of an input pattern generated by this: $ python -c 'N=1000; print("[x]: " + "x" * N + "\n[x]" * N)' We roughly allow link reference definitions to blow up the input size of the document 16 times or up to 1 MB, whichever is smaller. When the threshold is reached, the following reference definitions are sent to output unresolved as text. Fixes #238.
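The limit described in the aa53f82c entry can be illustrated with a small sketch. The function names are hypothetical, and reading "1 MB" as 1024 * 1024 bytes is an assumption; neither is taken from MD4C's C source.

```python
def pathological_link_refs(n: int) -> str:
    """Build the string that the one-liner in the commit message prints."""
    return "[x]: " + "x" * n + "\n[x]" * n


def ref_def_budget(input_size: int) -> int:
    """Hypothetical budget: 16x the input size or 1 MB, whichever is smaller."""
    one_mb = 1024 * 1024  # assumption: the log does not give the exact byte count
    return min(16 * input_size, one_mb)


n = 1000
doc = pathological_link_refs(n)
# Without a limit, each of the n references would instantiate an
# n-character destination, so the expansion grows with n * n.
naive_expansion = n * n
print(len(doc), ref_def_budget(len(doc)), naive_expansion)
```

For small documents the 16x factor is the binding limit; the 1 MB cap only takes over for inputs larger than 64 KiB under this reading.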
f37a89f5 2024-02-01 21:55:45 md_is_inline_link_spec: Use md_lookup_line() instead of walking. Fixes #236.
485619fe 2024-01-30 01:46:15 test/spec.txt: Upgrade to spec version 0.31.2. It's essentially the same as 0.31 and 0.31.1; it only fixes release date metadata in the spec.txt file. Also fix link in CHANGELOG.md accordingly.
f852aaed 2024-01-28 20:37:08 test/LICENSE: Update to reflect recent file renaming. Also rename the file to test/LICENSE.md.
1883132b 2024-01-28 19:05:39 Update test/spec.txt from upstream. (The spec 0.31 was released erroneously still with version 0.30 inside of it. Re-release 0.31.1 fixes it.)
136b39ac 2024-01-09 02:10:43 Update test/spec.txt from upstream.
aeddaf58 2024-01-25 22:24:17 Simplify and fix handling of newline in code span. Fixes #223 properly (one corner case has been unnoticed/hidden due to the test suite normalization feature). Fixes #230 (strictly speaking a duplicate of the corner case).
d082cdd8 2024-01-25 21:25:26 test/run-testsuite.py: Allow disabling normalisation on per-unittest basis. And use it for a few tests in regressions.txt where the whitespace matters.
a3c510ac 2024-01-21 14:11:47 Improve coverage testing of UTF-8 routines.
cd7c326f 2024-01-21 13:20:38 Add code coverage test for MD_FLAG_COLLAPSEWHITESPACE.
65957f53 2024-01-19 10:37:33 Limit number of table columns to prevent explosion of output... with an input pattern of the form generated by this one-liner: $ python3 -c 'N=1000; print("x|" * N + "\n" + "-|" * N + "\n" + "x\n" * N)' Here the amount of HTML output grows with N^2.
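The quadratic growth mentioned in the 65957f53 entry can be made concrete with a short sketch. It assumes the renderer pads every short body row to the full column count, which is what makes the output grow with N^2; the helper names are hypothetical.

```python
def pathological_table(n: int) -> str:
    """Build the string that the one-liner in the commit message prints."""
    return "x|" * n + "\n" + "-|" * n + "\n" + "x\n" * n


def padded_cell_count(n: int) -> int:
    """Cells emitted if each of the n body rows is padded to n columns.

    One header row plus n body rows, each with n cells.
    """
    return n * (n + 1)


for n in (10, 100, 1000):
    # Input size grows linearly in n, the number of cells quadratically.
    print(n, len(pathological_table(n)), padded_cell_count(n))
```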
70b247cf 2024-01-19 13:59:45 md_analyze_permissive_autolink: Accept path ending with '/'. Fixes #226.
601ff053 2024-01-18 16:28:16 Fix handling new line at beginning/end of a code span. Fixes #223.
23b14168 2024-01-18 15:11:22 pathological-tests.py: Fix output if a test unit ends with non-zero exit code.
a08f6a05 2024-01-18 12:29:31 Improve/fix latex math extension. To mitigate false positives: * We accept $ and $$ as a potential opener only if it's not preceded by an alnum char. * Similarly, a closer cannot be followed by an alnum char. * We now also match the closer with the last preceding potential opener, not the first one. (And to avoid nesting, any previous openers are ignored.) * Also revert an unintended change in 3fc207affaba313cc1f4ef3b4e9e57df89b0e028 which allowed keeping nested resolved marks in it.
4728cd98 2024-01-17 16:04:14 md_analyze_tilde: Pop from chain tail like other emphasis. The function incorrectly used header from the head, leading to wrong result (incompatible with e.g. GFM) but even worse to bad internal state md_rollback() is then potentially unable to solve. Fixes #222.
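The first two rules in the a08f6a05 entry amount to a check on the characters adjacent to the `$` run. The sketch below is an illustration only: the function names are hypothetical, and it uses Python's Unicode-aware `str.isalnum()`, which may classify characters differently from MD4C's own alnum test.

```python
def may_open_math(text: str, pos: int) -> bool:
    """pos is the index of the first '$' of the run.

    A potential opener must not be preceded by an alphanumeric character.
    """
    return pos == 0 or not text[pos - 1].isalnum()


def may_close_math(text: str, end: int) -> bool:
    """end is the index just past the last '$' of the run.

    A closer must not be followed by an alphanumeric character.
    """
    return end >= len(text) or not text[end].isalnum()


sample = "costs 5$ and ($x$) here"
print(may_open_math(sample, sample.index("$")))   # '$' preceded by '5'
print(may_open_math(sample, sample.index("($") + 1))  # '$' preceded by '('
```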
f45dd442 2024-01-17 02:58:12 Add regression test for #213. As it's now possible to add tests with multiple cmdline options easily.
d955c495 2024-01-17 02:48:57 Rework permissive autolinks. (#220) * We have now dedicated run over the inline marks for them. * We check more thoroughly whether it really looks like a URL or e-mail address. The old implementation recognized even heavily broken ones. * This allows us to be much more careful in order not to cross already resolved marks. * Share substantial parts of the code between all three types of the permissive autolinks (URL, WWW, e-mail). * Merge their tests into one file, spec-permissive-autolinks.txt. * Add one pathological case which triggered quadratic behavior in the old implementation.
a715b884 2024-01-16 15:29:35 Rename many files in test dir for better organization.
4b9e4d7c 2024-01-16 15:32:21 Move one more forgotten regression test to regressions.txt.
6685df9c 2024-01-16 15:09:33 Move all regression tests into new tests/regressions.txt. (And update scripts/run-tests.sh accordingly.)
74e5f7a9 2024-01-16 14:56:09 Tests: Specify md2html command line options for each example as needed. Previously the caller (or the script scripts/run_tests.sh) needed to know what options to specify.
359406bf 2024-01-16 14:25:46 Test: Add support for per-example command line options. (We also removed direct call support into the library. It was inherited from cmark as the testsuite was originally taken from there, but it actually was never updated to work with MD4C.)
78829427 2024-01-13 02:59:35 Fix some emphasis parsing issues. * We incorrectly applied the infamous rule of three only to asterisk-encoded emphasis, it has to be applied to underscore as well. * We incorrectly applied the rule of three only if the opener and/or closer was inside a word. It has also to be applied if the mark is both preceded and followed by punctuation. Fixes #217.
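For reference, the "rule of three" named in the 78829427 entry can be written as a small predicate following the wording of the CommonMark specification. This is a sketch of the spec rule, not MD4C's implementation; the function and parameter names are hypothetical.

```python
def rule_of_three_blocks(opener_len: int, closer_len: int,
                         opener_can_close: bool, closer_can_open: bool) -> bool:
    """Return True if the rule of three forbids pairing the two delimiter runs.

    Per CommonMark: if one of the delimiters can both open and close
    emphasis, the sum of the two run lengths must not be a multiple of 3
    unless both lengths are multiples of 3.
    """
    if not (opener_can_close or closer_can_open):
        return False  # the rule only applies to "both-flanking" delimiters
    total = opener_len + closer_len
    both_multiples = opener_len % 3 == 0 and closer_len % 3 == 0
    return total % 3 == 0 and not both_multiples


# A run of 1 and a run of 2 sum to 3, so they may not pair
# when one of them can both open and close.
print(rule_of_three_blocks(1, 2, True, False))
```

The same predicate applies to `*` and `_` runs alike, which is the first point of the commit message.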
5592352f 2024-01-13 00:30:08 HTML declaration doesn't require whitespace before the closer. Fixes #216.
7497ea92 2024-01-13 00:17:08 Allow tabs after setext header underline. Fixes #215.
0d10b60b 2024-01-12 22:44:31 Move test/fuzz-input/ to test/fuzzers/seed-corpus/.
821477b1 2024-01-10 17:35:46 Fix typo in fuzz-mdhtml.c, preventing oss-fuzz from working.
c6942ef0 2024-01-10 17:31:55 Treat TABLECELLBOUNDARIES chain as special one. It's not an ordinary openers chain as (most of) the others, and md_rollback() must not touch it. Fixes #212.
ca169a92 2024-01-10 12:22:04 Fix HTML renderer to handle nested images correctly. Fixes #210.
38303af3 2024-01-09 00:01:35 Make md_is_html_block_end_condition() reuse the same data... ... as md_is_html_block_start_condition() for the type 1 so that all tags are used consistently there. Fixes #207.
8699cd5d 2024-01-08 21:58:26 test/hard-soft-breaks.txt: Fix wording.
6ef3be6e 2024-01-08 20:09:57 `MD_FLAG_HARD_SOFT_BREAKS` (#193)
4d2f8a2e 2024-01-08 19:35:53 Add test for issue #201. Seems the issue got fixed by combination of previous commits. Fixes #201.
132c29dc 2024-01-08 19:31:37 Allow indented code block to follow any block except paragraph without a blank line. Fixes #200.
601c8ab7 2024-01-08 19:06:04 Restore parent's block indentation when interrupting a list item with double blank line. Fixes #190.
a27f8dc0 2023-12-12 19:31:30 test/fuzzers.fuzz-mdhtml.c: Remove stale comment.
d3c1c0bb 2022-01-14 17:27:05 fuzz-mdhtml.c: Cleanup of the code.
b42e7f5c 2022-01-10 11:41:25 md_resolve_links: Avoid link ref. def. lookup if... if we know that the bracket pair contains nested brackets. That makes the label invalid anyway, therefore we know that there is no link ref. def. to be found anyway. In case of heavily nested bracket pairs, the lookup could lead to quadratic parsing times. Fixes #172.
7f44e1ad 2022-01-10 10:39:29 pathological_tests.py: Improve code alignment.
a8bb4d30 2022-01-06 16:01:55 md_is_table_underline: Remove requirement for minimal length of a cell underline. Fixes #169.
c01aa6b3 2021-06-27 18:28:26 Update CommonMark spec file to v. 0.30
bcb55d0d 2021-04-14 09:18:09 md_resolve_links: Suppress bogus nested permissive autolink. Fixes #152.
3478ec69 2021-02-23 14:01:31 Added fuzzer for oss-fuzz integration. (#151)
fd7b5fe0 2021-02-05 21:40:47 md_analyze_line: Fix implicit ending of HTML blocks... ... when the HTML block is not explicitly ended (before the enclosing container block ends). Fixes #149.
da5821ae 2020-12-14 19:53:40 Fix testcase for issue #142.
5a44e327 2020-12-14 18:59:56 md_link_label_cmp: Fix the loop end condition. The old version likely could stop prematurely in a corner case when there was a Unicode character at the end of either string which maps into multiple fold info codepoints. Fixes #142.
3254b7cb 2020-11-13 12:02:39 md_process_table_block_contents: Suppress empty TBODY block generation. When the table has no body rows, do not call the callback with MD_BLOCK_TBODY events. Fixes #138.
4585088a 2020-11-13 10:16:34 md_analyze_permissive_url_autolink: Better GFM compatibility. The autolinks now allow unmatched parentheses; only the trailing parenthesis closers are handled specially to deal with the situation where the autolink is all inside outer parentheses. Somehow our tests were broken and avoided the cases with unmatched parenthesis pairs inside the auto-link. That's now fixed and in sync with GFM specs too. Fixes #135.
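The trailing-parenthesis handling referred to in the 4585088a entry is described in the GFM specification roughly as follows: when an autolink ends in `)`, and it contains more closing than opening parentheses, the unmatched trailing closers are not part of the link. A minimal sketch of that rule, with a hypothetical function name and no claim to mirror MD4C's code:

```python
def trim_trailing_parens(url: str) -> str:
    """Drop unmatched trailing ')' characters from a candidate autolink.

    Only trailing closers are removed; unmatched parentheses elsewhere
    in the URL are left alone.
    """
    while url.endswith(")") and url.count(")") > url.count("("):
        url = url[:-1]
    return url


print(trim_trailing_parens("www.google.com/search?q=Markup+(business)))"))
print(trim_trailing_parens("www.google.com/search?q=(business))+ok"))
```

The second call leaves the URL untouched because it does not end in `)`, even though its parentheses are unbalanced.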
002f76c9 2020-10-18 09:37:45 md_resolve_links: Skip [...] used as a reference link/image label. Fixes #131.
c501c891 2020-07-30 10:13:05 Fix spelling of "than" in many occurrences. I often spell it erroneously as "then". Doing this mistake way too often when typing fast.
c595c2ed 2020-07-30 08:38:19 md_process_verbatim_block_contents: Fix off by 1 error. This caused outputting wrong indentation inside fenced code blocks for lines indented with more than 16 spaces. Fixes #124.
0c4d7f3d 2020-07-28 07:18:23 test/normalize.py: Use html.escape instead of cgi.escape. Fixes #123.
d0e3ed79 2020-03-12 22:45:32 md2html: Skip UTF-8 BOM, if present in the input.
9e6ab76c 2020-02-17 12:41:50 Minor fuzz-input cleanup. Move some permissive links incorrectly placed in commonmark.md into gfm.md.
cc9a9d28 2020-02-16 15:29:54 test/fuzz-test: Add some fuzzing testing initial input.
5d7c3597 2020-02-16 13:46:16 md_analyze_emph: Detect correctly opener chain when resolving the range. Fixes #107.
b4c30cd6 2020-02-13 02:23:03 Improve wiki-link parsing. * Get rid of MD_LINE::total_indent. * Remove some special complicated branching for nested images: Instead we use md_rollback() the wiki-link destination span to kill _any_ marks resolved so far, including the images. * Remove any length limit from label. Only destination length is limited, regardless of whether '|' is present or not. * Move the special handling of `[[foo|]]` from md_process_inlines() into md_resolve_links(). We simply expand the closer mark to consume the `|`. * Do not modify the opener and closer marks until we really know it is indeed a wiki-link.
403043bb 2020-01-16 16:15:08 md_mark_chain_append: Set next of the tail mark to -1. Fixes #104.
e6661f23 2020-01-10 19:27:10 Implement an underline extension. (#103) Closes #101.
82d7d087 2020-01-10 15:48:00 Rework/improve recognition of strike-through spans. Closes #102.
561f52e0 2020-01-05 18:33:46 md_is_autolink_email: Fix an off-by-one error. Fixes #100.
46f25f0b 2019-11-12 21:48:26 md_analyze_emph: Call md_resolve_range() with proper chain. Erroneously, we have called md_resolve_range() with mark chain derived from the closer mark. In the case that the opener and closer marks differ in length (and we have split one or the other), we pass in an incorrect chain, which may lead to strange behavior in subsequent analysis. Fixes #98.
e336e640 2019-11-04 15:20:59 Add support for Wiki links (#92) With a new flag MD_FLAG_WIKILINKS, recognize wiki-style links as [[foo]] and [[foo|bar]]. Update also the HTML renderer accordingly, to output a custom HTML tag <x-wikilink> when seeing it.
ef85cfc2 2019-11-04 15:05:07 Simplify parsing of tables (#97) We do so by removing the function md_is_table_row(). md_is_table_row() did some crazy inline parsing to detect whether the line contains at least one pipe which is not inside a code span or other high-priority inline element. This was very complicated under the hood and it was actually breaking the clean design which separates block analysis and inline analysis of each block's contents. We now just use the table underline for determining that the block is a table and its properties like e.g. the column count. This means a paragraph now cannot interrupt a table. This is a change in behavior but likely an acceptable one as it actually brings the behavior closer to the behavior of tables in cmark-gfm in this regard. Last but not least, it seems to prevent adoption of other useful features; for more about that, see the discussion in PR #92.
993c7b9b 2019-11-03 23:32:46 Render LaTeX math into HTML as a tag <x-equation>... ... instead of <equation>. This is to highlight that it is not a standard HTML tag.
e97d0250 2019-11-03 13:44:29 Link label comparison fixes. * md_link_label_cmp: To match the labels, the loop has to reach ends of the labels for both of them. * md_link_label_cmp_load_fold_info: Collapse consecutive whitespace into a single ' ' for the label comparison purposes. Fixes #96.
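The label matching rules touched by the e97d0250 entry follow the CommonMark definition of label normalization: strip leading and trailing whitespace, collapse consecutive internal whitespace to a single space, and apply Unicode case folding. A minimal sketch using Python's built-in `str.casefold()`; the function names are hypothetical and MD4C uses its own generated fold tables instead.

```python
def normalize_label(label: str) -> str:
    """Collapse whitespace runs to one space and apply Unicode case folding."""
    return " ".join(label.split()).casefold()


def labels_match(a: str, b: str) -> bool:
    """Two labels match only if their normalized forms are equal in full."""
    return normalize_label(a) == normalize_label(b)


print(labels_match("  Foo\n  BAR ", "foo bar"))
# 'ß' folds to the two code points 'ss', so labels of different
# length can still match after folding.
print(labels_match("Straße", "STRASSE"))
```

Comparing the complete normalized strings corresponds to the first bullet of the commit message: a match requires reaching the end of both labels, not just the end of the shorter one.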
0354e1ab 2019-10-04 22:34:08 md_is_container_mark: Ordered list mark requires at least one digit. Fixes #95.
97606369 2019-07-07 11:19:21 Fix the last test case in latex-math.txt.
099ce69b 2019-07-07 11:15:44 Add missing file into git.
2e965941 2019-07-07 10:59:20 Add/improve docs for the LaTeX math spans.
8bac86aa 2019-07-07 09:46:10 Added support for LaTeX math (#87) Addresses #86.
ce8b5d94 2019-05-27 22:16:35 md_analyze_line: Blockquote with blank line can interrupt a paragraph. Fixes #83.
51386164 2019-05-19 11:46:26 md_link_label_cmp: Fix handling non-trivial folding info. Fixes #78.
4f6a9e54 2019-05-19 10:46:26 Update Unicode support to 12.1. * scripts/build_*_map.py: Implement helper Python scripts used to generate some Unicode search maps and data for helper Unicode functions used in MD4C. This should simplify updating to future Unicode versions. * md_get_unicode_fold_info: Use data generated by the scripts. * md_is_unicode_whitespace__: Ditto. * md_is_unicode_punct__: Ditto.
aca5c27f 2019-05-16 22:48:08 test/spec.txt: Update from upstream head.
64a1bc37 2019-05-15 23:25:05 test/coverage.txt: Sort the regression test cases by the issue number.
919a0cc9 2019-05-08 07:38:33 test/*.txt: Fix some formatting.
1757ff55 2019-05-07 23:10:46 test/spec_tests.py: Make ready for spec.txt from cmark-gfm project. This allows easier checking of our GFM dialect compatibility.
83047d3e 2019-05-07 22:24:29 md_analyze_permissive_url_autolink: Improve. * Fix domain recognition so that it has to have at least two dot-delimited components. * Fix handling of parentheses so that they have to form balanced pairs; i.e. the first ')' not having a preceding opener ends the path. Fixes #76.
609dfb0b 2019-05-05 15:56:51 md_analyze_line: Treat blank lines inside a HTML block more carefully... ... with respect to the parent list containers. Fixes #10 (but now really).
95279131 2019-04-30 00:32:36 When undoing complete block from ctx->block_bytesp[], reset ctx->current_block properly. Fixes #74.
d4d10915 2019-04-29 19:03:16 Improve parsing of inline raw HTML. * Isolate some common code for scanning HTML closer into a new function so most HTML scanner functions reuse the same code. * Improve the scanning for the closer so that on failure we remember the range where no closer is present. So any later scanning attempts may fail early. Fixes #73.
d7920b9c 2019-04-08 19:35:06 Merge pull request #67 from mity/spec-0.29 This merges all changes for CommonMark specification 0.28 -> 0.29 transition.
5b78f295 2019-04-08 11:00:27 test/spec.txt: Update from upstream head.
2a7b97ed 2019-04-05 08:18:54 test/spec.txt: Update from upstream head.
b8586987 2019-04-03 08:28:27 md_collect_mark: Add missing 'continue' to '~' branch. Fixes #69.
855a1bfc 2019-03-27 02:04:24 test/spec.txt: Update from upstream head.
94c86fe2 2019-03-26 14:45:23 Revert "Fix problematic link destinations with angle brackets." The updated specification now explicitly requests the behavior we implemented before fixing #24. This reverts commit 2e0a74ba990e291ef4eace047d50af05ca81daef. Also remove associated regression test as it is no longer valid.
0959975a 2019-03-26 14:01:02 md_analyze_emph: Follow specs changes to the "rule of three".
98968e22 2019-03-26 13:33:05 Update spec.txt from upstream head. (I previously used an updated revision of it by mistake.)
1edd0c9c 2019-03-26 11:49:25 test/spec.txt: Update to current upstream HEAD.
2dd96ab4 2019-03-12 09:56:11 Fix O(n^2) in handling the "rule of three". We had to break the list of potential '*' openers into multiple ones so we do not have to walk it when looking for matching length due to the "rule of three" for intraword delimiter runs. Fixes #63.
b2108652 2019-03-11 21:13:15 md_analyze_line: Fix O(n^2) in thematic break handling. Fixes #66.
37104fc2 2019-03-11 20:26:58 md_is_code_span: Fix crash at EOF. Fixes #65.
966b8e39 2019-03-11 19:56:46 md_is_link_title: Stop on ')' in ()-style title. Fixes #60.
fc27108e 2019-03-11 19:55:08 test/pathological_tests.py: Output test durations.