kmx git

Commit	Date	Message
d3c1c0bb	2022-01-14T17:27:05	fuzz-mdhtml.c: Cleanup of the code.
b42e7f5c	2022-01-10T11:41:25	md_resolve_links: Avoid link ref. def. lookup if... if we know that the bracket pair contains nested brackets. That makes the label invalid anyway, therefore we know that there is no link ref. def. to be found anyway. In case of heavily nested bracket pairs, the lookup could lead to quadratic parsing times. Fixes #172.
7f44e1ad	2022-01-10T10:39:29	pathological_tests.py: Improve code alignment.
a8bb4d30	2022-01-06T16:01:55	md_is_table_underline: Remove requirement for minimal length of a cell underline. Fixes #169.
c01aa6b3	2021-06-27T18:28:26	Update CommonMark spec file to v. 0.30
bcb55d0d	2021-04-14T09:18:09	md_resolve_links: Suppress bogus nested permissive autolink. Fixes #152.
3478ec69	2021-02-23T14:01:31	Added fuzzer for oss-fuzz integration. (#151)
fd7b5fe0	2021-02-05T21:40:47	md_analyze_line: Fix implicit ending of HTML blocks... ... when the HTML block is not explicitly ended (before the enclosing container block ends). Fixes #149.
da5821ae	2020-12-14T19:53:40	Fix testcase for issue #142.
5a44e327	2020-12-14T18:59:56	md_link_label_cmp: Fix the loop end condition. The old version likely could stop prematurely in a corner case when there was a Unicode character at the end of either string which maps into multiple fold info codepoints. Fixes #142.
3254b7cb	2020-11-13T12:02:39	md_process_table_block_contents: Suppress empty TBODY block generation. When the table has no body rows, do not call the callback with MD_BLOCK_TBODY events. Fixes #138.
4585088a	2020-11-13T10:16:34	md_analyze_permissive_url_autolink: Better GFM compatibility. The autolinks now allow unmatched parenthesis, only the trailing parenthesis closers are handled specially to deal with the situation the autolink is all inside an outer parenthesis. Somehow our tests were broken and avoided the cases with unmatched parenthesis pairs inside the auto-link. That's now fixed and in sync with GFM specs too. Fixes #135.
002f76c9	2020-10-18T09:37:45	md_resolve_links: Skip [...] used as a reference link/image label. Fixes #131.
c501c891	2020-07-30T10:13:05	Fix spelling of "than" in many occurrences. I often spell it erroneously as "then". I make this mistake way too often when typing fast.
c595c2ed	2020-07-30T08:38:19	md_process_verbatim_block_contents: Fix off-by-one error. This caused outputting wrong indentation inside fenced code blocks for lines indented with more than 16 spaces. Fixes #124.
0c4d7f3d	2020-07-28T07:18:23	test/normalize.py: Use html.escape instead of cgi.escape. Fixes #123.
d0e3ed79	2020-03-12T22:45:32	md2html: Skip UTF-8 BOM, if present in the input.
9e6ab76c	2020-02-17T12:41:50	Minor fuzz-input cleanup. Move some permissive links incorrectly placed in commonmark.md into gfm.md.
cc9a9d28	2020-02-16T15:29:54	test/fuzz-test: Add some fuzzing testing initial input.
5d7c3597	2020-02-16T13:46:16	md_analyze_emph: Detect correctly opener chain when resolving the range. Fixes #107.
b4c30cd6	2020-02-13T02:23:03	Improve wiki-link parsing. * Get rid of MD_LINE::total_indent. * Remove some special complicated branching for nested images: Instead we use md_rollback() on the wiki-link destination span to kill _any_ marks resolved so far, including the images. * Remove any length limit from label. Only destination length is limited, regardless of whether '|' is present or not. * Move the special handling of `[[foo|]]` from md_process_inlines() into md_resolve_links(). We simply expand the closer mark to consume the `|`. * Do not modify the opener and closer marks until we really know it is indeed a wiki-link.
403043bb	2020-01-16T16:15:08	md_mark_chain_append: Set next of the tail mark to -1. Fixes #104.
e6661f23	2020-01-10T19:27:10	Implement an underline extension. (#103) Closes #101.
82d7d087	2020-01-10T15:48:00	Rework/improve recognition of strike-through spans. Closes #102.
561f52e0	2020-01-05T18:33:46	md_is_autolink_email: Fix an off-by-one error. Fixes #100.
46f25f0b	2019-11-12T21:48:26	md_analyze_emph: Call md_resolve_range() with proper chain. Erroneously, we have called md_resolve_range() with mark chain derived from the closer mark. In the case that the opener and closer marks differ in length (and we have split one or the other), we pass in an incorrect chain, which may lead to strange behavior in subsequent analysis. Fixes #98.
e336e640	2019-11-04T15:20:59	Add support for Wiki links (#92) With a new flag MD_FLAG_WIKILINKS, recognize wiki-style links as [[foo]] and [[foo|bar]]. Update also the HTML renderer accordingly, to output a custom HTML tag <x-wikilink> when seeing it.
ef85cfc2	2019-11-04T15:05:07	Simplify parsing of tables (#97) We do so by removing the function md_is_table_row(). md_is_table_row() did some crazy inline parsing to detect whether the line contains at least one pipe which is not inside a code span or other high-priority inline element. This was very complicated under the hood and was actually breaking the clean design which separates block analysis from inline analysis of each block's contents. We now just use the table underline for determining that the block is a table and its properties, e.g. the column count. This means a paragraph now cannot interrupt a table. This is a change in behavior but likely an acceptable one, as it actually brings the behavior closer to that of tables in cmark-gfm in this regard. Last but not least, the old function seemed to prevent adoption of other useful features; for more about that, see the discussion in PR #92.
993c7b9b	2019-11-03T23:32:46	Render LaTeX math into HTML as a tag <x-equation>... ... instead of <equation>. This is to highlight that it is not a standard HTML tag.
e97d0250	2019-11-03T13:44:29	Link label comparison fixes. * md_link_label_cmp: To match the labels, the loop has to reach ends of the labels for both of them. * md_link_label_cmp_load_fold_info: Collapse consecutive whitespace into a single ' ' for the label comparison purposes. Fixes #96.
0354e1ab	2019-10-04T22:34:08	md_is_container_mark: Ordered list mark requires at least one digit. Fixes #95.
97606369	2019-07-07T11:19:21	Fix the last test case in latex-math.txt.
099ce69b	2019-07-07T11:15:44	Add missing file into git.
2e965941	2019-07-07T10:59:20	Add/improve docs for the LaTeX math spans.
8bac86aa	2019-07-07T09:46:10	Added support for LaTeX math (#87) Addresses #86.
ce8b5d94	2019-05-27T22:16:35	md_analyze_line: Blockquote with blank line can interrupt a paragraph. Fixes #83.
51386164	2019-05-19T11:46:26	md_link_label_cmp: Fix handling non-trivial folding info. Fixes #78.
4f6a9e54	2019-05-19T10:46:26	Update Unicode support to 12.1. * scripts/build_*_map.py: Implement helper pythonic scripts used to generate some Unicode search maps and data for helper Unicode functions used in MD4C. This should simplify updating to future Unicode versions. * md_get_unicode_fold_info: Use data generated by the scripts. * md_is_unicode_whitespace__: Ditto. * md_is_unicode_punct__: Ditto.
aca5c27f	2019-05-16T22:48:08	test/spec.txt: Update from upstream head.
64a1bc37	2019-05-15T23:25:05	test/coverage.txt: Sort the regression test cases by the issue number.
919a0cc9	2019-05-08T07:38:33	test/*.txt: Fix some formatting.
1757ff55	2019-05-07T23:10:46	test/spec_tests.py: Make ready for spec.txt from cmark-gfm project. This allows easier checking of our GFM dialect compatibility.
83047d3e	2019-05-07T22:24:29	md_analyze_permissive_url_autolink: Improve. * Fix domain recognition so that it has to have at least two dot-delimited components. * Fix handling of parentheses so that they have to form balanced pairs; i.e. the first ')' not having a preceding opener ends the path. Fixes #76.
609dfb0b	2019-05-05T15:56:51	md_analyze_line: Treat blank lines inside a HTML block more carefully... ... with respect to the parent list containers. Fixes #10 (but now really).
95279131	2019-04-30T00:32:36	When undoing complete block from ctx->block_bytesp[], reset ctx->current_block properly. Fixes #74.
d4d10915	2019-04-29T19:03:16	Improve parsing of inline raw HTML. * Isolate some common code for scanning HTML closer into a new function so most HTML scanner functions reuse the same code. * Improve the scanning for the closer so that on failure we remember the range where no closer is present. So any later scanning attempts may fail early. Fixes #73.
d7920b9c	2019-04-08T19:35:06	Merge pull request #67 from mity/spec-0.29 This merges all changes for CommonMark specification 0.28 -> 0.29 transition.
5b78f295	2019-04-08T11:00:27	test/spec.txt: Update from upstream head.
2a7b97ed	2019-04-05T08:18:54	test/spec.txt: Update from upstream head.
b8586987	2019-04-03T08:28:27	md_collect_mark: Add missing 'continue' to '~' branch. Fixes #69.
855a1bfc	2019-03-27T02:04:24	test/spec.txt: Update from upstream head.
94c86fe2	2019-03-26T14:45:23	Revert "Fix problematic link destinations with angle brackets." The updated specification now explicitly requests the behavior we implemented before fixing #24. This reverts commit 2e0a74ba990e291ef4eace047d50af05ca81daef. Also remove associated regression test as it is no longer valid.
0959975a	2019-03-26T14:01:02	md_analyze_emph: Follow specs changes to the "rule of three".
98968e22	2019-03-26T13:33:05	Update spec.txt from upstream head. (I previously used an updated revision of it by mistake.)
1edd0c9c	2019-03-26T11:49:25	test/spec.txt: Update to current upstream HEAD.
2dd96ab4	2019-03-12T09:56:11	Fix O(n^2) in handling the "rule of three". We had to break the list of potential '*' openers into multiple ones so we do not have to walk it when looking for matching length due to the "rule of three" for intraword delimiter runs. Fixes #63.
b2108652	2019-03-11T21:13:15	md_analyze_line: Fix O(n^2) in thematic break handling. Fixes #66.
37104fc2	2019-03-11T20:26:58	md_is_code_span: Fix crash at EOF. Fixes #65.
966b8e39	2019-03-11T19:56:46	md_is_link_title: Stop on ')' in ()-style title. Fixes #60.
fc27108e	2019-03-11T19:55:08	test/pathological_tests.py: Output test durations.
53f65852	2019-03-11T19:03:34	test/spec.txt: Little update. Somehow we had a slightly different spec.txt version than the one from CommonMark repo tag 0.28. But we still pass all its compliance test suite.
685b7144	2019-03-10T11:20:39	Move codespan detection from md_analyze_backtick() into... md_is_code_span(), called from md_collect_marks(). We have to do this at the same time as detecting raw inline HTML to follow CommonMark priority requirements. Also it is done very differently now: When scanning for the closer mark, we remember (the latest) position of potential closers for all other lengths as well. This means that: (1) If we find it, we reduced the task because every subsequent scan shall begin after the closer. (2) If we do not find it, then we have to reach the end of the block and hence we then know (for every allowed marker length) the position of last such backtick sequence. (3) That makes the guarantee that any subsequent call will either succeed in its scan (and reduce the task even further); or that we shall be able to detect instantly there is no suitable closer. I.e. every call either reduces the task by O(n) scan (1); or collects all the data in O(n) because (2) happens at most once; or fails in O(1) (3). This makes O(n) guarantee of the function complexity. Fixes #59.
0cb61205	2019-03-10T10:50:23	Move raw inline HTML detection from md_analyze_lt_qt() into md_collect_marks(). Fixes #58: For resolving raw inline HTML the function tried closer with all potential openers, because raw HTML can have '<' inside of an attribute. However this caused O(n^2) for input like "<><><><><><><>...". We solved it by handling raw HTML in an earlier stage, directly in md_collect_marks(), where we can scan linearly forward. Fixes #61: As a side effect, this also fixes the issue that MD_FLAG_NOHTMLSPANS disabled also recognition of CommonMark autolinks.
8e01a769	2019-02-10T22:58:42	Implement task lists. (#50) Fixes #30.
d32aa2e0	2019-02-09T10:40:52	Fix conflict in parsing permissive autolinks and ordinary links. The issue is caused by the fact that we do not know the exact position of a permissive auto-link at the time of md_collect_marks() because there is no syntax to mark its end in the first place. As a result, the closer mark in ctx->marks[] can eventually be somewhat out-of-order. As a consequence, if some other mark range (e.g. ordinary link) shadows the auto-link, the closer mark may be left outside the shadowed range and survive till the phase when we generate the output. We fix it by using an extra mark flag to remember we really did output the opener mark, and output the closer only in such case. Fixes #53.
67401e70	2019-02-06T04:31:25	md_analyze_inlines: Resolve table cell boundaries before links. This brings some corner cases closer to cmark-gfm. Also fixes #51.
8fc692ba	2018-06-11T18:17:26	md_rollback: Do not touch TABLECELLBOUNDARIES chain. This chain is not normal opener/closer inline mark chain. Fixes #42.
e6e2ea4c	2018-06-11T11:43:47	md_analyze_line: Fix mixing list and table parsing. If table header underline is not nested the same way as the preceding line (i.e. the wannabe table header line), then it cannot form a table. Fixes #41.
4ef024fb	2018-05-29T23:30:02	md_process_inlines: Fix link/image closers spanning over multiple lines. Fixes #40.
7deaccf6	2018-05-29T18:32:10	md_is_link_label: Fix if the link label contains just backslash escapes. The function did not remember the label start line index, leading to bad consequences. Fixes #39.
bf022cb6	2018-05-28T21:14:45	Fix md_split_simple_pairing_mark(). When splitting a mark into two, make sure each of them gets the right share of dummies for the case that we will have to split once more. Fixes #36.
e7b84d65	2018-05-28T21:09:32	pathological_tests.py: Fix test compatibility with Windows.
81e2a5ca	2018-04-12T17:03:37	pathological_tests.py: Test deeply nested lists.
0d1a41a4	2018-03-28T08:21:21	md_build_attr_append_substr: Fix +1 allocation error. Fixes #33.
19b24bdd	2017-08-16T18:15:19	Simplify the pathological test "many references".
07cec7dc	2017-08-16T16:34:50	Add regression test for #24.
ee3bee1a	2017-08-02T00:37:18	Upgrade to CommonMark specification 0.28.
c52a50a3	2017-07-24T21:07:09	pathological_tests.py: Add test for reference definition lookup.
938460d5	2017-07-24T21:17:53	Improve/unify output of test scripts.
c51fb310	2017-07-24T23:14:05	md_analyze_marks: Walk only required range of the marks. With this change, when recursing to analysis of link contents, only the marks between the link opener and closer are iterated in md_analyze_marks(). Fixes #22.
a27aefde	2017-07-24T20:17:50	pathological_tests.py: Allow short option -p as a synonym of --program.
f4f7b223	2017-07-24T20:15:09	pathological_tests.py: Allow Windowish line ends.
26f14899	2017-07-24T19:27:27	Add pathological_tests.py from cmark.
ad4f28bb	2017-07-24T20:09:23	md_analyze_simple_pairing_mark: Fix the "rule of three". If the first emphasis opener is refused due to the rule of three, a previous opener is examined. However the variable opener_orig_size_module3 was not (re)set accordingly. Fixes #21.
cfbce759	2017-07-18T18:01:02	Rework ref. def. dictionary. It now uses FNV1a and we now sort/bsearch only contents of single bucket. Additionally we fix #20 by disabling the invalid ref. definitions during hashtable build.
f2821cbd	2017-07-14T17:10:45	md_analyze_permissive_email_autolink: Make it compatible with CMark-gfm.
1bc7f3a8	2017-07-14T02:22:15	render_url_escaped: Fix escaping of ampersand. This affected generating the href attribute of links or the src attribute of images.
f3f9404e	2017-07-14T02:06:23	Improve URL autolinks extension. It is now much more compatible to Cmark-gfm. With the flag MD_FLAG_PERMISSIVEWWWAUTOLINKS, we now also support the WWW autolinks (when the http: scheme is omitted).
25a156ee	2017-07-12T23:30:14	Implement strikethrough extension.
8999e184	2017-01-04T14:40:44	Fix "rule of three" for emphasis resolution (issue #14).
c63909df	2017-01-04T15:04:09	When splitting emphasis opener mark, we have to retain 'dummy' marks available for more splitting in the future (issue #15).
52712384	2016-12-27T22:47:03	When parsing tables, pipes inside a link/image/code span cannot make cell boundary (issue #7).
f9b4cb8f	2016-12-15T16:47:41	md_process_inlines: Fix when an expanded mark shadows some nested marks (issue #11).
c235a02e	2016-12-15T13:18:48	test/coverage.txt: Add some tests for higher code coverage.
a725fee3	2016-12-14T16:51:24	md_enter_child_containers: Fix crash (issue #10). Calling md_push_container_bytes() may result in ending a current block which may result in removing some contents from ctx->block_bytes when removing some lines with link reference definitions. This in effect means we have to end the block explicitly before storing the offset into the ctx->block_bytes.
ba29d007	2016-12-12T23:31:59	md_is_link_reference_definition: Fix handling of multiline label (issue #9).
09ae8609	2016-12-07T23:56:47	Handle images more like links. Remove MD_SPAN_IMG_DETAIL::alt. Instead, the contents of the image is propagated to the renderer via MD_RENDERER::text() callback. * This fixes handling of entities inside the image text (issue #4). * It simplifies parsing and, more importantly, it better distinguishes what is the responsibility of the parser or the renderer respectively. * This allows more flexibility on the renderer's side. Renderers which do not really support images can just output the image content as any other text. The cost is that a renderer into HTML (if it wants to render image contents into the attribute ALT of the IMG tag) has to handle images with more care. Typically such a renderer has to track whether it is inside an image, and if so, then render span enter/leave as an empty string.
23312d6d	2016-12-05T11:13:43	md_is_html_tag: Fix parsing unquoted attribute value (issue #2).
b40d5950	2016-12-04T17:01:00	Fix file permissions of python scripts.
be7fcc16	2016-11-21T09:47:31	Implement tables. Note it is implemented as an extension. To enable it, the flag MD_FLAG_TABLES must be explicitly specified.
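
The closer-scanning scheme described in commit 685b7144 can be illustrated with a small C sketch. This is not MD4C's actual code: the names `CloserCache`, `find_code_span_closer`, `MAX_MARK_LEN` and `NO_CLOSER` are hypothetical, the `steps` counter exists only to make the cost visible, and the sketch assumes openers are queried left to right, that `opener_beg` points at the first backtick of a run, and that escapes and line boundaries can be ignored.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define MAX_MARK_LEN 32            /* hypothetical cap on backtick run length */
#define NO_CLOSER ((size_t)-1)     /* returned when no closer exists */

typedef struct {
    size_t last_run[MAX_MARK_LEN]; /* start of the latest seen run of length i+1 */
    int    seen[MAX_MARK_LEN];     /* nonzero once last_run[i] is valid */
    int    reached_end;            /* nonzero once some scan hit the end of text */
    size_t steps;                  /* instrumentation only: scan loop iterations */
} CloserCache;

static void closer_cache_init(CloserCache* c) { memset(c, 0, sizeof *c); }

/* Length of the backtick run starting at off (0 if there is none). */
static size_t run_length(const char* text, size_t size, size_t off)
{
    size_t n = 0;
    while (off + n < size && text[off + n] == '`')
        n++;
    return n;
}

/* Return the offset of the closing backtick run matching the opener run at
 * opener_beg, or NO_CLOSER. Openers must be queried in increasing order. */
static size_t find_code_span_closer(const char* text, size_t size,
                                    size_t opener_beg, CloserCache* c)
{
    size_t mark_len, off;

    assert(opener_beg < size && text[opener_beg] == '`');
    mark_len = run_length(text, size, opener_beg);
    if (mark_len > MAX_MARK_LEN)
        return NO_CLOSER;          /* sketch: overlong runs never open a span */
    off = opener_beg + mark_len;

    /* Case (3): an earlier scan reached the end, so the cache knows the last
     * run of every length. No such run after the opener means O(1) failure. */
    if (c->reached_end &&
        (!c->seen[mark_len - 1] || c->last_run[mark_len - 1] < off))
        return NO_CLOSER;

    while (off < size) {
        size_t n;
        c->steps++;
        if (text[off] != '`') {
            off++;
            continue;
        }
        n = run_length(text, size, off);
        if (n == mark_len)
            return off;            /* case (1): found, later scans start after it */
        if (n <= MAX_MARK_LEN) {   /* remember runs of other lengths as we pass */
            c->last_run[n - 1] = off;
            c->seen[n - 1] = 1;
        }
        off += n;
    }
    c->reached_end = 1;            /* case (2): happens at most once per block */
    return NO_CLOSER;
}
```

Under these assumptions each call either returns a closer, which lets the caller skip past it, performs the single full scan to the end, or fails without touching the text, so the total work over one block stays linear.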

d3c1c0bb

2022-01-14T17:27:05

fuzz-mdhtml.c: Cleanup of the code.

b42e7f5c

2022-01-10T11:41:25

md_resolve_links: Avoid link ref. def. lookup if... if we know that the bracket pair contains nested brackets. That makes the label invalid anyway, therefore we know that there is no link ref. def. to be found anyway. In case of heavily nested bracket pairs, the lookup could lead to quadratic parsing times. Fixes #172.

7f44e1ad

2022-01-10T10:39:29

pathological_tests.py: Improve code alignment.

a8bb4d30

2022-01-06T16:01:55

md_is_table_underline: Remove requirement for minimal length of a cell underline. Fixes #169.

c01aa6b3

2021-06-27T18:28:26

Update CommonMark spec file to v. 0.30

bcb55d0d

2021-04-14T09:18:09

md_resolve_links: Suppress bogus nested permissive autolink. Fixes #152.

3478ec69

2021-02-23T14:01:31

Added fuzzer for oss-fuzz integration. (#151)

fd7b5fe0

2021-02-05T21:40:47

md_analyze_line: Fix implicit ending of HTML blocks... ... when the HTML block is not explicitly ended (before the enclosing container block ends). Fixes #149.

da5821ae

2020-12-14T19:53:40

Fix testcase for issue #142.

5a44e327

2020-12-14T18:59:56

md_link_label_cmp: Fix the loop end condition. The old version likely could stop prematurely in a corner case when there was a Unicode character at the end of the either string, which maps into multiple fold info codepoints. Fixes #142.

3254b7cb

2020-11-13T12:02:39

md_process_table_block_contents: Suppress empty TBODY block generation. When the table has no body rows, do not call the callback with MD_BLOCK_TBODY events. Fixes #138.

4585088a

2020-11-13T10:16:34

md_analyze_permissive_url_autolink: Better GFM compatibility. The autolinks now allow unmatched parenthesis, only the trailing parenthesis closers are handled specially to deal with the situation the autolink is all inside an outer parenthesis. Somehow our tests were broken and avoided the cases with unmatched parenthesis pairs inside the auto-link. That's now fixed and in sync with GFM specs too. Fixes #135.

002f76c9

2020-10-18T09:37:45

md_resolve_links: Skip [...] used as a reference link/image label. Fixes #131.

c501c891

2020-07-30T10:13:05

Fix spelling of "than" in many occurances. I often spell it errorneously as "then". Doing this mistake way too often when typing fast.

c595c2ed

2020-07-30T08:38:19

md_process_verbatim_block_contents: Fix off by 1 error. This caused outputting wrong indentation inside a fenced code blocks for lines indented with mor ethan 16 spaces. Fixes #124.

0c4d7f3d

2020-07-28T07:18:23

test/normalize.py: Use html.escape instead of cgi.escape. Fixes #123.

d0e3ed79

2020-03-12T22:45:32

md2html: Skip UTF-8 BOM, if present in the input.

9e6ab76c

2020-02-17T12:41:50

Minor fuzz-input cleanup. Move some permissive links incorrectly placed in commonmark.md into gfm.md.

cc9a9d28

2020-02-16T15:29:54

test/fuzz-test: Add some fuzzing testing initial input.

5d7c3597

2020-02-16T13:46:16

md_analyze_emph: Detect correctly opener chain when resolving the range. Fixes #107.

b4c30cd6

2020-02-13T02:23:03

Improve wiki-link parsing. * Get rid of MD_LINE::total_indent. * Remove some special complicated branching for nested images: Instead we use md_rollback() the wiki-link destination span to kill _any_ marks resolved so far, including the images. * Remove any length limit from label. Only destination length is limited, regardless of whether '|' is present or not. * Move the special handling of `[[foo|]]` from md_process_inlines() into md_resolve_links(). We simply expand the closer mark to consume the `|`. * Do not modify the opener and closer marks until we really know it is indeed a wiki-link.

403043bb

2020-01-16T16:15:08

md_mark_chain_append: Set next of the tail mark to -1. Fixes #104.

e6661f23

2020-01-10T19:27:10

Implement an underline extension. (#103) Closes #101.

82d7d087

2020-01-10T15:48:00

Rework/improve recognition of strike-through spans. Closes #102.

561f52e0

2020-01-05T18:33:46

md_is_autolink_email: Fix an off-by-one error. Fixes #100.

46f25f0b

2019-11-12T21:48:26

md_analyze_emph: Call md_resolve_range() with proper chain. Errorneously, we have called md_resolve_range() with mark chain derived from the closer mark. In the case that the opener and closer marks differ in length (and we have split one or the other), we pass in an incorrect chain, which may lead to strange behavior in subsequent analysis. Fixes #98.

e336e640

2019-11-04T15:20:59

Add support for Wiki links (#92) With a new flag MD_FLAG_WIKILINKS, recoginize wiki-style links as [[foo]] and [[foo|bar]]. Update also the HTML renderer accordingly, to output a custom HTML tag <x-wikilink> when seeing it.

ef85cfc2

2019-11-04T15:05:07

Simplify parsing of tables (#97) We do so by removing the function md_is_table_row(). md_is_table_row() did some crazy inline parsing to detect whether the line contains at least one pipe which is not inside a code span or other high-priority inline element. This was very complicated under the hood and to was actually breaking the clean design which separates block analysis parse and inline analysis of each block contents. We now just use the table underline for determining the block is table and its properties like e.g. the column count. This means a paragraph now cannot interrupt a table. This is a change in a behavior but likely acceptable one as it actually brings the behavior closer to behavior of tables in cmark-gfm in this regard. Last but not least, it seems to prevent adoption of other useful features, for about that, see the discussion in PR #92.

993c7b9b

2019-11-03T23:32:46

Render LaTeX math into HTML as a tag <x-equation>... ... instead of <equation>. This is to highlight that it is not a standard HTML tag.

e97d0250

2019-11-03T13:44:29

Link label comparision fixes. * md_link_label_cmp: To match the labels, the loop has to reach ends of the labels for both of them. * md_link_label_cmp_load_fold_info: Collapse consequtive whitespace into a single ' ' for the label comparison purposes. Fixes #96.

0354e1ab

2019-10-04T22:34:08

md_is_container_mark: Ordered list mark requires at least one digit. Fixes #95.

97606369

2019-07-07T11:19:21

Fix the last test case in latex-math.txt.

099ce69b

2019-07-07T11:15:44

Add missing file into git.

2e965941

2019-07-07T10:59:20

Add/improve docs for the LaTeX math spans.

8bac86aa

2019-07-07T09:46:10

Added support for LaTeX math (#87) Addresses #86.

ce8b5d94

2019-05-27T22:16:35

md_analyze_line: Blockquote with blank line can interrupt a paragraph. Fixes #83.

51386164

2019-05-19T11:46:26

md_link_label_cmp: Fix handling non-trivial folding info. Fixes #78.

4f6a9e54

2019-05-19T10:46:26

Update Unicode support to 12.1. * scipts/build_*_map.py: Implement helper pythonic scripts used to generate some Unicode search maps and data for helper Unicode functions used in MD4C. This should simplify updating to future Unicode versions. * md_get_unicode_fold_info: Use data generated by the scripts. * md_is_unicode_whitespace__: Ditto. * md_is_unicode_punct__: Ditto.

aca5c27f

2019-05-16T22:48:08

test/spec.txt: Update from upstream head.

64a1bc37

2019-05-15T23:25:05

test/coverage.txt: Sort the regression test cases by the issue number.

919a0cc9

2019-05-08T07:38:33

test/*.txt: Fix some formatting.

1757ff55

2019-05-07T23:10:46

test/spec_tests.py: Make ready for spec.txt from cmark-gfm project. This allows easier checking of our GFM dialect compatibility.

83047d3e

2019-05-07T22:24:29

md_analyze_permissive_url_autolink: Improve. * Fix domain recognition so that it has to have at least two dot-delimited components. * Fix handling if parenthesis so that they have to form balanced pairs; i.e. the first ')' not having a preceding opener ends the path. Fixes #76.

609dfb0b

2019-05-05T15:56:51

md_analyze_line: Treat blank lines inside a HTML block more carefully... ... with respect to the parent list containers. Fixes #10 (but now really).

95279131

2019-04-30T00:32:36

When undoing complete block from ctx->block_bytesp[], reset ctx->current_block properly. Fixes #74.

d4d10915

2019-04-29T19:03:16

Improve parsing of inline raw HTML. * Isolate some common code for scanning HTML closer into a new function so most HTML scanner functions reuse the same code. * Improve the scanning for the closer so that on failure we remember the range where no closer is present. So any later scanning attempts may fail early. Fixes #73.

d7920b9c

2019-04-08T19:35:06

Merge pull request #67 from mity/spec-0.29 This merges all changes for CommonMark specification 0.28 -> 0.29 transition.

5b78f295

2019-04-08T11:00:27

test/spec.txt: Update from upstream head.

2a7b97ed

2019-04-05T08:18:54

test/spec.txt: Update from upstream head.

b8586987

2019-04-03T08:28:27

md_collect_mark: Add missing 'continue' to '~' branch. Fixes #69.

855a1bfc

2019-03-27T02:04:24

test/spec.txt: Update from upstream head.

94c86fe2

2019-03-26T14:45:23

Revert "Fix problematic link destinations with angle brackets." The updated specification now explicitly requests the behavior we implemented before fixing #24. This reverts commit 2e0a74ba990e291ef4eace047d50af05ca81daef. Also remove associated regression test as it is no longer valid.

0959975a

2019-03-26T14:01:02

md_analyze_emph: Follow specs changes to the "rule of three".

98968e22

2019-03-26T13:33:05

Update spec.txt from upstream head. (I previously used an updated revision of it by mistake.)

1edd0c9c

2019-03-26T11:49:25

test/spec.txt: Update to current upstream HEAD.

2dd96ab4

2019-03-12T09:56:11

Fix O(n^2) in handling the "rule of three". We had to break the list of potential '*' openers into multiple ones so we do not have to walk it when looking for matching length due to the "rule of three" for intraword delimiter runs. Fixes #63.

b2108652

2019-03-11T21:13:15

md_analyze_line: Fix O(n^2) in thematic break handling. Fixes #66.

37104fc2

2019-03-11T20:26:58

md_is_code_span: Fix crash at EOF. Fixes #65.

966b8e39

2019-03-11T19:56:46

md_is_link_title: Stop on ')' lin ()-style title. Fixes #60.

fc27108e

2019-03-11T19:55:08

test/pathological_tests.py: Output test durations.

53f65852

2019-03-11T19:03:34

test/spec.txt: Little update. Somehow we were having little different spec.txt version that the one from CommonMark repo tag 0.28. But we still pass all its compliance test suite.

685b7144

2019-03-10T11:20:39

Move codespan detection from md_analyze_backtick() into... md_is_code_span(), called from md_collect_marks(). We have to do this at the same time as detecting raw inline HTML to follow CommonMark priority requirements. Also it is done very differently now: When scanning for the closer mark, we remember (the latest) position of potential closers for all other lengths as well. This means that: (1) If we find it, we reduced the task because all subsequent scan shall begin after the closer. (2) If we do not find it, then we have to reach the end of the block and hence we then know (for every allowed marker length) the position of last such backtick sequence. (3) That makes the guaranty that any subsequent call with either succeed in its scan (and reduce the task even further); or that we shall be able to detect instantly there is no suitable closer. I.e. every call either reduces the task by O(n) scan (1); or collects all the data in O(n) because (2) happens at most once; or fails in O(1) (3). This makes O(n) guaranty of the function complexity. Fixes #59.

0cb61205

2019-03-10T10:50:23

Move raw inline HTML detection from md_analyze_lt_qt() into md_collect_marks(). Fixes #58: For resolving raw inline HTML the function tried closer with all potential openers, because raw HTML can have '<' inside of an attribute. However this caused O(n^2) for input like "<><><><><><><>...". We solved by handling raw HTML in earlier stage, directly in md_collect_marks(), where we can scan linerary forward. Fixes #61: As a side effect, this also fixes the issue that MD_FLAG_NOHTMLSPANS disabled also recognition of CommonMark autolinks.

8e01a769

2019-02-10T22:58:42

Implement task lists. (#50) Fixes #30.

d32aa2e0

2019-02-09T10:40:52

Fix conflict in parsing permissive autolinks and ordinary links. The issues is caused by the fact that we do not know exact position of permissive auto-link in time of md_collect_marks() because there is no syntax to mark its end on the 1st place. This causes that eventually, the closer mark in ctx->marks[] can be out-of-order somewhat. As a consequence, if some other mark range (e.g. ordinary link) shadows the auto-link, the closer mark may be left outside the shadowed range and survive till the phase when we generate the output. We fix by using an extra mark flag to remember we did really output the opener mark, and output the closer only in such case. Fixes #53.

67401e70

2019-02-06T04:31:25

md_analyze_inlines: Resolve table cell boundaries before links. This brings some corner cases closer to cmark-gfm. Also fixes #51.

8fc692ba

2018-06-11T18:17:26

md_rollback: Do not touch TABLECELLBOUNDARIES chain. This chain is not normal opener/closer inline mark chain. Fixes #42.

e6e2ea4c

2018-06-11T11:43:47

md_analyze_line: Fix mixing list and table parsing. If table header underline is not nested the same way as the preceding line (i.e. the wannabe table header line), then it cannot form a table. Fixes #41.

4ef024fb

2018-05-29T23:30:02

md_process_inlines: Fix link/image closers spanning over multiple lines. Fixes #40.

7deaccf6

2018-05-29T18:32:10

md_is_link_label: Fix if the link label contains just backslash escapes. The function did not remember the label start line index, leading to bad consequences. Fixes #39.

bf022cb6

2018-05-28T21:14:45

Fix md_split_simple_pairing_mark(). When splitting a mark into two, make sure each of them gets the right share od dummies for case that we will have to split once more. Fixes #36.

e7b84d65

2018-05-28T21:09:32

pathological_tests.py: Fix test compatibility with Windows.

81e2a5ca

2018-04-12T17:03:37

pathological_tests.py: Test deeply nested lists.

0d1a41a4

2018-03-28T08:21:21

md_build_attr_append_substr: Fix +1 allocation error. Fixes #33.

19b24bdd

2017-08-16T18:15:19

Simplify the pathological test "many references".

07cec7dc

2017-08-16T16:34:50

Add regression test for #24.

ee3bee1a

2017-08-02T00:37:18

Upgrade to CommonMark specification 0.28.

c52a50a3

2017-07-24T21:07:09

pathological_tests.py: Add test for reference definition lookup.

938460d5

2017-07-24T21:17:53

Improve/unify output of test scripts.

c51fb310

2017-07-24T23:14:05

md_analyze_marks: Walk only the required range of the marks. With this change, when recursing into the analysis of link contents, only the marks between the link opener and closer are iterated in md_analyze_marks(). Fixes #22.

a27aefde

2017-07-24T20:17:50

pathological_tests.py: Allow short option -p as a synonym of --program.

f4f7b223

2017-07-24T20:15:09

pathological_tests.py: Allow Windowish line ends.

26f14899

2017-07-24T19:27:27

Add pathological_tests.py from cmark.

ad4f28bb

2017-07-24T20:09:23

md_analyze_simple_pairing_mark: Fix the "rule of three". If the first emphasis opener is refused due to the rule of three, a previous opener is examined. However, the variable opener_orig_size_module3 was not (re)set accordingly. Fixes #21.
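
For readers unfamiliar with the "rule of three", here is a minimal sketch of the check as worded in the current CommonMark specification. The function name and parameters are hypothetical and chosen for illustration; this is not md4c's actual code.

```c
#include <stddef.h>

/* Hypothetical helper (not part of md4c): returns 1 if the CommonMark
 * "rule of three" forbids pairing an opener delimiter run with a closer
 * delimiter run, 0 otherwise. */
static int rule_of_three_blocks(size_t opener_len, size_t closer_len,
                                int opener_can_close, int closer_can_open)
{
    /* The rule applies only if one of the runs can both open and close. */
    if (!opener_can_close && !closer_can_open)
        return 0;

    /* A combined length that is not a multiple of 3 is always fine. */
    if ((opener_len + closer_len) % 3 != 0)
        return 0;

    /* A multiple of 3 is allowed only when both lengths are multiples of 3. */
    return !(opener_len % 3 == 0 && closer_len % 3 == 0);
}
```

When a candidate opener is rejected by such a check and an earlier opener is tried instead, any cached value derived from the opener length (such as its length modulo 3) has to be recomputed for the new candidate, which is the kind of bug the commit above describes.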

cfbce759

2017-07-18T18:01:02

Rework ref. def. dictionary. It now uses FNV1a and we now sort/bsearch only contents of single bucket. Additionally we fix #20 by disabling the invalid ref. definitions during hashtable build.
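
As an illustration of the hashing step, the following is a minimal sketch of the standard 32-bit FNV-1a hash and of mapping a label to a bucket. The function names and the modulo-based bucket selection are assumptions made for this example; md4c's real dictionary code may differ (for instance, it has to handle case folding of labels).

```c
#include <stddef.h>
#include <stdint.h>

/* Standard 32-bit FNV-1a: XOR each byte into the hash, then multiply
 * by the FNV prime. */
static uint32_t fnv1a_32(const unsigned char *data, size_t len)
{
    uint32_t hash = 2166136261u;          /* FNV offset basis */
    for (size_t i = 0; i < len; i++) {
        hash ^= data[i];
        hash *= 16777619u;                /* FNV prime */
    }
    return hash;
}

/* Hypothetical helper: pick the bucket for a label. Only the entries
 * of that single bucket would then need to be sorted and searched. */
static size_t bucket_index(const char *label, size_t len, size_t n_buckets)
{
    return fnv1a_32((const unsigned char *)label, len) % n_buckets;
}
```

Because equal labels always hash to the same bucket, sorting and binary search can be limited to one bucket's contents instead of the whole set of definitions.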

f2821cbd

2017-07-14T17:10:45

md_analyze_permissive_email_autolink: Make it compatible with CMark-gfm.

1bc7f3a8

2017-07-14T02:22:15

render_url_escaped: Fix escaping of ampersand. This affected generating the href attribute of links and the src attribute of images.
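
To show why the ampersand needs special care, here is a minimal sketch of writing a URL into an HTML attribute value with '&' emitted as "&amp;". The helper name, its signature, and its error convention are hypothetical; the real render_url_escaped() also deals with other characters, which this sketch ignores.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not md4c's function): copy url into out, writing
 * each '&' as "&amp;". Returns 0 on success, or -1 if out is too small
 * to hold the result plus the terminating NUL. */
static int escape_ampersands(const char *url, char *out, size_t out_size)
{
    size_t n = 0;

    for (; *url != '\0'; url++) {
        if (*url == '&') {
            if (n + 5 >= out_size)        /* need 5 bytes plus NUL */
                return -1;
            memcpy(out + n, "&amp;", 5);
            n += 5;
        } else {
            if (n + 1 >= out_size)        /* need 1 byte plus NUL */
                return -1;
            out[n++] = *url;
        }
    }
    if (n >= out_size)
        return -1;
    out[n] = '\0';
    return 0;
}
```

For example, the input `a?x=1&y=2` would be written as `a?x=1&amp;y=2` inside the href or src attribute.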

f3f9404e

2017-07-14T02:06:23

Improve URL autolinks extension. It is now much more compatible with cmark-gfm. With the flag MD_FLAG_PERMISSIVEWWWAUTOLINKS, we now also support WWW autolinks (where the http: scheme is omitted).

25a156ee

2017-07-12T23:30:14

Implement strikethrough extension.

8999e184

2017-01-04T14:40:44

Fix "rule of three" for emphasis resolution (issue #14).

c63909df

2017-01-04T15:04:09

When splitting emphasis opener mark, we have to retain 'dummy' marks available for more splitting in the future (issue #15).

52712384

2016-12-27T22:47:03

When parsing tables, pipes inside a link/image/code span cannot make cell boundary (issue #7).

f9b4cb8f

2016-12-15T16:47:41

md_process_inlines: Fix when an expanded mark shadows some nested marks (issue #11).

c235a02e

2016-12-15T13:18:48

test/coverage.txt: Add some tests for higher code coverage.

a725fee3

2016-12-14T16:51:24

md_enter_child_containers: Fix crash (issue #10). Calling md_push_container_bytes() may result in ending a current block which may result in removing some contents from ctx->block_bytes when removing some lines with link reference definitions. This in effect means we have to end the block explicitly before storing the offset into the ctx->block_bytes.

ba29d007

2016-12-12T23:31:59

md_is_link_reference_definition: Fix handling of multiline label (issue #9).

09ae8609

2016-12-07T23:56:47

Handle images more like links. Remove MD_SPAN_IMG_DETAIL::alt. Instead, the contents of the image are propagated to the renderer via the MD_RENDERER::text() callback. * This fixes handling of entities inside the image text (issue #4). * It simplifies parsing and, more importantly, it better distinguishes what is the responsibility of the parser and what is the responsibility of the renderer. * This allows more flexibility on the renderer's side. Renderers that do not really support images can just output the image content as any other text. The cost is that a renderer into HTML (if it wants to render image contents into the ALT attribute of the IMG tag) has to handle images with more care. Typically such a renderer has to track whether it is inside an image and, if so, render span enter/leave as an empty string.

23312d6d

2016-12-05T11:13:43

md_is_html_tag: Fix parsing unquoted attribute value (issue #2).

b40d5950

2016-12-04T17:01:00

Fix file permissions of python scripts.

be7fcc16

2016-11-21T09:47:31

Implement tables. Note it is implemented as an extension. To enable it, the flag MD_FLAG_TABLES must be explicitly specified.
