|
d3c1c0bb
|
2022-01-14T17:27:05
|
|
fuzz-mdhtml.c: Cleanup of the code.
|
|
b42e7f5c
|
2022-01-10T11:41:25
|
|
md_resolve_links: Avoid link ref. def. lookup if...
if we know that the bracket pair contains nested brackets. That makes
the label invalid anyway, therefore we know that there is no link ref.
def. to be found anyway.
In case of heavily nested bracket pairs, the lookup could lead to
quadratic parsing times.
Fixes #172.
|
|
7f44e1ad
|
2022-01-10T10:39:29
|
|
pathological_tests.py: Improve code alignment.
|
|
a8bb4d30
|
2022-01-06T16:01:55
|
|
md_is_table_underline: Remove requirement for minimal length of a cell underline.
Fixes #169.
|
|
c01aa6b3
|
2021-06-27T18:28:26
|
|
Update CommonMark spec file to v. 0.30
|
|
bcb55d0d
|
2021-04-14T09:18:09
|
|
md_resolve_links: Suppress bogus nested permissive autolink.
Fixes #152.
|
|
3478ec69
|
2021-02-23T14:01:31
|
|
Added fuzzer for oss-fuzz integration. (#151)
|
|
fd7b5fe0
|
2021-02-05T21:40:47
|
|
md_analyze_line: Fix implicit ending of HTML blocks...
... when the HTML block is not explicitly ended (before the enclosing
container block ends).
Fixes #149.
|
|
da5821ae
|
2020-12-14T19:53:40
|
|
Fix testcase for issue #142.
|
|
5a44e327
|
2020-12-14T18:59:56
|
|
md_link_label_cmp: Fix the loop end condition.
The old version likely could stop prematurely in a corner case when
there was a Unicode character at the end of the either string, which
maps into multiple fold info codepoints.
Fixes #142.
|
|
3254b7cb
|
2020-11-13T12:02:39
|
|
md_process_table_block_contents: Suppress empty TBODY block generation.
When the table has no body rows, do not call the callback with
MD_BLOCK_TBODY events.
Fixes #138.
|
|
4585088a
|
2020-11-13T10:16:34
|
|
md_analyze_permissive_url_autolink: Better GFM compatibility.
The autolinks now allow unmatched parenthesis, only the trailing
parenthesis closers are handled specially to deal with the situation the
autolink is all inside an outer parenthesis.
Somehow our tests were broken and avoided the cases with unmatched
parenthesis pairs inside the auto-link. That's now fixed and in sync
with GFM specs too.
Fixes #135.
|
|
002f76c9
|
2020-10-18T09:37:45
|
|
md_resolve_links: Skip [...] used as a reference link/image label.
Fixes #131.
|
|
c501c891
|
2020-07-30T10:13:05
|
|
Fix spelling of "than" in many occurances.
I often spell it errorneously as "then". Doing this mistake way too
often when typing fast.
|
|
c595c2ed
|
2020-07-30T08:38:19
|
|
md_process_verbatim_block_contents: Fix off by 1 error.
This caused outputting wrong indentation inside a fenced code blocks for
lines indented with mor ethan 16 spaces.
Fixes #124.
|
|
0c4d7f3d
|
2020-07-28T07:18:23
|
|
test/normalize.py: Use html.escape instead of cgi.escape.
Fixes #123.
|
|
d0e3ed79
|
2020-03-12T22:45:32
|
|
md2html: Skip UTF-8 BOM, if present in the input.
|
|
9e6ab76c
|
2020-02-17T12:41:50
|
|
Minor fuzz-input cleanup.
Move some permissive links incorrectly placed in commonmark.md into
gfm.md.
|
|
cc9a9d28
|
2020-02-16T15:29:54
|
|
test/fuzz-test: Add some fuzzing testing initial input.
|
|
5d7c3597
|
2020-02-16T13:46:16
|
|
md_analyze_emph: Detect correctly opener chain when resolving the range.
Fixes #107.
|
|
b4c30cd6
|
2020-02-13T02:23:03
|
|
Improve wiki-link parsing.
* Get rid of MD_LINE::total_indent.
* Remove some special complicated branching for nested images: Instead
we use md_rollback() the wiki-link destination span to kill _any_
marks resolved so far, including the images.
* Remove any length limit from label. Only destination length is
limited, regardless of whether '|' is present or not.
* Move the special handling of `[[foo|]]` from md_process_inlines()
into md_resolve_links(). We simply expand the closer mark to consume
the `|`.
* Do not modify the opener and closer marks until we really know it
is indeed a wiki-link.
|
|
403043bb
|
2020-01-16T16:15:08
|
|
md_mark_chain_append: Set next of the tail mark to -1.
Fixes #104.
|
|
e6661f23
|
2020-01-10T19:27:10
|
|
Implement an underline extension. (#103)
Closes #101.
|
|
82d7d087
|
2020-01-10T15:48:00
|
|
Rework/improve recognition of strike-through spans.
Closes #102.
|
|
561f52e0
|
2020-01-05T18:33:46
|
|
md_is_autolink_email: Fix an off-by-one error.
Fixes #100.
|
|
46f25f0b
|
2019-11-12T21:48:26
|
|
md_analyze_emph: Call md_resolve_range() with proper chain.
Errorneously, we have called md_resolve_range() with mark chain derived
from the closer mark. In the case that the opener and closer marks
differ in length (and we have split one or the other), we pass in an
incorrect chain, which may lead to strange behavior in subsequent
analysis.
Fixes #98.
|
|
e336e640
|
2019-11-04T15:20:59
|
|
Add support for Wiki links (#92)
With a new flag MD_FLAG_WIKILINKS, recoginize wiki-style links
as [[foo]] and [[foo|bar]].
Update also the HTML renderer accordingly, to output a custom
HTML tag <x-wikilink> when seeing it.
|
|
ef85cfc2
|
2019-11-04T15:05:07
|
|
Simplify parsing of tables (#97)
We do so by removing the function md_is_table_row().
md_is_table_row() did some crazy inline parsing to detect whether the
line contains at least one pipe which is not inside a code span or other
high-priority inline element.
This was very complicated under the hood and to was actually breaking
the clean design which separates block analysis parse and inline analysis
of each block contents.
We now just use the table underline for determining the block is table
and its properties like e.g. the column count.
This means a paragraph now cannot interrupt a table. This is a change in
a behavior but likely acceptable one as it actually brings the behavior
closer to behavior of tables in cmark-gfm in this regard.
Last but not least, it seems to prevent adoption of other useful
features, for about that, see the discussion in PR #92.
|
|
993c7b9b
|
2019-11-03T23:32:46
|
|
Render LaTeX math into HTML as a tag <x-equation>...
... instead of <equation>. This is to highlight that it is not a
standard HTML tag.
|
|
e97d0250
|
2019-11-03T13:44:29
|
|
Link label comparision fixes.
* md_link_label_cmp: To match the labels, the loop has to reach ends of
the labels for both of them.
* md_link_label_cmp_load_fold_info: Collapse consequtive whitespace
into a single ' ' for the label comparison purposes.
Fixes #96.
|
|
0354e1ab
|
2019-10-04T22:34:08
|
|
md_is_container_mark: Ordered list mark requires at least one digit.
Fixes #95.
|
|
97606369
|
2019-07-07T11:19:21
|
|
Fix the last test case in latex-math.txt.
|
|
099ce69b
|
2019-07-07T11:15:44
|
|
Add missing file into git.
|
|
2e965941
|
2019-07-07T10:59:20
|
|
Add/improve docs for the LaTeX math spans.
|
|
8bac86aa
|
2019-07-07T09:46:10
|
|
Added support for LaTeX math (#87)
Addresses #86.
|
|
ce8b5d94
|
2019-05-27T22:16:35
|
|
md_analyze_line: Blockquote with blank line can interrupt a paragraph.
Fixes #83.
|
|
51386164
|
2019-05-19T11:46:26
|
|
md_link_label_cmp: Fix handling non-trivial folding info.
Fixes #78.
|
|
4f6a9e54
|
2019-05-19T10:46:26
|
|
Update Unicode support to 12.1.
* scipts/build_*_map.py: Implement helper pythonic scripts used to
generate some Unicode search maps and data for helper Unicode
functions used in MD4C.
This should simplify updating to future Unicode versions.
* md_get_unicode_fold_info: Use data generated by the scripts.
* md_is_unicode_whitespace__: Ditto.
* md_is_unicode_punct__: Ditto.
|
|
aca5c27f
|
2019-05-16T22:48:08
|
|
test/spec.txt: Update from upstream head.
|
|
64a1bc37
|
2019-05-15T23:25:05
|
|
test/coverage.txt: Sort the regression test cases by the issue number.
|
|
919a0cc9
|
2019-05-08T07:38:33
|
|
test/*.txt: Fix some formatting.
|
|
1757ff55
|
2019-05-07T23:10:46
|
|
test/spec_tests.py: Make ready for spec.txt from cmark-gfm project.
This allows easier checking of our GFM dialect compatibility.
|
|
83047d3e
|
2019-05-07T22:24:29
|
|
md_analyze_permissive_url_autolink: Improve.
* Fix domain recognition so that it has to have at least two
dot-delimited components.
* Fix handling if parenthesis so that they have to form balanced
pairs; i.e. the first ')' not having a preceding opener ends the
path.
Fixes #76.
|
|
609dfb0b
|
2019-05-05T15:56:51
|
|
md_analyze_line: Treat blank lines inside a HTML block more carefully...
... with respect to the parent list containers.
Fixes #10 (but now really).
|
|
95279131
|
2019-04-30T00:32:36
|
|
When undoing complete block from ctx->block_bytesp[], reset ctx->current_block properly.
Fixes #74.
|
|
d4d10915
|
2019-04-29T19:03:16
|
|
Improve parsing of inline raw HTML.
* Isolate some common code for scanning HTML closer into a new function
so most HTML scanner functions reuse the same code.
* Improve the scanning for the closer so that on failure we remember
the range where no closer is present. So any later scanning attempts
may fail early.
Fixes #73.
|
|
d7920b9c
|
2019-04-08T19:35:06
|
|
Merge pull request #67 from mity/spec-0.29
This merges all changes for CommonMark specification 0.28 -> 0.29 transition.
|
|
5b78f295
|
2019-04-08T11:00:27
|
|
test/spec.txt: Update from upstream head.
|
|
2a7b97ed
|
2019-04-05T08:18:54
|
|
test/spec.txt: Update from upstream head.
|
|
b8586987
|
2019-04-03T08:28:27
|
|
md_collect_mark: Add missing 'continue' to '~' branch.
Fixes #69.
|
|
855a1bfc
|
2019-03-27T02:04:24
|
|
test/spec.txt: Update from upstream head.
|
|
94c86fe2
|
2019-03-26T14:45:23
|
|
Revert "Fix problematic link destinations with angle brackets."
The updated specification now explicitly requests the behavior we
implemented before fixing #24.
This reverts commit 2e0a74ba990e291ef4eace047d50af05ca81daef.
Also remove associated regression test as it is no longer valid.
|
|
0959975a
|
2019-03-26T14:01:02
|
|
md_analyze_emph: Follow specs changes to the "rule of three".
|
|
98968e22
|
2019-03-26T13:33:05
|
|
Update spec.txt from upstream head.
(I previously used an updated revision of it by mistake.)
|
|
1edd0c9c
|
2019-03-26T11:49:25
|
|
test/spec.txt: Update to current upstream HEAD.
|
|
2dd96ab4
|
2019-03-12T09:56:11
|
|
Fix O(n^2) in handling the "rule of three".
We had to break the list of potential '*' openers into multiple ones so
we do not have to walk it when looking for matching length due to the
"rule of three" for intraword delimiter runs.
Fixes #63.
|
|
b2108652
|
2019-03-11T21:13:15
|
|
md_analyze_line: Fix O(n^2) in thematic break handling.
Fixes #66.
|
|
37104fc2
|
2019-03-11T20:26:58
|
|
md_is_code_span: Fix crash at EOF.
Fixes #65.
|
|
966b8e39
|
2019-03-11T19:56:46
|
|
md_is_link_title: Stop on ')' lin ()-style title.
Fixes #60.
|
|
fc27108e
|
2019-03-11T19:55:08
|
|
test/pathological_tests.py: Output test durations.
|
|
53f65852
|
2019-03-11T19:03:34
|
|
test/spec.txt: Little update.
Somehow we were having little different spec.txt version that the one
from CommonMark repo tag 0.28. But we still pass all its compliance
test suite.
|
|
685b7144
|
2019-03-10T11:20:39
|
|
Move codespan detection from md_analyze_backtick() into...
md_is_code_span(), called from md_collect_marks().
We have to do this at the same time as detecting raw inline HTML to
follow CommonMark priority requirements.
Also it is done very differently now:
When scanning for the closer mark, we remember (the latest) position of
potential closers for all other lengths as well.
This means that:
(1) If we find it, we reduced the task because all subsequent scan shall
begin after the closer.
(2) If we do not find it, then we have to reach the end of the block and
hence we then know (for every allowed marker length) the position of last
such backtick sequence.
(3) That makes the guaranty that any subsequent call with either succeed
in its scan (and reduce the task even further); or that we shall be able
to detect instantly there is no suitable closer.
I.e. every call either reduces the task by O(n) scan (1); or collects
all the data in O(n) because (2) happens at most once; or fails in O(1)
(3).
This makes O(n) guaranty of the function complexity.
Fixes #59.
|
|
0cb61205
|
2019-03-10T10:50:23
|
|
Move raw inline HTML detection from md_analyze_lt_qt() into md_collect_marks().
Fixes #58:
For resolving raw inline HTML the function tried closer with all
potential openers, because raw HTML can have '<' inside of an attribute.
However this caused O(n^2) for input like "<><><><><><><>...".
We solved by handling raw HTML in earlier stage, directly in
md_collect_marks(), where we can scan linerary forward.
Fixes #61:
As a side effect, this also fixes the issue that MD_FLAG_NOHTMLSPANS
disabled also recognition of CommonMark autolinks.
|
|
8e01a769
|
2019-02-10T22:58:42
|
|
Implement task lists. (#50)
Fixes #30.
|
|
d32aa2e0
|
2019-02-09T10:40:52
|
|
Fix conflict in parsing permissive autolinks and ordinary links.
The issues is caused by the fact that we do not know exact position
of permissive auto-link in time of md_collect_marks() because there
is no syntax to mark its end on the 1st place.
This causes that eventually, the closer mark in ctx->marks[] can be
out-of-order somewhat.
As a consequence, if some other mark range (e.g. ordinary link)
shadows the auto-link, the closer mark may be left outside the shadowed
range and survive till the phase when we generate the output.
We fix by using an extra mark flag to remember we did really output
the opener mark, and output the closer only in such case.
Fixes #53.
|
|
67401e70
|
2019-02-06T04:31:25
|
|
md_analyze_inlines: Resolve table cell boundaries before links.
This brings some corner cases closer to cmark-gfm.
Also fixes #51.
|
|
8fc692ba
|
2018-06-11T18:17:26
|
|
md_rollback: Do not touch TABLECELLBOUNDARIES chain.
This chain is not normal opener/closer inline mark chain.
Fixes #42.
|
|
e6e2ea4c
|
2018-06-11T11:43:47
|
|
md_analyze_line: Fix mixing list and table parsing.
If table header underline is not nested the same way as the preceding
line (i.e. the wannabe table header line), then it cannot form a table.
Fixes #41.
|
|
4ef024fb
|
2018-05-29T23:30:02
|
|
md_process_inlines: Fix link/image closers spanning over multiple lines.
Fixes #40.
|
|
7deaccf6
|
2018-05-29T18:32:10
|
|
md_is_link_label: Fix if the link label contains just backslash escapes.
The function did not remember the label start line index, leading to bad
consequences.
Fixes #39.
|
|
bf022cb6
|
2018-05-28T21:14:45
|
|
Fix md_split_simple_pairing_mark().
When splitting a mark into two, make sure each of them gets the right
share od dummies for case that we will have to split once more.
Fixes #36.
|
|
e7b84d65
|
2018-05-28T21:09:32
|
|
pathological_tests.py: Fix test compatibility with Windows.
|
|
81e2a5ca
|
2018-04-12T17:03:37
|
|
pathological_tests.py: Test deeply nested lists.
|
|
0d1a41a4
|
2018-03-28T08:21:21
|
|
md_build_attr_append_substr: Fix +1 allocation error.
Fixes #33.
|
|
19b24bdd
|
2017-08-16T18:15:19
|
|
Simplify the pathological test "many references".
|
|
07cec7dc
|
2017-08-16T16:34:50
|
|
Add regression test for #24.
|
|
ee3bee1a
|
2017-08-02T00:37:18
|
|
Upgrade to CommonMark specification 0.28.
|
|
c52a50a3
|
2017-07-24T21:07:09
|
|
pathological_tests.py: Add test for reference definition lookup.
|
|
938460d5
|
2017-07-24T21:17:53
|
|
Improve/unify output of test scripts.
|
|
c51fb310
|
2017-07-24T23:14:05
|
|
md_analyze_marks: Walk only required range of the marks.
This changes causes that when recursing to analysis of link contents,
only the marks between the link opener and closer are iterated in
md_analyze_marks().
Fixes #22
|
|
a27aefde
|
2017-07-24T20:17:50
|
|
pathological_tests.py: Allow short option -p as a synonym of --program.
|
|
f4f7b223
|
2017-07-24T20:15:09
|
|
pathological_tests.py: Allow Windowish line ends.
|
|
26f14899
|
2017-07-24T19:27:27
|
|
Add pathological_tests.py from cmark.
|
|
ad4f28bb
|
2017-07-24T20:09:23
|
|
md_analyze_simple_pairing_mark: Fix the "rule of three".
If the first emphasis opener is refused due the rule of three, a previous
opener is examined. However the variable opener_orig_size_module3 was not
(re)set accordingly.
Fixes #21.
|
|
cfbce759
|
2017-07-18T18:01:02
|
|
Rework ref. def. dictionary.
It now uses FNV1a and we now sort/bsearch only contents of single bucket.
Additionally we fix #20 by disabling the invalid ref. definitions during
hashtable build.
|
|
f2821cbd
|
2017-07-14T17:10:45
|
|
md_analyze_permissive_email_autolink: Make it compatible with CMark-gfm.
|
|
1bc7f3a8
|
2017-07-14T02:22:15
|
|
render_url_escaped: Fix escaping of ampersand.
This affected generating href attribute if links or src attribute of
images.
|
|
f3f9404e
|
2017-07-14T02:06:23
|
|
Improve URL autolinks extension.
It is now much more compatible to Cmark-gfm.
With the flag MD_FLAG_PERMISSIVEWWWAUTOLINKS, we now also support the
WWW autolinks (when the http: scheme is omitted).
|
|
25a156ee
|
2017-07-12T23:30:14
|
|
Implement strikethrough extension.
|
|
8999e184
|
2017-01-04T14:40:44
|
|
Fix "rule of three" for emphasis resolution (issue #14).
|
|
c63909df
|
2017-01-04T15:04:09
|
|
When splitting emphasis opener mark, we have to retain 'dummy' marks available for more splitting in the future (issue #15).
|
|
52712384
|
2016-12-27T22:47:03
|
|
When parsing tables, pipes inside a link/image/code span cannot make cell boundary (issue #7).
|
|
f9b4cb8f
|
2016-12-15T16:47:41
|
|
md_process_inlines: Fix when an expanded mark shadows some nested marks (issue #11).
|
|
c235a02e
|
2016-12-15T13:18:48
|
|
test/coverage.txt: Add some tests for higher code coverage.
|
|
a725fee3
|
2016-12-14T16:51:24
|
|
md_enter_child_containers: Fix crash (issue #10).
Calling md_push_container_bytes() may result in ending a current block
which may result in removing some contents from ctx->block_bytes when
removing some lines with link reference definitions.
This in effect means we have to end the block explicitly before storing
the offset into the ctx->block_bytes.
|
|
ba29d007
|
2016-12-12T23:31:59
|
|
md_is_link_reference_definition: Fix handling of multiline label (issue #9).
|
|
09ae8609
|
2016-12-07T23:56:47
|
|
Handle images more like links.
Remove MD_SPAN_IMG_DETAIL::alt. Instead, the contents of the image is
propagated to the renderer via MD_RENDERER::text() callback.
* This fixes handling of entities inside the image text (issue #4).
* It simplifies parsing and, more importantly, it better distingusshes
what is responsibility of parser or renderer respectively.
* This allows more flexibility on renderers side. Renderer who do not
* really support images can just output the image content as any
other text.
The cost is a renderer into HTML (if it wants to render image contents
into the attribute ALT of the IMG tag), has to handle images with more
care. Typically such renderer has to track whether it is inside an image,
and if so, then render span enter/leave as an empty string.
|
|
23312d6d
|
2016-12-05T11:13:43
|
|
md_is_html_tag: Fix parsing unquoted attribute value (issue #2).
|
|
b40d5950
|
2016-12-04T17:01:00
|
|
Fix file permissions of python scripts.
|
|
be7fcc16
|
2016-11-21T09:47:31
|
|
Implement tables.
Note it is implemented as an extension. To enable it, the flag MD_FLAG_TABLES
must be explicitly specified.
|