mediawiki reader: improve strong/emph conformance by silby · Pull Request #10766 · jgm/pandoc

silby · 2025-04-07T23:59:50Z

I made some progress with this today without completely blowing up the existing strong and emph parsers, but weird edge cases remain. E.g. consider ''foo''''bar''. Pandoc today will give you Emph [ Str "foo" , Str "bar" ], which has an obvious appeal. My work in progress gives Emph [ Str "foo''" ] , Str "bar''", which is odder but defensible given other requirements for emphasized quote marks. The actual correct answer, according to MediaWiki, is Emph [ Str "foo'" , Strong [ Str "bar" ] ], i.e. an italic foo' followed by a bold italic bar, which is basically a koan.

Parsoid has a lot of code just for processing quotes, presumably aiming to maintain bug-for-bug compatibility with whatever MediaWiki's first parser did. So what a string of single-quotes means varies depending on what comes after it in the line, in a more context-sensitive way than I expected.
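
To make the context sensitivity concrete, here is a minimal Python sketch of how I understand the quote preprocessing to work. It is not Parsoid's or pandoc's code: the function name `quote_tokens` and the token shapes are made up for illustration, and the rebalancing step is deliberately simplified (it demotes the first bold run, whereas the real routine, as far as I can tell, applies further preferences about which run to pick).

```python
import re


def quote_tokens(line):
    """Split one line of wikitext into text and quote-toggle tokens.

    Hypothetical helper for illustration only. Returns a list of
    ("text", s) and ("toggle", kind) tuples, where kind is "italic",
    "bold" or "bold+italic".
    """
    # A capturing group makes re.split keep the delimiters: even indexes
    # hold plain text, odd indexes hold runs of two or more apostrophes.
    parts = re.split(r"(''+)", line)
    runs = range(1, len(parts), 2)

    # Pass 1: normalise every run to 2, 3 or 5 apostrophes. Surplus
    # apostrophes become literal text attached to the preceding chunk.
    for i in runs:
        n = len(parts[i])
        if n == 4:
            parts[i - 1] += "'"
            parts[i] = "'''"
        elif n > 5:
            parts[i - 1] += "'" * (n - 5)
            parts[i] = "'''''"

    # Pass 2: if italic and bold toggles are both unbalanced, treat one
    # bold run as a literal apostrophe followed by an italic toggle.
    # Simplification (assumption): always pick the first bold run.
    italics = sum(1 for i in runs if len(parts[i]) in (2, 5))
    bolds = sum(1 for i in runs if len(parts[i]) in (3, 5))
    if italics % 2 == 1 and bolds % 2 == 1:
        for i in runs:
            if len(parts[i]) == 3:
                parts[i - 1] += "'"
                parts[i] = "''"
                break

    # Emit tokens, dropping empty text chunks.
    kinds = {2: "italic", 3: "bold", 5: "bold+italic"}
    tokens = []
    for i, part in enumerate(parts):
        if i % 2 == 1:
            tokens.append(("toggle", kinds[len(part)]))
        elif part:
            tokens.append(("text", part))
    return tokens


if __name__ == "__main__":
    print(quote_tokens("''foo''''bar''"))
    print(quote_tokens("''foo'''"))
```

For ''foo''''bar'' the four-quote run is read as a literal apostrophe plus a bold toggle, so the tokens are italic, foo', bold, bar, italic. Read left to right, emphasis opens, strong opens inside it, and the final italic toggle closes the emphasis along with the nested strong, which lines up with the result quoted above. For ''foo''' both counts are odd, so the bold run is demoted and the line becomes an italic foo'.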

Would it be better to merge code that makes us more conformant with MediaWiki for some cases and "wrong in a different way" for others, or to try to reach perfection?

jgm · 2025-04-08T14:44:41Z

In general I'm not too concerned with divergences in edge cases. Nobody is ever going to write ''foo''''bar'' and intend to get emph "foo'" + strong "bar". Your original case ''foo''', by contrast, seems like something that would come up naturally.

jgm · 2025-04-08T14:47:55Z

Is Parsoid the parser MediaWiki uses? Or is that something else?

tarleb · 2025-07-29T19:19:56Z

This looks ready but is still marked as draft. @silby, can we merge this?

mediawiki reader: improve strong/emph conformance

330f159

jgm force-pushed the main branch from 60c147d to bfcff3e Compare May 12, 2025 00:38

jgm force-pushed the main branch from 4bb4f7f to 74351e4 Compare December 24, 2025 22:44

jgm force-pushed the main branch from ac068c2 to 54453a3 Compare March 22, 2026 09:17

jshtab mentioned this pull request May 5, 2026

MediaWiki bold-italics are not parsed properly by Pandoc cpe-wg/wiki.vg-spec#3

Closed

silby wants to merge 1 commit into jgm:main from silby:push-vpmxzmvlqpnx