DOCX reader discards figure caption (regression) #9610

Closed
@frederik

Description

Problem description

The latest pandoc release, 3.1.12.3 (and possibly 3.1.12.2), seems to drop figure captions when importing from docx.

pandoc captions.docx -o test.json

Latest known version where it works: 3.1.12.1

Reproduction

I am attaching a docx file and two JSON outputs, one from pandoc 3.1.12.1 and one from 3.1.12.3.

Here's the diff (only the caption is missing):

115,151d114
<         },
<         {
<             "t": "Para",
<             "c": [
<                 {
<                     "t": "Str",
<                     "c": "Figure"
<                 },
<                 {
<                     "t": "Space"
<                 },
<                 {
<                     "t": "Str",
<                     "c": "1"
<                 },
<                 {
<                     "t": "Space"
<                 },
<                 {
<                     "t": "Str",
<                     "c": "A"
<                 },
<                 {
<                     "t": "Space"
<                 },
<                 {
<                     "t": "Str",
<                     "c": "figure"
<                 },
<                 {
<                     "t": "Space"
<                 },
<                 {
<                     "t": "Str",
<                     "c": "caption"
<                 }
<             ]
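
For anyone who wants to check other documents without reading a raw diff, here is a minimal Python sketch that lists the top-level `Para` blocks present in one pandoc JSON output but absent from another. It assumes the usual pandoc JSON layout (a top-level `"blocks"` list whose inline nodes look like the `Str`/`Space` objects in the diff above). The helper names `inline_text`, `para_texts` and `missing_paras` are made up for this illustration and are not part of pandoc. It only looks at top-level paragraphs and ignores formatting, so treat it as a rough check rather than a full AST comparison.

```python
import json
import sys


def inline_text(inlines):
    """Flatten pandoc inline nodes to plain text (handles Str, Space, SoftBreak only)."""
    parts = []
    for node in inlines:
        kind = node.get("t")
        if kind == "Str":
            parts.append(node["c"])
        elif kind in ("Space", "SoftBreak"):
            parts.append(" ")
    return "".join(parts)


def para_texts(doc):
    """Return the plain text of every top-level Para block in a pandoc JSON document."""
    return [
        inline_text(block["c"])
        for block in doc.get("blocks", [])
        if block.get("t") == "Para"
    ]


def missing_paras(old_doc, new_doc):
    """Paragraph texts that appear in old_doc but not in new_doc."""
    new_texts = set(para_texts(new_doc))
    return [text for text in para_texts(old_doc) if text not in new_texts]


if __name__ == "__main__":
    # Usage: python missing_paras.py OLD.json NEW.json
    with open(sys.argv[1], encoding="utf-8") as f:
        old_doc = json.load(f)
    with open(sys.argv[2], encoding="utf-8") as f:
        new_doc = json.load(f)
    for text in missing_paras(old_doc, new_doc):
        print("missing:", text)
```

Run it with the JSON from the working version as the first argument and the JSON from the newer version as the second; any paragraph the newer reader dropped is printed on its own line.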

caption.docx
test-latest.json
test-3-1-11.json
