bug in WXR_Parser_Regex when parsing authors

The regex parser assumes that [author info is all contained on a single line](https://github.com/WordPress/wordpress-importer/blob/master/src/parsers/class-wxr-parser-regex.php#L64), when in practice the WP exporter outputs authors across multiple lines in the WXR.

For example, the exporter outputs

```
 <wp:author>
   <wp:author_id>7</wp:author_id>
   <wp:author_login>username</wp:author_login>
   <wp:author_email>user@example.com</wp:author_email>
   <wp:author_display_name><![CDATA[First Last]]></wp:author_display_name>
   <wp:author_first_name><![CDATA[First]]></wp:author_first_name>
   <wp:author_last_name><![CDATA[Last]]></wp:author_last_name>
 </wp:author>
```

whereas, the regex parser is expecting

```
<wp:author><wp:author_id>7</wp:author_id><wp:author_login>username</wp:author_login><wp:author_email>user@example.com</wp:author_email><wp:author_display_name><![CDATA[First Last]]></wp:author_display_name><wp:author_first_name><![CDATA[First]]></wp:author_first_name><wp:author_last_name><![CDATA[Last]]></wp:author_last_name></wp:author>
```

I've got a tentative fix, but need to test it some more before submitting a PR (which probably won't be until the weekend)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

bug in WXR_Parser_Regex when parsing authors #144

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

bug in WXR_Parser_Regex when parsing authors #144

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions