Feed item content gets unescaped

I'm trying to parse a feed and render its contents on a website. The feed sometimes contains HTML code blocks (think tutorial posts explaining how to do something in HTML, [like this](https://css-tricks.com/creating-an-auto-closing-notification-with-an-html-popover/#aa-lets-start-with-the-popover)).

Take this example feed for instance:
```xml
<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<item>
			<content:encoded><![CDATA[
				<pre><code>&lt;div class="wrapper">Lorem ipsum dolor sit amet&lt;/div></code></pre>
			]]></content:encoded>
		</item>
	</channel>
</rss>
```

Intuitively, I expected that `parseFeed(xml).items[0].content` would return something like:
```
<pre><code>&lt;div class="wrapper">Lorem ipsum dolor sit amet&lt;/div></code></pre>
```

Instead, the text for content gets unescaped ([RSS](https://github.com/rowanmanning/feed-parser/blob/ccf98af3f19dd947d44cb90e89f2a03a70c530a8/lib/feed/item/rss.js#L98), [Atom](https://github.com/rowanmanning/feed-parser/blob/ccf98af3f19dd947d44cb90e89f2a03a70c530a8/lib/feed/item/atom.js#L116)), and this is returned instead:
```
<pre><code><div class="wrapper">Lorem ipsum dolor sit amet</div></code></pre>
```

While I do want the outer `<pre>` and `<code>` tags to be rendered as proper HTML tags on the final page, the inner `div` I want to keep verbatim, i.e. `&lt;div class="wrapper">`, so that it is rendered as text on the final website.


I made the changes to suit my needs in [this commit](https://github.com/rowanmanning/feed-parser/commit/7b92b21218e03b99d6c56b43d2dcaf36aaf52d9c), including some tests. I was unable to get most of the integration tests to actually pass, since the `feedparser` library ([used to process feeds in tests](https://github.com/rowanmanning/feed-parser/blob/ccf98af3f19dd947d44cb90e89f2a03a70c530a8/test/integration/sample-feeds.test.js#L95)) seems to unescape HTML in the same way, with no option to turn it off.

The way I did it would also be a breaking change; to avoid, assuming you even want to support this use case, perhaps we could add an `options` parameter to the `parseFeed` function to opt out of unescaping?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feed item content gets unescaped #209

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Feed item content gets unescaped #209

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions