Skip to content

Add HTML input support #312

Closed
Closed
@fsteeg

Description

@fsteeg

For HTML processing as in https://github.com/programmieraffe/oerhoernchen20 (with sources configured as in https://github.com/programmieraffe/oerhoernchen20/blob/master/scrapy/projects.json), we need an HtmlReader, to be used between HttpOpener and XmlDecoder (parsing HTML to XML, e.g. with https://jsoup.org/) in a workflow to create JSON index data for https://github.com/orgs/hbz/projects/4.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions