Trouble unzipping sitemap.xml (zlib: incorrect header check) #2461

@barjin

Which package is this bug report for? If unsure which one to select, leave blank

@crawlee/core

Issue description

Gzip decompression fails for some sitemaps (zlib: incorrect header check), and the Sitemap.load() call ends with a "Malformed sitemap content" error.

Code sample

import { Sitemap } from 'crawlee';

// Loading the same sitemap in a browser works fine.
const sitemap = await Sitemap.load('https://www.paypal-community.com/sitemap.xml');
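zlib reports "incorrect header check" when the bytes it is asked to inflate do not start with a valid gzip header. One possible cause, not confirmed in this report, is a response body that is not gzip-framed by the time it is passed to gunzip. The following is a minimal sketch of a guard for that case, assuming the body is available as a Buffer; gunzipIfNeeded is a hypothetical helper and not part of Crawlee's API.

```typescript
import { gunzipSync, gzipSync } from 'node:zlib';

// Hypothetical helper (not part of Crawlee): gunzip only when the payload
// actually starts with the gzip magic bytes 0x1f 0x8b, otherwise return it as-is.
function gunzipIfNeeded(body: Buffer): Buffer {
    const looksGzipped = body.length >= 2 && body[0] === 0x1f && body[1] === 0x8b;
    return looksGzipped ? gunzipSync(body) : body;
}

// Illustrative sitemap content, not taken from the affected site.
const xml = '<urlset><url><loc>https://example.com/</loc></url></urlset>';
const plain = Buffer.from(xml);
const gzipped = gzipSync(plain);

// Both a gzipped and an already-plain body yield the same XML text.
const fromGzipped = gunzipIfNeeded(gzipped).toString();
const fromPlain = gunzipIfNeeded(plain).toString();
console.log(fromGzipped === xml, fromPlain === xml);
```

Checking the magic bytes avoids relying on the URL extension or the Content-Type header, either of which may disagree with what the server actually sent.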

Package version

3.9.2

Node.js version

20

Operating system

Linux

Apify platform

  • Tick me if you encountered this issue on the Apify platform

I have tested this on the next release

No response

Other context

No response

Metadata

Labels

bug: Something isn't working.
t-tooling: Issues with this label are in the ownership of the tooling team.