Skip to content

[Bug]: result.link (links extraction results empty lists) not working when using raw_html_url = f"raw:{raw_html}" as input #668

Open
@mllife

Description

crawl4ai version

0.4.247

Expected Behavior

page should have some links to other pages, which it should return in result.link

Current Behavior

all_links = result.links.get("internal", []) + result.links.get("external", []) # always empty

when doing

raw_html_url = f"raw:{raw_html}"

async with C4AIAsyncWebCrawler(config=browser_config) as crawler:
result = await crawler.arun(raw_html_url, config=crawler_config)

Is this reproducible?

Yes

Inputs Causing the Bug

Steps to Reproduce

Code snippets

OS

macOS

Python version

3.11.9

Browser

No response

Browser version

No response

Error logs & Screenshots (if applicable)

No response

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions