Fixes for graph pruning #359

Open: wants to merge 9 commits into base: main

Contributor

@stellasia stellasia commented Jun 12, 2025

Description

  • New parameters (additional_node_types...) were not propagated properly to the SimpleKGBuilderConfig object
  • Lexical graph was pruned :''''(
  • Some components were missing the lexical graph input
  • Make deprecated parameters even more explicit in the documentation
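
As a rough illustration of the second bullet, here is a minimal sketch of schema-based pruning that leaves the lexical graph untouched. It is not the library's implementation: `Node`, `prune_nodes`, and `LEXICAL_LABELS` are hypothetical names, and the `Document` and `Chunk` labels are taken from the docs excerpt discussed in this PR.

```python
from dataclasses import dataclass

# Hypothetical constant: labels assumed to belong to the lexical graph.
LEXICAL_LABELS = frozenset({"Document", "Chunk"})


@dataclass(frozen=True)
class Node:
    """Toy node with an id and a single label (illustration only)."""
    id: str
    label: str


def prune_nodes(nodes, allowed_labels, lexical_labels=LEXICAL_LABELS):
    """Keep nodes whose label is in the schema; never drop lexical graph nodes."""
    return [
        n for n in nodes
        if n.label in lexical_labels or n.label in allowed_labels
    ]


nodes = [
    Node("d1", "Document"),
    Node("c1", "Chunk"),
    Node("p1", "Person"),
    Node("x1", "Unknown"),
]
kept = prune_nodes(nodes, allowed_labels={"Person"})
print([n.id for n in kept])  # ['d1', 'c1', 'p1']
```

Only the `Unknown` node is dropped; the `Document` and `Chunk` nodes survive even though they are not part of the entity schema.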

Type of Change

  • New feature
  • Bug fix
  • Breaking change
  • Documentation update
  • Project configuration change

Complexity

Note

Please provide an estimated complexity of this PR of either Low, Medium or High

Complexity:

How Has This Been Tested?

  • Unit tests
  • E2E tests
  • Manual tests

Checklist

The following requirements should have been met (depending on the changes in the branch):

  • Documentation has been updated
  • Unit tests have been updated
  • E2E tests have been updated
  • Examples have been updated
  • New files have copyright header
  • CLA (https://neo4j.com/developer/cla/) has been signed
  • CHANGELOG.md updated if appropriate

@stellasia stellasia marked this pull request as ready for review June 13, 2025 13:54
@stellasia stellasia requested a review from a team as a code owner June 13, 2025 13:54
@NathalieCharbel
Contributor

NathalieCharbel commented Jun 18, 2025

Should we also update this diagram and its description in the docs, to make it clear that the pruner is now a component and is executed after the extractor?

@@ -24,6 +24,7 @@ A Knowledge Graph (KG) construction pipeline requires a few components (some of
- **Schema builder**: provide a schema to ground the LLM extracted node and relationship types and obtain an easily navigable KG. Schema can be provided manually or extracted automatically using LLMs.
- **Lexical graph builder**: build the lexical graph (Document, Chunk and their relationships) (optional).
- **Entity and relation extractor**: extract relevant entities and relations from the text.
- **Graph pruner**: clean the graph based on schema.
Contributor

Shall we also add that it's optional?

Contributor Author

What is optional? The component is not optional; the cleaning depends on what is defined in the schema, which is what the sentence says, no?
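
To make the point in this reply concrete, here is a small sketch in which the pruning step always runs but what it removes depends on the schema. `ToySchema` and `clean_labels` are hypothetical stand-ins, not the library's classes, and treating `additional_node_types` as a boolean that allows labels outside the schema is an assumption based on the parameter name in the PR description.

```python
from dataclasses import dataclass, field


@dataclass
class ToySchema:
    """Hypothetical stand-in for a graph schema (not the library's class)."""
    node_types: set = field(default_factory=set)
    # Assumed semantics: True means labels outside node_types are allowed.
    additional_node_types: bool = True


def clean_labels(labels, schema):
    """Always runs; what it removes depends entirely on the schema."""
    if schema.additional_node_types:
        return list(labels)  # schema is open, so nothing is removed
    return [label for label in labels if label in schema.node_types]


extracted = ["Person", "Company", "Planet"]
strict = ToySchema({"Person", "Company"}, additional_node_types=False)
open_schema = ToySchema({"Person", "Company"}, additional_node_types=True)

print(clean_labels(extracted, strict))       # ['Person', 'Company']
print(clean_labels(extracted, open_schema))  # ['Person', 'Company', 'Planet']
```

With the open schema the step is effectively a no-op, which may be why it can look optional from the outside.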

2 participants