Skip to content

bug: traces are not correctly parsed #9616

Open

Description

What

.. antioxydant: extrait riche en tocophérols. Peut contenir des traces d'autres céréales contenant du gluten (blé, seigle, orge), d'autres fruits à coque (noix de cajou, noix de pécan), de lait et de soja.

  1. put blé as a sub-ingredient of antioxydant
  2. does not recognized d'autres fruits à coque (autres fruit à coque is in the taxonomy and d' is in the stopwords)
  3. de lait et de soja is parsed as soj-milk

Steps to reproduce the behavior

https://world.openfoodfacts.org/product/4056489601913/granola-premier-super-nutty-new-crownfield

  1. Click on 'Details of the analysis of the ingredients

Expected behavior

  1. blé is not a subingredient of antioxydant
  2. d'autres fruits à coque should be recognized
  3. de lait et de soja should be parsed as milk and soy

Additional context

Notes for 2) I tried:
stopword, escape the single quote, d'autre X
add "d'autres fruits à coque" as synonym of "fr:fruits à coque" X
replace "d'autres fruits à coque" by "fruits à coque" v
replace "d'autres fruits à coque" by "autres fruits à coque" v
tried many variants "d'autres fruits à coque", "d-autres-fruits-a-coque" x

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

No one assigned

    Projects

    • Status

      To discuss and validate

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions