Skip to content

Issues: kermitt2/grobid

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

<ref type="figure" target="#fig_5"> missed bug From Hemiptera and especially its suborder Heteroptera
#1233 opened Jan 16, 2025 by Samuel-Scalbert
<p> duplicates bug From Hemiptera and especially its suborder Heteroptera
#1232 opened Jan 16, 2025 by Samuel-Scalbert
"Author contributions" section content is skipped by grobid bug From Hemiptera and especially its suborder Heteroptera
#1231 opened Jan 14, 2025 by i-amkashif
Misclassified tables and/or figures maybe tossed incorrectly bug From Hemiptera and especially its suborder Heteroptera implemented The issue has been implemented
#1206 opened Dec 3, 2024 by lfoppiano
Empty refs bug From Hemiptera and especially its suborder Heteroptera
#1175 opened Oct 2, 2024 by lfoppiano 0.8.2
Avoid replacing DOIs with shorter ones bug From Hemiptera and especially its suborder Heteroptera
#1127 opened Jun 10, 2024 by lfoppiano Draft 0.8.2
processHeaderDocument returns BibTeX by default instead of TEI bug From Hemiptera and especially its suborder Heteroptera need help Issues where the contributors are even more incompetent than usual
#1093 opened Apr 3, 2024 by michamos
general paragraph text wrongly recognized as "figDesc/div/p" bug From Hemiptera and especially its suborder Heteroptera error cases Some error/test case for future improvements models:fulltext
#1077 opened Jan 25, 2024 by sawyerzheng
PDF to XML conversion error bug From Hemiptera and especially its suborder Heteroptera pdfalto Issue related to pdfalto
#1033 opened Jun 23, 2023 by SebastianFeltl
Missed second names in bibtex and decapitalization of surnames bug From Hemiptera and especially its suborder Heteroptera
#1011 opened May 6, 2023 by oborin1
Possibility to list all ongoing trainings & kill bug From Hemiptera and especially its suborder Heteroptera enhancement
#985 opened Jan 16, 2023 by malee1382
Issue with producing <list> and <item> in grobid version 0.7.1 bug From Hemiptera and especially its suborder Heteroptera
#927 opened Jun 22, 2022 by Tanmay98
Error case with accents bug From Hemiptera and especially its suborder Heteroptera pdfalto Issue related to pdfalto
#906 opened Apr 7, 2022 by kermitt2
HTTP 500 for processCitationList on non-breaking whitespace string bug From Hemiptera and especially its suborder Heteroptera
#849 opened Nov 3, 2021 by bnewbold
Missing (very few tokens) in the generated segmentation training data bug From Hemiptera and especially its suborder Heteroptera
#812 opened Aug 11, 2021 by lfoppiano
Header model, relative font size includes spaces with a zero font size bug From Hemiptera and especially its suborder Heteroptera
#795 opened Jul 15, 2021 by de-code
Can't parse table coords when table is a image bug From Hemiptera and especially its suborder Heteroptera
#789 opened Jul 6, 2021 by elonzh
Wrong figure recognition bug From Hemiptera and especially its suborder Heteroptera
#787 opened Jun 30, 2021 by elonzh
PDF source file containing "pdf" before ".pdf" extension breaks naming of training files bug From Hemiptera and especially its suborder Heteroptera implemented The issue has been implemented
#776 opened Jun 21, 2021 by cboulanger
Issue with sentence segmentation offsets bug From Hemiptera and especially its suborder Heteroptera
#753 opened Apr 29, 2021 by kermitt2
Figures and tables in the back / annex section ignored bug From Hemiptera and especially its suborder Heteroptera enhancement
#737 opened Apr 14, 2021 by de-code
Danish letter "æ" is converted to "ae" bug From Hemiptera and especially its suborder Heteroptera
#728 opened Mar 8, 2021 by fnielsen
Full text model layout features: BLOCKSTART missing, if very first block token is a new line bug From Hemiptera and especially its suborder Heteroptera implemented The issue has been implemented
#712 opened Feb 12, 2021 by de-code
whitespace within a URL string in GROBID converted text bug From Hemiptera and especially its suborder Heteroptera
#679 opened Nov 28, 2020 by caifand
Potential NullPointerException in FullTextParser if segmentation is not resulting in body bug From Hemiptera and especially its suborder Heteroptera
#676 opened Nov 23, 2020 by de-code
ProTip! Exclude everything labeled bug with -label:bug.