Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Total number of works is not equivalent to count of papers. #115

Open
jpwahle opened this issue Dec 19, 2022 · 0 comments
Open

Total number of works is not equivalent to count of papers. #115

jpwahle opened this issue Dec 19, 2022 · 0 comments

Comments

@jpwahle
Copy link
Owner

jpwahle commented Dec 19, 2022

Describe the bug
Since SemanticScholar provides a count for the total number of papers an author has published it can be higher than the number of works in D3 dataset. See here for more details.

To Reproduce
Get the authors file and count the number of works of author with id 2686522

Expected behavior
Get 5 works in DBLP. However, it will return 26 works because the author has more works outside of DBLP.

Screenshots
image
image

How to fix
Either count only in DBLP or provide a separate field that provides insights on the difference (e.g., dblp_works: 5 and outside_dblp_words: 21)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant