edge file TSV missing publications ids (PMIDs) provenance #190

Open
TriageDr opened this issue May 28, 2020 · 3 comments
Labels
enhancement New feature or request

Comments

@TriageDr

From a quick look at the publications column (column 21, I think), the file appears to return no publications for any of the edges. Is this correct? The command I used was: awk '{print $21}' merged-kg_edges.tsv

@justaddcoffee justaddcoffee added the bug Something isn't working label May 28, 2020
@justaddcoffee
Collaborator

Thanks @TriageDr - we'll have a look

@justaddcoffee
Collaborator

I can't reproduce this - I see entries in the publication column in the most recent build:

 wget kg-hub.berkeleybop.io/kg-covid-19.tar.gz
(venv) ~/ *master $ cut -f 21 merged-kg_edges.tsv | head -1
publication
(venv) ~/ *master $ cut -f 21 merged-kg_edges.tsv | sort | uniq | grep -c '' 
82
(venv) ~/ *master $ cut -f 21 merged-kg_edges.tsv | sort | uniq

PMID:10969042
PMID:11815627
PMID:14647384
PMID:15094372
PMID:15479853
PMID:15688292
PMID:15791205

There are only 82 unique values in this column, which I think is because only a few transforms emit this info (IntAct)...
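
One possible reason the two commands disagree (not confirmed in this thread) is field splitting: awk splits on runs of whitespace by default, while cut -f splits on single tabs. In a TSV with empty cells or spaces inside cells, awk's $21 is then not column 21. The sketch below illustrates this on a tiny hypothetical three-column file; the column names, values, and temp file are made up for the example, and the real publication column is at position 21.

```shell
# Hypothetical 3-column TSV standing in for merged-kg_edges.tsv
# (publication is column 2 here, column 21 in the real file).
tmp=$(mktemp)
printf 'subject\tpublication\tprovided_by\n' > "$tmp"
printf 'A\t\tsource one\n' >> "$tmp"              # empty publication cell
printf 'B\tPMID:10969042\tsource two\n' >> "$tmp"

# Default awk: runs of whitespace are one separator, so the empty cell
# vanishes and "source one" is split into two fields.
ws=$(awk 'NR==2 {print $2}' "$tmp")

# Tab as the field separator: every tab delimits exactly one field,
# which matches what cut -f does.
tab=$(awk -F'\t' 'NR==2 {print $2}' "$tmp")

# Count data rows whose publication cell is non-empty.
count=$(awk -F'\t' 'NR>1 && $2 != "" {n++} END {print n+0}' "$tmp")

echo "whitespace split, row 2, field 2: [$ws]"
echo "tab split, row 2, field 2: [$tab]"
echo "rows with a publication: $count"
rm -f "$tmp"
```

Against the real file, the equivalent check would be awk -F'\t' '{print $21}' merged-kg_edges.tsv, assuming the publication column is still at position 21.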

@justaddcoffee
Collaborator

@TriageDr adding - if there are ingests for which publication info is particularly important, we can make a ticket to add this info ASAP

@justaddcoffee justaddcoffee added enhancement New feature or request and removed bug Something isn't working labels May 28, 2020