Skip to content

When article metadata is missing #729

@jvwong

Description

@jvwong

Background

Issue: No article metadata is associated with a Document

Consequences

  • Fail to 'attribute' the authors and journal
  • Data integration suffers when pathway data and exports omit article references
  • Explore view paper metadata will be emptyish or worse, have,dead links (e.g. Google Scholar, PubMed)
    • Related papers system will be unable to sort articles based on participant queries (INDRA)
  • Non starter for 'assistants' based upon automated reading etc...

Reasons for missing metadata

  • A. Not in PubMed

    • Prepublication: The metadata exists, but is yet to be deposited by journal. Acceptance-to-publication lag ~4-6 weeks
    • Exclusion: PubMed does not index (e.g. preprints)
    • Nope: Spam
  • B. In PubMed

    • Technical glitch: EUTILS service was down or errored when called by Factoid server
    • Clash: There was > 1 result when PubMed EUTILS service was queried (article title)
    • Related: PubMed metadata grows over time as curation occurs (e.g. MeSH headings, reference lists)

Obtaining article metadata

  • Manual: Authors enter/edit
  • Automated: Software retrieves/replaces existing data

Specification

Details

  • Manual retrieval: This covers the (A) 'Prepublication' case

    • Idea is to format the available metadata information consistently (e.g. explore view, homepage gallery), and make it clear which fields are 'stubbed out'; then allow author to add or edit -- at very least journal name, maybe more
    • Where: Call to action; Editor; Submit
  • Automated retrieval: This covers the (A) 'Prepublication' and (B) 'Technical glitch', 'Related'

    • Idea is to use existing service to fetch/refresh the metadata from PubMed based on the information provided by authors
    • Possible approaches
      • Cron job: Run the update periodically, optionally filtering some Documents (i.e. articles published a certain time ago)
      • other: Lazy load information when the Document is accessed (e.g. explorer, data export)

Mockup

//TODO

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions