-
Couldn't load subscription status.
- Fork 7
Fallback for errors in related paper retrieval #1025
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…papers call to indra module.
|
Great Are these fallback results marked so that they can be updated properly by the cron job with real results? |
So the cron will call: factoid/src/server/routes/api/document/index.js Lines 2318 to 2322 in 3c250f6
so this should be updated accordingly - is this what you mean? |
|
I just want to make sure that the placeholder data doesn't sit around indefinitely. The cron job updates everything unconditionally, right? |
Yes, in theory the cron should replace data if those web service calls are successful. |
|
All right. Sounds good. Later on we may consider having the cron job leave the data as-is if it fails for a doc that has prior, valid data. It's not ideal if an existing set of data -- albeit possibly a bit old -- is overwritten by placeholder data because the update fails. Not that important now but may be nice to have eventually. The main thing overall is that the cron job / update process is frequent with a relatively low failure rate. That minimises the likelihood that any given doc has failures at any point in time. If the errors are random or sporadic, then we could also consider increasing the rate of the cron job only for failed docs (e.g. a second, high-frequency cron job just for the failures). That would also push the error probability down |
Actually this is a point I missed - let me push another update so that
|
|
Here are the cases for the related papers:
I guess this is ripe for some integration tests if this gets any more complicated. |
This update stems from observations that Explorer views are showing no related papers, making it appear 'broken'.
It appears that calls to INDRA or semantic search are failing, in a manner that is intermittent. The problem is that there are no fall backs, that is, either everything works or the whole thing fails.
Here, I simply use the document level papers retrieved from PubMed (which is typically robust) to back fill the related papers in the case that any problems occur.
Refs:
#988 (comment)
#937
PathwayCommons/semantic-search#98