-
Notifications
You must be signed in to change notification settings - Fork 25
Description
Max has noted that ARAX answer quality has dramatically degraded on CI
https://arax.ncats.io/?systest=235
One hypothesis was that it was due to the RTX-KG2 PloverDB CI instance being down for a week over the break, but it seems to be back and the problem persists.
The error seems to be:
File "/mnt/data/orangeboard/production/RTX/code/ARAX/ARAXQuery/Infer/scripts/creativeCRG.py", line 110,
in load_ML_CRGmodel\n fitModel = joblib.load(file_path)\n ^^^^^^^^^^^^^^^^^^^^^^\n', '
File "/usr/local/lib/python3.12/dist-packages/joblib/numpy_pickle.py", line 658,
in load\n obj = _unpickle(fobj, filename, mmap_mode)\n ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^\n', '
File "/usr/local/lib/python3.12/dist-packages/joblib/numpy_pickle.py", line 577,
in _unpickle\n obj = unpickler.load()\n ^^^^^^^^^^^^^^^^\n', '
File "/usr/lib/python3.12/pickle.py", line 1203, in load\n raise EOFError\n', 'EOFError\n'
Seems to be an End of File error with creativeCRG?
When CI started on 2025-12-24T07:24:44.003522, it reported:
-rwxrwxrwx. 1 rt rt 1630233926 May 17 2025 chemical_gene_embeddings_v1.0.KG2.10.0_refreshedTo_KG2.10.2.npz
I assume that this is right file?
On arax.ncats.io:
-rw-r--r-- 1 ubuntu ubuntu 1630233926 May 17 2025 chemical_gene_embeddings_v1.0.KG2.10.0_refreshedTo_KG2.10.2.npz
i.e. the same size and date.
Here is an example of a query that failed:
https://arax.ci.transltr.io/?r=0750ab15-608e-4501-809b-f809cec45e5f
If I rerun the query today, I still also get an error.
Here is a query_graph:
{
"edges": {
"t_edge": {
"attribute_constraints": [],
"knowledge_type": "inferred",
"object": "ON",
"predicates": [
"biolink:affects"
],
"qualifier_constraints": [
{
"qualifier_set": [
{
"qualifier_type_id": "biolink:object_aspect_qualifier",
"qualifier_value": "activity_or_abundance"
},
{
"qualifier_type_id": "biolink:object_direction_qualifier",
"qualifier_value": "decreased"
}
]
}
],
"subject": "SN"
}
},
"nodes": {
"ON": {
"categories": [
"biolink:Gene"
],
"constraints": [],
"is_set": false
},
"SN": {
"categories": [
"biolink:ChemicalEntity"
],
"constraints": [],
"ids": [
"CHEBI:142437"
],
"is_set": false
}
}
}
Running this on arax.ncats.io works:
https://arax.ncats.io/?r=424701
But fails on https://arax.ci.transltr.io/
Why is that? I do not know. Any ideas?