xCRG failing on CI? #2605

@edeutsch

Max has noted that ARAX answer quality has dramatically degraded on CI
https://arax.ncats.io/?systest=235

One hypothesis was that it was due to the RTX-KG2 PloverDB CI instance being down for a week over the break, but it seems to be back and the problem persists.

The error seems to be:

  File "/mnt/data/orangeboard/production/RTX/code/ARAX/ARAXQuery/Infer/scripts/creativeCRG.py", line 110, in load_ML_CRGmodel
    fitModel = joblib.load(file_path)
               ^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/joblib/numpy_pickle.py", line 658, in load
    obj = _unpickle(fobj, filename, mmap_mode)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/joblib/numpy_pickle.py", line 577, in _unpickle
    obj = unpickler.load()
          ^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.12/pickle.py", line 1203, in load
    raise EOFError
EOFError

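This looks like an EOFError raised while creativeCRG.py loads its model file with joblib.load, which would suggest that the file is empty or truncated on CI?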
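To tell an empty or truncated model file apart from some other load failure, a small check along these lines could be run on the CI host. This is only a sketch: `check_model_file` is a hypothetical helper, not part of the ARAX code, and the default loader uses the standard-library `pickle` so that the example has no dependencies. The traceback does not show which path `file_path` holds, so that path would have to be supplied by hand.

```python
import os
import pickle


def _pickle_load(path):
    """Default loader: plain pickle. Pass joblib.load for joblib files."""
    with open(path, "rb") as fh:
        return pickle.load(fh)


def check_model_file(file_path, loader=_pickle_load):
    """Return (ok, detail) describing whether a serialized model loads.

    Hypothetical diagnostic helper; it only reports, it never repairs.
    """
    if not os.path.exists(file_path):
        return False, "missing"
    size = os.path.getsize(file_path)
    if size == 0:
        return False, "empty file (0 bytes)"
    try:
        loader(file_path)
    except (EOFError, pickle.UnpicklingError) as exc:
        # Both exceptions are what an incomplete pickle stream tends to raise.
        return False, f"likely truncated ({size} bytes): {type(exc).__name__}"
    except Exception as exc:
        # Anything else (version mismatch, missing class, ...) is reported as-is.
        return False, f"load failed ({size} bytes): {type(exc).__name__}: {exc}"
    return True, f"loaded OK ({size} bytes)"
```

For the file in the traceback, the call would presumably be `check_model_file(path, loader=joblib.load)`, since the model is read with `joblib.load` in creativeCRG.py.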

When CI started on 2025-12-24T07:24:44.003522, it reported:

-rwxrwxrwx. 1 rt rt  1630233926 May 17  2025 chemical_gene_embeddings_v1.0.KG2.10.0_refreshedTo_KG2.10.2.npz

I assume that this is the right file?

On arax.ncats.io:

-rw-r--r-- 1 ubuntu ubuntu  1630233926 May 17  2025 chemical_gene_embeddings_v1.0.KG2.10.0_refreshedTo_KG2.10.2.npz

i.e. the same size and date.
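A matching size and date does not prove that the contents match, so comparing checksums on the two hosts would be a stronger test. A minimal sketch using only the standard library; `sha256_of_file` is a hypothetical helper name, and the file is read in chunks because it is about 1.6 GB.

```python
import hashlib


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns b"" at end of file.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

Running this on both arax.ncats.io and the CI instance and comparing the two hex strings would show whether the files really are identical. The same check could be applied to whichever file `joblib.load(file_path)` is reading, since the traceback does not say that it is this .npz file.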

Here is an example of a query that failed:
https://arax.ci.transltr.io/?r=0750ab15-608e-4501-809b-f809cec45e5f

If I rerun the query today, I still get an error.

Here is a query_graph:

{
  "edges": {
    "t_edge": {
      "attribute_constraints": [],
      "knowledge_type": "inferred",
      "object": "ON",
      "predicates": [
        "biolink:affects"
      ],
      "qualifier_constraints": [
        {
          "qualifier_set": [
            {
              "qualifier_type_id": "biolink:object_aspect_qualifier",
              "qualifier_value": "activity_or_abundance"
            },
            {
              "qualifier_type_id": "biolink:object_direction_qualifier",
              "qualifier_value": "decreased"
            }
          ]
        }
      ],
      "subject": "SN"
    }
  },
  "nodes": {
    "ON": {
      "categories": [
        "biolink:Gene"
      ],
      "constraints": [],
      "is_set": false
    },
    "SN": {
      "categories": [
        "biolink:ChemicalEntity"
      ],
      "constraints": [],
      "ids": [
        "CHEBI:142437"
      ],
      "is_set": false
    }
  }
}

Running this on arax.ncats.io works:
https://arax.ncats.io/?r=424701

But fails on https://arax.ci.transltr.io/

Why is that? I do not know. Any ideas?
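To make the comparison repeatable, the same query_graph could be posted to both instances and the responses summarized side by side. The sketch below is an assumption-heavy illustration: the `/api/arax/v1.4/query` paths are my guess at the TRAPI endpoints (the issue only gives the site roots), and `build_trapi_request`, `submit`, and `summarize` are hypothetical helper names. It relies only on the TRAPI response fields `status`, `logs`, and `message.results`.

```python
import json
import urllib.request

# ASSUMPTION: endpoint paths are not stated in the issue; adjust as needed.
ENDPOINTS = {
    "production": "https://arax.ncats.io/api/arax/v1.4/query",
    "ci": "https://arax.ci.transltr.io/api/arax/v1.4/query",
}


def build_trapi_request(query_graph):
    """Wrap a query_graph dict in a minimal TRAPI request body."""
    return {"message": {"query_graph": query_graph}}


def submit(url, query_graph, timeout=600):
    """POST the query to one instance and return the parsed JSON response."""
    body = json.dumps(build_trapi_request(query_graph)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


def summarize(response):
    """Reduce a TRAPI response to status, result count, and ERROR log lines."""
    message = response.get("message") or {}
    results = message.get("results") or []
    errors = [
        entry.get("message")
        for entry in (response.get("logs") or [])
        if entry.get("level") == "ERROR"
    ]
    return {
        "status": response.get("status"),
        "n_results": len(results),
        "errors": errors,
    }
```

With the query_graph above loaded into a dict `qg`, something like `{name: summarize(submit(url, qg)) for name, url in ENDPOINTS.items()}` would show at a glance whether only the CI instance logs the EOFError.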
