[feat] expose bfactors in protein_to_pyg function #388

kierandidi · 2024-04-19T17:35:55Z

Reference Issues/PRs

What does this implement/fix? Explain your changes

B-factors are present in the DataFrame returned by biopandas, but previously could not be saved to the PyG object. This PR adds a store_bfactor option to protein_to_pyg that enables this.

What testing did you do to verify the changes in this PR?

Pull Request Checklist

Added a note about the modification or contribution to the ./CHANGELOG.md file (if applicable)
Added appropriate unit test functions in the ./graphein/tests/* directories (if applicable)
Modified documentation in the corresponding Jupyter Notebook under ./notebooks/ (if applicable)
Ran python -m py.test tests/ and made sure that all unit tests pass (for small modifications, it might be sufficient to only run the specific test file, e.g., python -m py.test tests/protein/test_graphs.py)
Checked for style issues by running black . and isort .

a-r-j · 2024-04-20T13:18:57Z

graphein/protein/tensor/io.py

@@ -254,6 +261,10 @@ def protein_to_pyg(
    )
    if store_het:
        out.hetatms = [het_coords]
+
+    if store_bfactor:
+        out.bfactor = torch.tensor(df["b_factor"].values)


I think torch.from_numpy might be a little better here, since it avoids copying the array.
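To illustrate the difference behind this suggestion, here is a minimal sketch, not taken from the PR. The array `b_factors` is a made-up stand-in for `df["b_factor"].values`. `torch.tensor` copies its input, while `torch.from_numpy` returns a tensor that shares memory with the NumPy array.

```python
import numpy as np
import torch

# Hypothetical stand-in for df["b_factor"].values (per-atom B-factors).
b_factors = np.array([12.5, 14.0, 13.2, 20.1], dtype=np.float64)

copied = torch.tensor(b_factors)      # copies the data into a new tensor
shared = torch.from_numpy(b_factors)  # shares memory with the NumPy array

# Mutating the source array is visible only through the shared tensor.
b_factors[0] = 99.0
print(copied[0].item())  # 12.5
print(shared[0].item())  # 99.0

# Both calls keep the NumPy dtype, so a float64 column yields a float64 tensor.
print(shared.dtype)  # torch.float64
```

Because the memory is shared, later in-place changes to the underlying array also change the tensor. If the other tensors on the object are float32, calling `.float()` on the result would bring the dtype in line, at the cost of a copy.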

a-r-j · 2024-04-21T11:58:23Z

graphein/protein/tensor/io.py

@@ -254,6 +261,10 @@ def protein_to_pyg(
    )
    if store_het:
        out.hetatms = [het_coords]
+
+    if store_bfactor:
+        out.bfactor = torch.from_numpy(df["b_factor"].values)


This will be of shape num_atoms x 1, right? I expect this would break batching, as all other tensors are of shape n_res x X.

Good catch. I solved this with a groupby that averages B-factors on a per-residue basis, consistent with the pLDDT information in the predicted datasets.
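The per-residue averaging described here might look roughly like the sketch below. This is an illustration under stated assumptions, not the code merged in the PR: the toy DataFrame and the choice of grouping keys (`chain_id`, `residue_number`, `insertion`, following biopandas-style column names) are assumptions.

```python
import pandas as pd
import torch

# Toy atom-level table: three atoms in residue 1, two atoms in residue 2.
# Column names are assumed to follow a biopandas-style ATOM DataFrame.
df = pd.DataFrame(
    {
        "chain_id": ["A", "A", "A", "A", "A"],
        "residue_number": [1, 1, 1, 2, 2],
        "insertion": ["", "", "", "", ""],
        "b_factor": [10.0, 20.0, 30.0, 40.0, 60.0],
    }
)

# Average the per-atom values within each residue. sort=False keeps residues
# in order of first appearance rather than sorting by the group keys.
per_residue = df.groupby(
    ["chain_id", "residue_number", "insertion"], sort=False
)["b_factor"].mean()

# One value per residue, so the leading dimension is n_res, not num_atoms.
bfactor = torch.from_numpy(per_residue.to_numpy(copy=True))
print(bfactor.shape)     # torch.Size([2])
print(bfactor.tolist())  # [20.0, 50.0]
```

With one entry per residue, the tensor has the same leading dimension as the other per-residue tensors, which is the property the review comment asks for.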

I ran more tests and everything seems to work in a backward-compatible way. Let me know if anything else needs to happen before merging, @a-r-j.


sonarcloud · 2024-04-21T19:25:28Z

Quality Gate failed

Failed conditions
C Maintainability Rating on New Code (required ≥ A)


[feat] expose bfactors in protein_to_pyg function

e75ca86

kierandidi requested a review from a-r-j April 19, 2024 17:35

kierandidi self-assigned this Apr 19, 2024

[doc] update CHANGELOG

8fd1eb2

kierandidi force-pushed the feat/expose_b_factor branch from 1d4b17a to 8fd1eb2 Compare April 19, 2024 17:37

a-r-j reviewed Apr 20, 2024


[doc] changed .tensor to .from_numpy for memory efficiency

b4bf164

kierandidi force-pushed the feat/expose_b_factor branch from 62c8719 to b4bf164 Compare April 20, 2024 14:26

a-r-j reviewed Apr 21, 2024


[fix] calculate bfactor per residue instead of per atom

8f0a730

kierandidi force-pushed the feat/expose_b_factor branch from 6bfb02e to 8f0a730 Compare April 21, 2024 19:24

[pre-commit.ci] auto fixes from pre-commit.com hooks

257d917

for more information, see https://pre-commit.ci

a-r-j merged commit e861231 into master Apr 23, 2024
28 of 32 checks passed

a-r-j deleted the feat/expose_b_factor branch July 15, 2024 07:14
