Review/region correlation by MeyerBender · Pull Request #100 · LazDaria/SegTraQ

MeyerBender · 2026-02-03T09:40:28Z

This is a refactor of the region similarity (formerly region correlation) module. It renames functions and outputs to be more consistent with the rest of the package. In addition, the logic for assigning nuclei to cells changed slightly: a nucleus can now be assigned to at most one cell, not to two or more. The run_region_similarity() function, which runs all methods in the module, was also reworked.
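
To illustrate the one-nucleus-per-cell constraint, here is a minimal sketch. It is not the code from this PR: the function name, the (cell_id, nuc_id, iou) input format, and the greedy highest-IoU-first strategy are all assumptions made for the example.

```python
def assign_nuclei_one_to_one(candidates):
    """Greedily match cells to nuclei so that no nucleus is used twice.

    candidates: iterable of (cell_id, nuc_id, iou) tuples (hypothetical format).
    Returns a dict mapping cell_id -> (nuc_id, iou).
    """
    assignment = {}
    used_nuclei = set()
    # Visit pairs from highest to lowest IoU, so the best-overlapping
    # cell wins a nucleus that several cells compete for.
    for cell_id, nuc_id, iou in sorted(candidates, key=lambda c: c[2], reverse=True):
        if iou <= 0 or cell_id in assignment or nuc_id in used_nuclei:
            continue
        assignment[cell_id] = (nuc_id, iou)
        used_nuclei.add(nuc_id)
    return assignment


pairs = [
    ("cell_a", "nuc_1", 0.60),
    ("cell_b", "nuc_1", 0.35),  # competes with cell_a for nuc_1
    ("cell_b", "nuc_2", 0.20),
]
result = assign_nuclei_one_to_one(pairs)
```

In this toy input, cell_b loses nuc_1 to cell_a and falls back to nuc_2, so each nucleus ends up with a single cell.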

LazDaria · 2026-02-03T14:43:22Z

src/segtraq/rs/region_similarity.py

-    and computes a correlation (e.g. Pearson) between the gene expression profiles
-    of the cell and that nucleus.
+    and computes the similarity (cosine similarity, Pearson correlation, Spearman correlation)
+    between the gene expression profiles of the whole cell (including the nucleus) and that nucleus.


"Including the nucleus" is a bit misleading. If the nucleus is outside the cell, the part that is outside the cell is not considered.
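
For reference, the three similarity measures named in the docstring can be sketched as follows. This is an illustration with toy vectors, not the module's implementation; the helper name profile_similarity and the example counts are assumptions.

```python
import numpy as np
from scipy.stats import pearsonr, spearmanr


def profile_similarity(a, b, method="cosine"):
    """Similarity between two gene expression vectors of equal length."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    if method == "cosine":
        denom = np.linalg.norm(a) * np.linalg.norm(b)
        # Cosine similarity is undefined if either profile is all zeros.
        return float(np.dot(a, b) / denom) if denom > 0 else float("nan")
    if method == "pearson":
        r, _ = pearsonr(a, b)
        return float(r)
    if method == "spearman":
        rho, _ = spearmanr(a, b)
        return float(rho)
    raise ValueError(f"unknown method: {method!r}")


cell_profile = [10, 5, 0, 1]    # toy counts per gene inside the cell
nucleus_profile = [8, 6, 1, 0]  # toy counts per gene inside the nucleus
scores = {
    m: profile_similarity(cell_profile, nucleus_profile, m)
    for m in ("cosine", "pearson", "spearman")
}
```

Which transcripts end up in each vector depends on how the regions are defined, which is the point of the comment above.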

LazDaria · 2026-02-03T14:45:07Z

src/segtraq/rs/region_similarity.py

-    Returns DataFrame with columns ["cell_id", "best_nuc_id", "IoU", "correlation_parts"].
+    For each cell in the SpatialData table, identifies the nucleus with highest intersection over union (IoU)
+    and computes the similarity (cosine similarity, Pearson correlation, Spearman correlation)
+    between the gene expression profiles of the cytoplasm (cell - nucleus) and that nucleus.


suggestion:
"between the gene expression profiles of the cytoplasm (cell - nucleus) and the cell region overlapping the nucleus."
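
A small sketch of the distinction, assuming a hypothetical transcript table in which each transcript carries the ID of the cell and of the nucleus it falls into (-1 meaning "none"). The column names and the split_counts helper are made up for the example and are not taken from the module.

```python
import pandas as pd

# Hypothetical transcript table; -1 means "not inside any cell/nucleus".
transcripts = pd.DataFrame(
    {
        "gene": ["A", "A", "B", "B", "C", "A"],
        "cell_id": [1, 1, 1, 1, 1, -1],
        "nucleus_id": [7, -1, 7, -1, -1, 7],
    }
)


def split_counts(transcripts, cell_id, nucleus_id):
    """Per-gene counts for the cell/nucleus overlap and for the rest of the cell."""
    in_cell = transcripts["cell_id"] == cell_id
    in_nucleus = transcripts["nucleus_id"] == nucleus_id
    genes = sorted(transcripts["gene"].unique())
    # Overlap: transcripts inside both the cell and the nucleus.
    intersection = (
        transcripts.loc[in_cell & in_nucleus, "gene"].value_counts().reindex(genes, fill_value=0)
    )
    # Remainder ("cytoplasm"): inside the cell but outside the nucleus.
    remainder = (
        transcripts.loc[in_cell & ~in_nucleus, "gene"].value_counts().reindex(genes, fill_value=0)
    )
    return intersection, remainder


intersection, remainder = split_counts(transcripts, cell_id=1, nucleus_id=7)
```

The last transcript lies in nucleus 7 but outside cell 1, so it contributes to neither vector, which is why "the cell region overlapping the nucleus" is the more precise wording.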

LazDaria · 2026-02-03T14:47:32Z

src/segtraq/rs/region_similarity.py

-        Neighborhood radius factor in the same coordinate units as the shapes.
+    neighborhood_radius_factor : float, default=2.0
+        For each cell, the neighborhood consists of the cells whose centroids
+        lie within the radius of the cell times this factor.


I wasn't aware that the neighbor centroids have to lie within that distance. Is that really the case?

As far as I can tell, this is what you do in rs/utils.py/_compute_ncvs_within_radius().
Code snippet:

    # Query neighbors within radius (including itself)
    idxs = tree.query_ball_point(coords[i], r=radii[i] * neighborhood_radius_factor)
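
A self-contained toy version of that query shows the centroid-based behaviour. The coordinates and radii are invented, and the surrounding loop is an assumption about how the call is used, not a copy of _compute_ncvs_within_radius().

```python
import numpy as np
from scipy.spatial import cKDTree

# Toy centroids and per-cell radii (made-up values).
coords = np.array([[0.0, 0.0], [1.5, 0.0], [5.0, 0.0]])
radii = np.array([1.0, 1.0, 1.0])
neighborhood_radius_factor = 2.0

tree = cKDTree(coords)
neighbors = []
for i in range(len(coords)):
    # Returns indices of all centroids within the scaled radius, including i itself.
    idxs = tree.query_ball_point(coords[i], r=radii[i] * neighborhood_radius_factor)
    neighbors.append(sorted(j for j in idxs if j != i))
```

Because the tree is built on centroids only, a neighbor counts if its centroid lies within radius times factor, regardless of how far its outline extends.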

LazDaria

Looks good to me! I would suggest re-running the notebook after merging into main, since the changes in join_points_regions might affect the results.

mjemons

lgtm!

mjemons · 2026-02-04T07:33:07Z

src/segtraq/rs/region_similarity.py

+        total_counts = (counts_intersection_raw + counts_remainder_raw).sum(axis=1).replace(0, np.nan)
+        counts_intersection_norm = counts_intersection_raw.div(total_counts, axis=0) * scale
+        counts_remainder_norm = counts_remainder_raw.div(total_counts, axis=0) * scale
+        counts_intersection_norm = np.log1p(counts_intersection_norm).fillna(0.0)


We should come back to this after the meeting with WH, where the normalisation step will be discussed.
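
For the discussion, a toy run of the normalisation as it appears in the diff. The input tables and the value of scale are assumptions, and the remainder is log-transformed the same way as the intersection, which is also an assumption because the excerpt ends before that line.

```python
import numpy as np
import pandas as pd

genes = ["A", "B"]
# Toy raw counts per cell (rows) and gene (columns); c2 has no transcripts.
counts_intersection_raw = pd.DataFrame([[3, 1], [0, 0]], index=["c1", "c2"], columns=genes)
counts_remainder_raw = pd.DataFrame([[4, 2], [0, 0]], index=["c1", "c2"], columns=genes)
scale = 10.0  # assumed value; the diff does not show how scale is set

# Both regions are divided by the same per-cell total (intersection + remainder).
total_counts = (counts_intersection_raw + counts_remainder_raw).sum(axis=1).replace(0, np.nan)
counts_intersection_norm = counts_intersection_raw.div(total_counts, axis=0) * scale
counts_remainder_norm = counts_remainder_raw.div(total_counts, axis=0) * scale

# log1p transform; cells with zero total counts become all-zero rows instead of NaN.
counts_intersection_norm = np.log1p(counts_intersection_norm).fillna(0.0)
counts_remainder_norm = np.log1p(counts_remainder_norm).fillna(0.0)
```

Dividing both regions by the shared total keeps their relative sizes, so a region holding most of a cell's transcripts still has larger normalised values than the other region.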

MeyerBender added 9 commits January 29, 2026 14:06

moved assert from SegTraQ.py into module, removed tqdm and updated test name to be in line with the new method name

956e59e

added functionality that ensures a nucleus isn't assigned to two or more cells

1990103

made variable and method names more verbose, moved validate_spatialdata into conftest

43a6916

refactored similarity between nucleus and cytoplasm

6f29a65

started refactor of border neighborhood similarity, but something seems off

a84148d

renamed region correlation module to region similarity

9a1cd6c

started updating docs

509fdda

started implementing runner

7c1de80

renamed IoU to iou and updated notebook

4f58606

MeyerBender requested review from LazDaria and mjemons February 3, 2026 09:45

LazDaria reviewed Feb 3, 2026

LazDaria approved these changes Feb 3, 2026

MeyerBender added 2 commits February 3, 2026 16:26

made docstrings less ambiguous

7a5e1b4

Merge remote-tracking branch 'origin/main' into review/region_correlation

121e58d

mjemons reviewed Feb 4, 2026

MeyerBender added 2 commits February 4, 2026 11:29

fixed conflicts arising from merge

de12d51

updated rs and ps notebook

1165fb2

MeyerBender merged commit 121ef41 into main Feb 4, 2026
4 checks passed

MeyerBender deleted the review/region_correlation branch February 4, 2026 10:52

MeyerBender merged 13 commits into main from review/region_correlation

3 participants
