Method to detect specimens in H&E images #1044

timtreis · 2025-09-30T19:39:51Z

Downstream functions such as #1036 need a robust way to detect where the tissue is in the image and how many separate specimens it contains.

This function
a) implements two algorithms for identifying the tissue (Otsu and Felzenszwalb)
b) deals with arbitrary channel input
c) heuristically tries to distinguish actual samples from everything else (dirt, Visium frame, etc.). As a fallback, one can pass in the expected number of samples, which should be more robust
d) adds the mask back to the sdata object with the same structure and transformations as the original image
e) does everything in dask, so it's quite fast
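To illustrate the Otsu option in (a), here is a minimal, dependency-free sketch of the thresholding idea. It is not the code from this PR (which is dask-based and not shown in this thread); `otsu_threshold` and `tissue_mask` are hypothetical names, and the sketch assumes a single 8-bit grayscale channel with a bright background.

```python
def otsu_threshold(values, n_bins=256):
    """Return the threshold maximizing between-class variance (Otsu's method).

    `values` is a flat iterable of integer intensities in [0, n_bins).
    """
    hist = [0] * n_bins
    for v in values:
        hist[v] += 1
    total = sum(hist)
    total_sum = sum(i * h for i, h in enumerate(hist))

    best_t, best_between = 0, -1.0
    w0 = 0    # number of pixels at or below the candidate threshold
    sum0 = 0  # summed intensity of those pixels
    for t in range(n_bins):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, the variance is undefined
        mean0 = sum0 / w0
        mean1 = (total_sum - sum0) / w1
        # Between-class variance, up to a constant factor of 1 / total**2.
        between = w0 * w1 * (mean0 - mean1) ** 2
        if between > best_between:
            best_between, best_t = between, t
    return best_t


def tissue_mask(gray, threshold):
    """Mark pixels at or below the threshold as tissue (assumes bright background)."""
    return [[px <= threshold for px in row] for row in gray]
```

For a dark-background modality such as DAPI, the comparison in `tissue_mask` would have to be flipped, which is one reason the heuristics in (c) matter.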

import squidpy as sq

sdata = sq.datasets.visium_hne_sdata()

sq.exp.im.detect_tissue(
    sdata,
    image_key="hne",
)

sdata

SpatialData object, with associated Zarr store: /Users/tim.treis/.cache/squidpy/visium_hne_sdata.zarr
├── Images
│     └── 'hne': DataTree[cyx] (3, 11757, 11291), (3, 5878, 5645), (3, 2939, 2822), (3, 1469, 1411)
├── Labels
│     └── 'hne_tissue': DataTree[yx] (11757, 11291), (5878, 5645), (2939, 2822), (1469, 1411)
├── Shapes
│     └── 'spots': GeoDataFrame shape: (2688, 2) (2D shapes)
└── Tables
      └── 'adata': AnnData (2688, 18078)
with coordinate systems:
    ▸ 'global', with elements:
        hne (Images), hne_tissue (Labels), spots (Shapes)
with the following elements not in the Zarr store:
    ▸ hne_tissue (Labels)

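import spatialdata_plot  # registers the .pl accessor on SpatialData objects

(
    sdata
    .pl.render_images("hne")
    # draw only the outline of the detected tissue mask on top of the image
    .pl.render_labels("hne_tissue", fill_alpha=0, contour_px=10, outline_alpha=1)
    .pl.show()
)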

Todo

Manual tests on a bunch of different inputs IHC / H&E / DAPI / multichannel etc
Test with multiple samples in the same image
Write unit tests for functions

selmanozleyen

Hi, here is some initial feedback; I will go into more detail tomorrow.

src/squidpy/experimental/__init__.py

src/squidpy/exp/im/_detect_tissue.py

timtreis · 2025-10-13T14:45:18Z

Notes to self: works fine on the happy path (white background, RGB specimen) but fails when the specimen is unusual. Another potential idea:

Use https://scikit-image.org/docs/0.25.x/auto_examples/segmentation/plot_trainable_segmentation.html

Start by defining the corners (potentially overridable by the user) as the background class.
Randomly sample squares across the image; if a square is (across channels) within median +/- 1 SD, assign it as an additional background tile; everything else is tissue.
This essentially builds a two-class mask, which is then fed to the classifier.
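The seeding step from these notes could look roughly like the sketch below. It only builds the sparse two-class labels; training the classifier (as in the linked scikit-image example) is left out. Everything here is an assumption for illustration: the function names, the tile size, the label values (1 = background, 2 = tissue), and the reading of "within median +/- 1 SD" as "the tile's per-channel median lies within one SD of the pooled corner statistics".

```python
import random
import statistics


def tile_values(image, y0, x0, size):
    """Collect the values of one size x size tile as one list per channel."""
    n_channels = len(image[0][0])
    channels = [[] for _ in range(n_channels)]
    for y in range(y0, y0 + size):
        for x in range(x0, x0 + size):
            for c, v in enumerate(image[y][x]):
                channels[c].append(v)
    return channels


def seed_labels(image, tile=4, n_samples=50, seed=0):
    """Return {(y0, x0): label} for sampled tiles; 1 = background, 2 = tissue.

    `image` is a nested list indexed as image[y][x][channel].
    """
    h, w = len(image), len(image[0])
    n_channels = len(image[0][0])

    # The four corner tiles are assumed to be background.
    corners = [(0, 0), (0, w - tile), (h - tile, 0), (h - tile, w - tile)]
    labels = {corner: 1 for corner in corners}

    # Pool the corner tiles to get per-channel background statistics.
    pooled = [[] for _ in range(n_channels)]
    for y0, x0 in corners:
        for c, vals in enumerate(tile_values(image, y0, x0, tile)):
            pooled[c].extend(vals)
    bg_median = [statistics.median(v) for v in pooled]
    bg_sd = [statistics.pstdev(v) for v in pooled]

    rng = random.Random(seed)
    for _ in range(n_samples):
        y0 = rng.randrange(0, h - tile + 1)
        x0 = rng.randrange(0, w - tile + 1)
        if (y0, x0) in labels:
            continue
        medians = [statistics.median(v) for v in tile_values(image, y0, x0, tile)]
        # Background only if every channel stays within one SD of the corners.
        is_background = all(
            abs(m - b) <= s for m, b, s in zip(medians, bg_median, bg_sd)
        )
        labels[(y0, x0)] = 1 if is_background else 2
    return labels
```

One caveat of this sketch: a perfectly uniform background gives an SD of zero, so a real implementation would probably want a minimum tolerance.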

hatch.toml

timtreis · 2025-10-26T19:16:28Z

Re-requesting review, also from @flying-sheep, because I had to change the hatch logic for things to work at all. Not sure how the tests passed before, but if I'm not mistaken, certain actions simply didn't exist?

flying-sheep

there were no commands missing, hatch pre-defines them: https://hatch.pypa.io/latest/config/internal/testing/#scripts

unfortunately adding the diff-cover command means you have to re-define everything, since you can’t just override individual scripts in an environment (known hatch issue).

To explain how things should work:

- all test/coverage dependencies should be in one spot (here the test extra)
- the hatch-test env should define the commands as explained in the link above:
  - run should run the tests using pytest ...
  - run-cov should run the tests using coverage run -m pytest ...
  - cov-combine should combine the .coverage-xyz files written from the subprocesses spawned by pytest-xdist
  - cov-report should do all the reporting
- locally, you should use hatch test [-c] [-p] ... which will use these commands behind the scenes
  - VS Code only understands pytest-cov, which is therefore only needed for VS Code’s test running GUI.
- in CI, we sadly need to use matrix_name, so we can’t use the nice hatch test command and need to manually use the run* and cov* commands
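For orientation, the four scripts described above map onto a hatch.toml fragment along these lines. This is a sketch based on the defaults documented at the hatch link above, not the file from this PR: the extra name `test` comes from the comment, the diff-cover addition is left out because its exact invocation is not shown in this thread, and the actual script bodies in the repository may differ.

```toml
# Sketch only: mirrors hatch's documented default test scripts.
[envs.hatch-test]
features = ["test"]  # all test/coverage dependencies live in the "test" extra

[envs.hatch-test.scripts]
run = "pytest{env:HATCH_TEST_ARGS:} {args}"
run-cov = "coverage run -m pytest{env:HATCH_TEST_ARGS:} {args}"
cov-combine = "coverage combine"
cov-report = "coverage report"
```

Because scripts in an environment cannot be overridden one at a time, adding a single custom step means restating all four entries, which matches the point made above.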

hatch.toml

flying-sheep · 2025-10-27T10:21:29Z

hmm, strange, upload successful, but the link doesn’t work, and I don’t see the “codecov” CI job below:

info - 2025-10-27 10:14:23,124 -- ci service found: github-actions
info - 2025-10-27 10:14:23,188 -- Found 1 coverage files to report
info - 2025-10-27 10:14:23,188 -- > /home/runner/work/squidpy/squidpy/coverage.xml
info - 2025-10-27 10:14:23,757 -- Your upload is now processing. When finished, results will be available at: https://app.codecov.io/github/selmanozleyen/squidpy/commit/11fedf3561e44764fd652ac51a47ae0eeba3ea64
info - 2025-10-27 10:14:24,000 -- Process Upload complete

timtreis · 2025-10-27T15:05:58Z

Should Selman's handle be in that link? Shouldn't it be something at the org level?

flying-sheep · 2025-10-27T15:10:22Z

yeah, that confused me too. the uploads should definitely go to the scverse org, something must be broken with the config here.

timtreis · 2025-10-27T16:04:44Z

Did Selman maybe overwrite the codecov token with a private one?

flying-sheep · 2025-10-27T16:08:13Z

possible, let’s make sure this isn’t the case.

/edit: done

selmanozleyen · 2025-10-27T19:44:25Z

True, there wasn't a CODECOV token because we now take it from the env variables in the GH repo settings, and I just used mine. I thought it would use the org's since I'm part of it. Nice catch.

* mvp for function; without testgs
* added option to retain holes
* refactor + 1 test
* added missing import
* renamed test so that a plot would be generated
* added img from runner; cross-os-data-cache
* improved docstring
* added data download script to correct location
* updated hatch commands
* modified coverage combine
* removed superflous combine step
* first download data, then run tests
* attempt to simplify
* aligned testing
* updated toml
* aligned __init__ files
* no uv cache for data download
* removed download step that'd never get hit
* simplify
* parallel
* speed up tests

---------

Co-authored-by: Phil Schaf <[email protected]>

mvp for function; without testgs

dacc00c

timtreis linked an issue Sep 30, 2025 that may be closed by this pull request

Function to automatically generate tissue masks in H&E #1042

Closed

added option to retain holes

34eab19

selmanozleyen reviewed Sep 30, 2025

src/squidpy/experimental/__init__.py

src/squidpy/exp/im/_detect_tissue.py (outdated)

flying-sheep reviewed Oct 1, 2025

src/squidpy/exp/im/_detect_tissue.py (outdated)

timtreis and others added 7 commits October 26, 2025 17:08

Merge branch 'main' into feature/issue1042-function-to-automatically-…

f806188

…generate-tissue-masks-in-he

refactor + 1 test

ec5fc41

added missing import

6fd630e

renamed test so that a plot would be generated

b3cdc9e

added img from runner; cross-os-data-cache

97ad9e3

improved docstring

7323df5

added data download script to correct location

b5e621c

timtreis commented Oct 26, 2025

hatch.toml

timtreis added 8 commits October 26, 2025 19:28

updated hatch commands

4b52a36

modified coverage combine

c5e0946

removed superflous combine step

cb68a27

first download data, then run tests

76b4065

attempt to simplify

8c3e62a

aligned testing

aa3a964

updated toml

9a879ec

aligned __init__ files

efd1eb7

timtreis requested review from flying-sheep and selmanozleyen October 26, 2025 19:15

timtreis added 2 commits October 26, 2025 20:20

no uv cache for data download

1e8d6d0

removed download step that'd never get hit

5b9b4e0

flying-sheep reviewed Oct 27, 2025

hatch.toml (outdated)

hatch.toml

flying-sheep added 2 commits October 27, 2025 10:50

simplify

d7ee04b

parallel

d6b6d18

speed up tests

11fedf3

flying-sheep approved these changes Oct 27, 2025

timtreis merged commit 900d7b3 into main Oct 27, 2025
10 checks passed

timtreis deleted the feature/issue1042-function-to-automatically-generate-tissue-masks-in-he branch October 27, 2025 15:05
