MicroSSIM: Improved Structural Similarity for Comparing Microscopy Data

1Human Technopole, Italy
ECCV 2024

Summary

We find that the classical SSIM measure is not well suited for comparing predictions made on low-SNR microscopy data with the corresponding high-SNR groundtruth. This kind of comparison is needed for evaluation of unsupervised denoising tasks. We report the phenomenon of saturation, which is the behaviour of SSIM to achieve a high score even when the predictions are far from the groundtruth. We show this unambigously by computing SSIM between a microscopy image and pure noise. We propose a new measure, MicroSSIM, which is an extension of SSIM that is better suited for this domain. We show that MicroSSIM has less saturation and that it focuses on the foreground content. Our approach is such that it can be applied to numerous SSIM variants, and we show this by extending the (Multiscale) MS-SSIM measure to MicroMS3IM.

MY ALT TEXT

Failure mode of SSIM on Microscopy Data: (Top) A noisy microscopy image, i.e. a micrograph, its denoised version predicted using N2V, and the corresponding High-SNR (noise free) ground truth is shown. There is a problem in the evaluation of denoising quality, which is that the pixel intensity distribution of the ground truth and the prediction (as shown in respective insets) are very different. This is specifically true for the foreground content which comprises brighter pixels. So applying SSIM directly on it will not give a sensible value. We solve this issue with MicroSSIM. (Bottom) We show one example to demonstrate an apparent counter-intuitive behavior of SSIM. The SSIM between a natural image (taken from Imagenet) and a pure noisy image drawn from the uniform distribution is much lower than the SSIM between a micrograph and a noisy image with identical distribution as before. The expectation naturally is to have SSIM ≈ 0 in both cases. We solve this issue with MicroSSIM and appropriate data pre-processing and show it in bottom right plot where 30 random microscopy images are used.

Our Approach

We remove the background from the ground truth (x) and the prediction (y). We scale the prediction with a scalar. We then compute the SSIM between these background removed images. We use the same scalars for all images in a dataset.

MY ALT TEXT

BibTeX


        @InProceedings{ashesh_2024_BIC_ECCV,
          author       = {Ashesh, Joran Deschamps and
                          Florian Jug},
          title        = {MicroSSIM: Improved Structural Similarity for Comparing Microscopy Data},
          month        = {October},
          year         = {2024},
          booktitle = {Bio Image Computing workshop at ECCV 2024},
        }