We find that the classical SSIM measure is not well suited for comparing predictions made on low-SNR microscopy data with the corresponding high-SNR groundtruth. This kind of comparison is needed for evaluation of unsupervised denoising tasks. We report the phenomenon of saturation, which is the behaviour of SSIM to achieve a high score even when the predictions are far from the groundtruth. We show this unambigously by computing SSIM between a microscopy image and pure noise. We propose a new measure, MicroSSIM, which is an extension of SSIM that is better suited for this domain. We show that MicroSSIM has less saturation and that it focuses on the foreground content. Our approach is such that it can be applied to numerous SSIM variants, and we show this by extending the (Multiscale) MS-SSIM measure to MicroMS3IM.
We remove the background from the ground truth (x) and the prediction (y). We scale the prediction with a scalar. We then compute the SSIM between these background removed images. We use the same scalars for all images in a dataset.
@InProceedings{ashesh_2024_BIC_ECCV,
author = {Ashesh, Joran Deschamps and
Florian Jug},
title = {MicroSSIM: Improved Structural Similarity for Comparing Microscopy Data},
month = {October},
year = {2024},
booktitle = {Bio Image Computing workshop at ECCV 2024},
}