A benchmark for hypothalamus segmentation on T1-weighted MR images

Livia Rodrigues; Thiago Junqueira Ribeiro Rezende; Guilherme Wertheimer; Yves Santos; Marcondes França; Leticia Rittner

doi:10.1016/j.neuroimage.2022.119741

A benchmark for hypothalamus segmentation on T1-weighted MR images

Neuroimage. 2022 Dec 1:264:119741. doi: 10.1016/j.neuroimage.2022.119741. Epub 2022 Nov 8.

Authors

Livia Rodrigues¹, Thiago Junqueira Ribeiro Rezende², Guilherme Wertheimer², Yves Santos², Marcondes França², Leticia Rittner³

Affiliations

¹ Medical Image Computing Lab, School of Electrical and Computer Engineering (FEEC), University of Campinas, Albert Einstein Street, 400, Campinas, SP 13083-887, Brazil. Electronic address: l180545@dac.unicamp.br.
² Department of Neurology, School of Medical Sciences, University of Campinas, Tessalia Vieira de Camargo Street, 126, Campinas, SP 13083-887, Brazil.
³ Medical Image Computing Lab, School of Electrical and Computer Engineering (FEEC), University of Campinas, Albert Einstein Street, 400, Campinas, SP 13083-887, Brazil.

PMID: 36368499
DOI: 10.1016/j.neuroimage.2022.119741

Abstract

The hypothalamus is a small brain structure that plays essential roles in sleep regulation, body temperature control, and metabolic homeostasis. Hypothalamic structural abnormalities have been reported in neuropsychiatric disorders, such as schizophrenia, amyotrophic lateral sclerosis, and Alzheimer's disease. Although mag- netic resonance (MR) imaging is the standard examination method for evaluating this region, hypothalamic morphological landmarks are unclear, leading to subjec- tivity and high variability during manual segmentation. Due to these limitations, it is common to find contradicting results in the literature regarding hypothalamic volumetry. To the best of our knowledge, only two automated methods are available in the literature for hypothalamus segmentation, the first of which is our previous method based on U-Net. However, both methods present performance losses when predicting images from different datasets than those used in training. Therefore, this project presents a benchmark consisting of a diverse T1-weighted MR image dataset comprising 1381 subjects from IXI, CC359, OASIS, and MiLI (the latter created specifically for this benchmark). All data were provided using automatically generated hypothalamic masks and a subset containing manually annotated masks. As a baseline, a method for fully automated segmentation of the hypothalamus on T1-weighted MR images with a greater generalization ability is presented. The pro- posed method is a teacher-student-based model with two blocks: segmentation and correction, where the second corrects the imperfections of the first block. After using three datasets for training (MiLI, IXI, and CC359), the prediction performance of the model was measured on two test sets: the first was composed of data from IXI, CC359, and MiLI, achieving a Dice coefficient of 0.83; the second was from OASIS, a dataset not used for training, achieving a Dice coefficient of 0.74. The dataset, the baseline model, and all necessary codes to reproduce the experiments are available at https://github.com/MICLab-Unicamp/HypAST and https://sites.google.com/ view/calgary-campinas-dataset/hypothalamus-benchmarking. In addition, a leaderboard will be maintained with predictions for the test set submitted by anyone working on the same task.

Keywords: Benchmarking; Dataset; Deep Learning; Hypothalamus; MRI; Segmentation.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Alzheimer Disease*
Humans
Image Processing, Computer-Assisted* / methods
Magnetic Resonance Imaging / methods