PottsMGNet : A Mathematical Explanation of Encoder-Decoder Based Neural Networks

Xue Cheng TAI, Hao LIU*, Raymond CHAN

*Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

1 Citation (Scopus)

Abstract

For problems in image processing and many other fields, a large class of effective neural networks has encoder-decoder-based architectures. Although these networks have shown impressive performance, mathematical explanations of their architectures are still underdeveloped. In this paper, we study the encoder-decoder-based network architecture from the algorithmic perspective and provide a mathematical explanation. We use the two-phase Potts model for image segmentation as an example for our explanations. We associate the segmentation problem with a control problem in the continuous setting. Then, the continuous control model is time discretized by an operatorsplitting scheme, the PottsMGNet, and space discretized by the multigrid method. We show that the resulting discrete PottsMGNet is equivalent to an encoder-decoder-based network. With minor modifications, it is shown that a number of the popular encoder-decoder-based neural networks are just instances of the proposed PottsMGNet. By incorporating the soft-threshold-dynamics into the PottsMGNet as a regularizer, the PottsMGNet has shown to be robust with the network parameters such as network width and depth and has achieved remarkable performance on datasets with very large noise. In nearly all our experiments, the new network always performs better than or as well as on accuracy and dice score compared to existing networks for image segmentation.

Original languageEnglish
Pages (from-to)540-594
Number of pages55
JournalSIAM Journal on Imaging Sciences
Volume17
Issue number1
Early online date7 Mar 2024
DOIs
Publication statusPublished - Mar 2024
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2024 Society for Industrial and Applied Mathematics.

Funding

Funding: The work of the first author is partially supported by HKRGC-NSFC Grant N-CityU214-19, HKRGC CRF Grant C1013-21GF and NORCE Kompetanseoppbygging program. The work of the second author is partially supported by NSFC 12201530, HKRGC ECS 22302123 and HKBU 179356. The work of the third author is partially supported by HKRGC GRF grants CityU1101120 and CityU11309922 and CRF grant C1013-21GF. \\dagger NORCE Norwegian Research Centre, Nyg\\ar rdstangen, NO-5838 Bergen, Norway ([email protected], [email protected]). \\ddagger Corresponding author. Mathematics, Hong Kong Baptist University, Kowloon Tong, Hong Kong (haoliu@ hkbu.edu.hk). \\S Department of Mathematics, City University of Hong Kong, Hong Kong; and Hong Kong Centre for Cerebro-Cardiovascular Health Engineering ([email protected]).

Keywords

  • deep neural network
  • image segmentation
  • operator splitting
  • Potts model

Fingerprint

Dive into the research topics of 'PottsMGNet : A Mathematical Explanation of Encoder-Decoder Based Neural Networks'. Together they form a unique fingerprint.

Cite this