Multimodal Deep Unfolding for Guided Image Super-Resolution

The reconstruction of a high-resolution image given a low-resolution observation is an ill-posed inverse problem in imaging. Deep learning methods rely on training data to learn an end-to-end mapping from a low-resolution input to a high-resolution output. Unlike existing deep multimodal models that...

Full description

Bibliographic Details
Published in:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - PP(2020), 12 Aug.
Main Author:	Marivani, Iman (Author)
Other Authors:	Tsiligianni, Evaggelia, Cornelis, Bruno, Deligiannis, Nikos
Format:	Online Article
Language:	English
Published:	2020
Access to the parent work:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Subjects:	Journal Article


LEADER	01000caa a22002652c 4500
001	NLM313589232
003	DE-627
005	20250227185019.0
007	cr uuu---uuuuu
008	231225s2020 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1109/TIP.2020.3014729 \|2 doi
028	5	2	\|a pubmed25n1045.xml
035			\|a (DE-627)NLM313589232
035			\|a (NLM)32784140
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Marivani, Iman \|e verfasserin \|4 aut
245	1	0	\|a Multimodal Deep Unfolding for Guided Image Super-Resolution
264		1	\|c 2020
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Revised 27.02.2024
500			\|a published: Print-Electronic
500			\|a Citation Status Publisher
520			\|a The reconstruction of a high-resolution image given a low-resolution observation is an ill-posed inverse problem in imaging. Deep learning methods rely on training data to learn an end-to-end mapping from a low-resolution input to a high-resolution output. Unlike existing deep multimodal models that do not incorporate domain knowledge about the problem, we propose a multimodal deep learning design that incorporates sparse priors and allows the effective integration of information from another image modality into the network architecture. Our solution relies on a novel deep unfolding operator, performing steps similar to an iterative algorithm for convolutional sparse coding with side information; therefore, the proposed neural network is interpretable by design. The deep unfolding architecture is used as a core component of a multimodal framework for guided image super-resolution. An alternative multimodal design is investigated by employing residual learning to improve the training efficiency. The presented multimodal approach is applied to super-resolution of near-infrared and multi-spectral images as well as depth upsampling using RGB images as side information. Experimental results show that our model outperforms state-of-the-art methods.
650		4	\|a Journal Article
700	1		\|a Tsiligianni, Evaggelia \|e verfasserin \|4 aut
700	1		\|a Cornelis, Bruno \|e verfasserin \|4 aut
700	1		\|a Deligiannis, Nikos \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t IEEE transactions on image processing : a publication of the IEEE Signal Processing Society \|d 1992 \|g PP(2020) vom: 12. Aug. \|w (DE-627)NLM09821456X \|x 1941-0042 \|7 nnas
773	1	8	\|g volume:PP \|g year:2020 \|g day:12 \|g month:08
856	4	0	\|u http://dx.doi.org/10.1109/TIP.2020.3014729 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_NLM
912			\|a GBV_ILN_350
951			\|a AR
952			\|d PP \|j 2020 \|b 12 \|c 08