
Showing papers on "Iterative reconstruction published in 2019"


Journal ArticleDOI
TL;DR: This paper proposes the deep Laplacian Pyramid Super-Resolution Network for fast and accurate image super-resolution; it utilizes recursive layers to share parameters across as well as within pyramid levels, and thus drastically reduces the number of parameters.
Abstract: Convolutional neural networks have recently demonstrated high-quality reconstruction for single image super-resolution. However, existing methods often require a large number of network parameters and entail heavy computational loads at runtime for generating high-accuracy super-resolution results. In this paper, we propose the deep Laplacian Pyramid Super-Resolution Network for fast and accurate image super-resolution. The proposed network progressively reconstructs the sub-band residuals of high-resolution images at multiple pyramid levels. In contrast to existing methods that involve the bicubic interpolation for pre-processing (which results in large feature maps), the proposed method directly extracts features from the low-resolution input space and thereby entails low computational loads. We train the proposed network with deep supervision using the robust Charbonnier loss functions and achieve high-quality image reconstruction. Furthermore, we utilize the recursive layers to share parameters across as well as within pyramid levels, and thus drastically reduce the number of parameters. Extensive quantitative and qualitative evaluations on benchmark datasets show that the proposed algorithm performs favorably against the state-of-the-art methods in terms of run-time and image quality.

510 citations
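Illustrative sketch (not the authors' released code): a minimal PyTorch version of the robust Charbonnier loss and one Laplacian-pyramid level as described in the abstract above; channel counts and layer depths are assumptions.

import torch
import torch.nn as nn

def charbonnier_loss(pred, target, eps=1e-3):
    # Robust Charbonnier penalty sqrt(diff^2 + eps^2), averaged over pixels.
    return torch.sqrt((pred - target) ** 2 + eps ** 2).mean()

class PyramidLevel(nn.Module):
    # One 2x level: predict a sub-band residual and add it to the upsampled image.
    def __init__(self, ch=64):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(ch, ch, 3, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(ch, ch, 3, padding=1), nn.LeakyReLU(0.2),
        )
        self.up_feat = nn.ConvTranspose2d(ch, ch, 4, stride=2, padding=1)
        self.to_residual = nn.Conv2d(ch, 3, 3, padding=1)
        self.up_img = nn.ConvTranspose2d(3, 3, 4, stride=2, padding=1)

    def forward(self, feat, img):
        feat = self.up_feat(self.features(feat))          # feature branch, 2x upsampled
        img = self.up_img(img) + self.to_residual(feat)   # image branch + sub-band residual
        return feat, img

Reusing a single PyramidLevel instance at every scale would mirror the recursive parameter sharing across pyramid levels that the paper credits for its small parameter count.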


Journal ArticleDOI
TL;DR: A novel CS framework that uses generative adversarial networks (GANs) to model the (low-dimensional) manifold of high-quality MR images, retrieving higher quality images with improved fine texture details compared with conventional wavelet-based and dictionary-learning-based CS schemes as well as with deep-learning-based schemes using pixel-wise training.
Abstract: Undersampled magnetic resonance image (MRI) reconstruction is typically an ill-posed linear inverse task. The time- and resource-intensive computations require tradeoffs between accuracy and speed. In addition, state-of-the-art compressed sensing (CS) analytics are not cognizant of the image diagnostic quality. To address these challenges, we propose a novel CS framework that uses generative adversarial networks (GAN) to model the (low-dimensional) manifold of high-quality MR images. Leveraging a mixture of least-squares (LS) GANs and pixel-wise $\ell _{1}/\ell _{2}$ cost, a deep residual network with skip connections is trained as the generator that learns to remove the aliasing artifacts by projecting onto the image manifold. The LSGAN learns the texture details, while the $\ell _{1}/\ell _{2}$ cost suppresses high-frequency noise. A discriminator network, which is a multilayer convolutional neural network (CNN), plays the role of a perceptual cost that is then jointly trained based on high-quality MR images to score the quality of retrieved images. In the operational phase, an initial aliased estimate (e.g., simply obtained by zero-filling) is propagated into the trained generator to output the desired reconstruction. This demands a very low computational overhead. Extensive evaluations are performed on a large contrast-enhanced MR dataset of pediatric patients. Images rated by expert radiologists corroborate that GANCS retrieves higher quality images with improved fine texture details compared with conventional wavelet-based and dictionary-learning-based CS schemes as well as with deep-learning-based schemes using pixel-wise training. In addition, it offers reconstruction times of under a few milliseconds, which are two orders of magnitude faster than the current state-of-the-art CS-MRI schemes.

468 citations
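A minimal sketch of the generator objective described above: an LSGAN adversarial term mixed with pixel-wise l1/l2 costs. The weighting coefficients are assumptions, not the paper's values.

import torch
import torch.nn.functional as F

def generator_loss(d_fake, recon, target, w_adv=0.1, w_l1=1.0, w_l2=1.0):
    # LSGAN generator term: push the discriminator's score for the
    # reconstruction toward the "real" label 1 (least-squares GAN).
    adv = F.mse_loss(d_fake, torch.ones_like(d_fake))
    # Pixel-wise l1/l2 mixture: keeps the reconstruction close to ground
    # truth and suppresses high-frequency noise, while the adversarial
    # term restores texture detail.
    pix = w_l1 * F.l1_loss(recon, target) + w_l2 * F.mse_loss(recon, target)
    return w_adv * adv + pix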


Journal ArticleDOI
TL;DR: This paper proposes a novel convolutional recurrent neural network architecture which reconstructs high-quality cardiac MR images from highly undersampled k-space data by jointly exploiting the dependencies of the temporal sequences as well as the iterative nature of traditional optimization algorithms.
Abstract: Accelerating the data acquisition of dynamic magnetic resonance imaging leads to a challenging ill-posed inverse problem, which has received great interest from both the signal processing and machine learning communities over the last decades. The key ingredient to the problem is how to exploit the temporal correlations of the MR sequence to resolve aliasing artifacts. Traditionally, such observation led to a formulation of an optimization problem, which was solved using iterative algorithms. Recently, however, deep learning-based approaches have gained significant popularity due to their ability to solve general inverse problems. In this paper, we propose a unique, novel convolutional recurrent neural network architecture which reconstructs high quality cardiac MR images from highly undersampled k-space data by jointly exploiting the dependencies of the temporal sequences as well as the iterative nature of the traditional optimization algorithms. In particular, the proposed architecture embeds the structure of the traditional iterative algorithms, efficiently modeling the recurrence of the iterative reconstruction stages by using recurrent hidden connections over such iterations. In addition, spatio–temporal dependencies are simultaneously learnt by exploiting bidirectional recurrent hidden connections across time sequences. The proposed method is able to learn both the temporal dependence and the iterative reconstruction process effectively with only a very small number of parameters, while outperforming current MR reconstruction methods in terms of reconstruction accuracy and speed.

408 citations
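A hedged sketch of one unrolled iteration in the spirit of the architecture above: a recurrent hidden state carried across iterations plus a k-space data-consistency step. Layer sizes and the single-coil Cartesian data-consistency form are assumptions.

import torch
import torch.nn as nn

class CRNNIteration(nn.Module):
    def __init__(self, ch=64):
        super().__init__()
        self.inp = nn.Conv2d(2, ch, 3, padding=1)   # real/imag as 2 channels
        self.hid = nn.Conv2d(ch, ch, 3, padding=1)  # hidden-to-hidden across iterations
        self.out = nn.Conv2d(ch, 2, 3, padding=1)

    def forward(self, x, h, k0, mask):
        # h: hidden state from the previous unrolled iteration (zeros at start);
        # k0: measured k-space; mask: 1.0 where k-space was sampled, else 0.0.
        h = torch.relu(self.inp(x) + self.hid(h))
        x = x + self.out(h)                          # residual image update
        xc = torch.complex(x[:, 0], x[:, 1])
        k = torch.fft.fft2(xc)
        k = mask * k0 + (1 - mask) * k               # data consistency in k-space
        xc = torch.fft.ifft2(k)
        return torch.stack([xc.real, xc.imag], dim=1), h

Stacking several such iterations, with h passed along, is what lets the network model the recurrence of traditional iterative reconstruction with very few parameters.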


Proceedings ArticleDOI
Yurui Ren, Xiaoming Yu, Ruonan Zhang, Thomas H. Li, Shan Liu, Ge Li
01 Oct 2019
TL;DR: A two-stage model is proposed that splits the inpainting task into two parts, structure reconstruction and texture generation, and shows superior performance on multiple publicly available datasets.
Abstract: Image inpainting techniques have shown significant improvements by using deep neural networks recently. However, most of them may either fail to reconstruct reasonable structures or restore fine-grained textures. In order to solve this problem, in this paper, we propose a two-stage model which splits the inpainting task into two parts: structure reconstruction and texture generation. In the first stage, edge-preserved smooth images are employed to train a structure reconstructor which completes the missing structures of the inputs. In the second stage, based on the reconstructed structures, a texture generator using appearance flow is designed to yield image details. Experiments on multiple publicly available datasets show the superior performance of the proposed network.

314 citations


Journal ArticleDOI
TL;DR: An improved k‐space reconstruction method using scan‐specific deep learning that is trained on autocalibration signal (ACS) data is developed.
Abstract: Purpose: To develop an improved k-space reconstruction method using scan-specific deep learning that is trained on autocalibration signal (ACS) data. Theory: Robust artificial-neural-networks for k-space interpolation (RAKI) reconstruction trains convolutional neural networks on ACS data. This enables nonlinear estimation of missing k-space lines from acquired k-space data with improved noise resilience, as opposed to conventional linear k-space interpolation-based methods, such as GRAPPA, which are based on linear convolutional kernels. Methods: The training algorithm is implemented using a mean square error loss function over the target points in the ACS region, using a gradient descent algorithm. The neural network contains 3 layers of convolutional operators, with 2 of these including nonlinear activation functions. The noise performance and reconstruction quality of the RAKI method was compared with GRAPPA in phantom, as well as in neurological and cardiac in vivo data sets. Results: Phantom imaging shows that the proposed RAKI method outperforms GRAPPA at high (≥4) acceleration rates, both visually and quantitatively. Quantitative cardiac imaging shows improved noise resilience at high acceleration rates (rate 4: 23%; rate 5: 48%) over GRAPPA. The same trend of improved noise resilience is also observed in high-resolution brain imaging at high acceleration rates. Conclusion: The RAKI method offers a training-database-free deep learning approach for MRI reconstruction, with the potential to improve many existing reconstruction approaches, and is compatible with conventional data acquisition protocols.

309 citations
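A toy sketch of the scan-specific idea: a three-layer CNN (two nonlinear activations) fitted by gradient descent to the ACS block alone. The tensors below are random stand-ins, and the kernel/channel sizes are placeholders rather than the paper's.

import torch
import torch.nn as nn

torch.manual_seed(0)
# Stand-ins for ACS k-space (e.g., real/imag coil channels over a 40x40 block).
acs_input = torch.randn(1, 32, 40, 40)    # acquired lines
acs_target = torch.randn(1, 32, 40, 40)   # lines the network must synthesize

raki = nn.Sequential(                     # 3 conv layers, 2 with nonlinearities
    nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
    nn.Conv2d(64, 32, 3, padding=1), nn.ReLU(),
    nn.Conv2d(32, 32, 3, padding=1),
)

opt = torch.optim.Adam(raki.parameters(), lr=3e-4)
for step in range(500):                   # trained on this scan's ACS data only
    loss = nn.functional.mse_loss(raki(acs_input), acs_target)
    opt.zero_grad(); loss.backward(); opt.step()

Because training uses only the scan's own calibration data, no external training database is needed, which is the property the conclusion highlights.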


Journal ArticleDOI
TL;DR: A generative adversarial network (GAN)-based edge-enhancement network (EEGAN) for robust satellite image SR reconstruction is proposed, along with an adversarial learning strategy that is insensitive to noise.
Abstract: The current superresolution (SR) methods based on deep learning have shown remarkable comparative advantages but remain unsatisfactory in recovering the high-frequency edge details of the images in noise-contaminated imaging conditions, e.g., remote sensing satellite imaging. In this paper, we propose a generative adversarial network (GAN)-based edge-enhancement network (EEGAN) for robust satellite image SR reconstruction along with the adversarial learning strategy that is insensitive to noise. In particular, EEGAN consists of two main subnetworks: an ultradense subnetwork (UDSN) and an edge-enhancement subnetwork (EESN). In UDSN, a group of 2-D dense blocks is assembled for feature extraction and to obtain an intermediate high-resolution result that looks sharp but is eroded with artifacts and noise, as previous GAN-based methods are. Then, EESN is constructed to extract and enhance the image contours by purifying the noise-contaminated components with mask processing. The recovered intermediate image and enhanced edges can be combined to generate the result that enjoys high credibility and clear contents. Extensive experiments on the Kaggle Open Source Dataset, Jilin-1 video satellite images, and DigitalGlobe show superior reconstruction performance compared to the state-of-the-art SR approaches.

305 citations


Journal ArticleDOI
TL;DR: This paper takes a concise look at the overall evolution of CT image reconstruction and its clinical implementations, finding that IR is essential for photon-counting CT, phase-contrast CT, and dark-field CT.
Abstract: The first CT scanners in the early 1970s already used iterative reconstruction algorithms; however, lack of computational power prevented their clinical use. In fact, it took until 2009 for the first iterative reconstruction algorithms to become commercially available and replace conventional filtered back projection. Since then, this technique has generated a veritable hype in the field of radiology. Within a few years, all major CT vendors introduced iterative reconstruction algorithms for clinical routine, which evolved rapidly into increasingly advanced reconstruction algorithms. The complexity of these algorithms ranges from hybrid and model-based to fully iterative algorithms. As a result, the number of scientific publications on this topic has skyrocketed over the last decade. But what exactly has this technology brought us so far? And what can we expect from future hardware as well as software developments, such as photon-counting CT and artificial intelligence? This paper will try to answer those questions by taking a concise look at the overall evolution of CT image reconstruction and its clinical implementations. Subsequently, we give an outlook on future developments in this domain. KEY POINTS: • Advanced CT reconstruction methods are indispensable in the current clinical setting. • IR is essential for photon-counting CT, phase-contrast CT, and dark-field CT. • Artificial intelligence will potentially further increase the performance of reconstruction methods.

304 citations


Journal ArticleDOI
TL;DR: In this article, a modularized neural network for low-dose CT (LDCT) was proposed and compared with commercial iterative reconstruction methods from three leading CT vendors, and the learned workflow allows radiologists-in-the-loop to optimize the denoising depth in a task-specific fashion.
Abstract: Commercial iterative reconstruction techniques help to reduce CT radiation dose, but altered image appearance and artifacts limit their adoptability and potential use. Deep learning has been investigated for low-dose CT (LDCT). Here we design a modularized neural network for LDCT and compare it with commercial iterative reconstruction methods from three leading CT vendors. While popular networks are trained for an end-to-end mapping, our network performs an end-to-process mapping so that intermediate denoised images are obtained with associated noise reduction directions towards a final denoised image. The learned workflow allows radiologists-in-the-loop to optimize the denoising depth in a task-specific fashion. Our network was trained with the Mayo LDCT Dataset, and tested on separate chest and abdominal CT exams from Massachusetts General Hospital. The best deep learning reconstructions were systematically compared to the best iterative reconstructions in a double-blinded reader study. This study confirms that our deep learning approach performed either favorably or comparably in terms of noise suppression and structural fidelity, and is much faster than the commercial iterative reconstruction algorithms.

265 citations


Proceedings ArticleDOI
01 Oct 2019
TL;DR: This work presents Occupancy Flow, a novel spatio-temporal representation of time-varying 3D geometry with implicit correspondences that can be used for interpolation and reconstruction tasks, and argues that it is a promising new 4D representation for a variety of spatio-temporal reconstruction tasks.
Abstract: Deep learning based 3D reconstruction techniques have recently achieved impressive results. However, while state-of-the-art methods are able to output complex 3D geometry, it is not clear how to extend these results to time-varying topologies. Approaches treating each time step individually lack continuity and exhibit slow inference, while traditional 4D reconstruction methods often utilize a template model or discretize the 4D space at fixed resolution. In this work, we present Occupancy Flow, a novel spatio-temporal representation of time-varying 3D geometry with implicit correspondences. Towards this goal, we learn a temporally and spatially continuous vector field which assigns a motion vector to every point in space and time. In order to perform dense 4D reconstruction from images or sparse point clouds, we combine our method with a continuous 3D representation. Implicitly, our model yields correspondences over time, thus enabling fast inference while providing a sound physical description of the temporal dynamics. We show that our method can be used for interpolation and reconstruction tasks, and demonstrate the accuracy of the learned correspondences. We believe that Occupancy Flow is a promising new 4D representation which will be useful for a variety of spatio-temporal reconstruction tasks.

262 citations
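A minimal sketch of the core idea above: a network assigns a velocity to every point in space and time, and integrating that field transports points from t=0 to t=1, yielding implicit correspondences. The network size and the forward-Euler integrator are assumptions.

import torch
import torch.nn as nn

class VelocityField(nn.Module):
    # Maps (x, y, z, t) to a 3-D motion vector.
    def __init__(self, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(4, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 3),
        )

    def forward(self, pts, t):
        tcol = torch.full((pts.shape[0], 1), float(t))   # broadcast time to each point
        return self.net(torch.cat([pts, tcol], dim=1))

def advect(field, pts, steps=16):
    # Forward-Euler integration of point trajectories through the field;
    # the final positions are the correspondences of the inputs at t=1.
    dt = 1.0 / steps
    for i in range(steps):
        pts = pts + dt * field(pts, i * dt)
    return pts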


Proceedings ArticleDOI
16 Jun 2019
TL;DR: Deep3DFaceReconstruction leverages a robust, hybrid loss function for weakly-supervised learning which takes into account both low-level and perception-level information for supervision, and performs multi-image face reconstruction by exploiting complementary information from different images for shape aggregation.
Abstract: Recently, deep learning based 3D face reconstruction methods have shown promising results in both quality and efficiency. However, training deep neural networks typically requires a large volume of data, whereas face images with ground-truth 3D face shapes are scarce. In this paper, we propose a novel deep 3D face reconstruction approach that 1) leverages a robust, hybrid loss function for weakly-supervised learning which takes into account both low-level and perception-level information for supervision, and 2) performs multi-image face reconstruction by exploiting complementary information from different images for shape aggregation. Our method is fast, accurate, and robust to occlusion and large pose. We provide comprehensive experiments on MICC Florence and Facewarehouse datasets, systematically comparing our method with fifteen recent methods and demonstrating its state-of-the-art performance. Code available at https://github.com/Microsoft/Deep3DFaceReconstruction

255 citations


Journal ArticleDOI
TL;DR: A joint model is built to integrate the nonlocal self-similarity of video/hyperspectral frames and the rank minimization approach with the SCI sensing process and an alternating minimization algorithm is developed to solve the non-convex problem of SCI reconstruction.
Abstract: Snapshot compressive imaging (SCI) refers to compressive imaging systems where multiple frames are mapped into a single measurement, with video compressive imaging and hyperspectral compressive imaging as two representative applications. Though exciting results of high-speed videos and hyperspectral images have been demonstrated, the poor reconstruction quality precludes SCI from wide applications. This paper aims to boost the reconstruction quality of SCI via exploiting the high-dimensional structure in the desired signal. We build a joint model to integrate the nonlocal self-similarity of video/hyperspectral frames and the rank minimization approach with the SCI sensing process. Following this, an alternating minimization algorithm is developed to solve this non-convex problem. We further investigate the special structure of the sampling process in SCI to tackle the computational workload and memory issues in SCI reconstruction. Both simulation and real data (captured by four different SCI cameras) results demonstrate that our proposed algorithm leads to significant improvements compared with current state-of-the-art algorithms. We hope our results will encourage researchers and engineers to pursue compressive imaging further in real applications.
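To make the forward model and the alternating structure concrete, here is a hedged NumPy sketch: a Euclidean projection onto the measurement constraint alternating with a prior step, where a generic denoiser stands in for the paper's joint nonlocal low-rank model.

import numpy as np

def sci_forward(x, masks):
    # x: (H, W, B) frames; masks: (H, W, B) coding patterns -> one 2-D measurement.
    return (x * masks).sum(axis=2)

def sci_reconstruct(y, masks, denoise, iters=50):
    x = masks * y[..., None]                       # crude initialization
    norm = (masks ** 2).sum(axis=2) + 1e-8         # per-pixel normalizer
    for _ in range(iters):
        z = denoise(x)                             # prior step (paper: nonlocal low-rank)
        resid = y - sci_forward(z, masks)          # mismatch in measurement domain
        x = z + masks * (resid / norm)[..., None]  # project back onto y = H(x)
    return x

For a quick smoke test one could pass denoise=lambda v: v (no prior) or a Gaussian filter; the reconstruction quality then hinges entirely on the strength of the plugged-in prior.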

Journal ArticleDOI
TL;DR: A novel Deep Residual Reconstruction Network (DR2-Net) is proposed to reconstruct the image from its compressively sensed measurement; it outperforms traditional iterative methods and recent deep-learning-based methods by large margins at measurement rates of 0.01, 0.1, and 0.25.

Journal ArticleDOI
TL;DR: DLR improved the quality of abdominal U-HRCT images: image noise decreased and overall image quality improved with deep learning reconstruction as compared to hybrid and model-based iterative reconstruction.
Abstract: Deep learning reconstruction (DLR) is a new reconstruction method; it introduces deep convolutional neural networks into the reconstruction flow. This study was conducted to examine the clinical applicability of abdominal ultra-high-resolution CT (U-HRCT) exams reconstructed with a new DLR in comparison to hybrid and model-based iterative reconstruction (hybrid-IR, MBIR). Our retrospective study included 46 patients seen between December 2017 and April 2018. A radiologist recorded the standard deviation of attenuation in the paraspinal muscle as the image noise and calculated the contrast-to-noise ratio (CNR) for the aorta, portal vein, and liver. The overall image quality was assessed by two other radiologists and graded on a 5-point confidence scale ranging from 1 (unacceptable) to 5 (excellent). CT images subjected to hybrid-IR, MBIR, and DLR were compared. The image noise was significantly lower and the CNR significantly higher on DLR than on hybrid-IR and MBIR images (p < 0.01). DLR images received the highest and MBIR images the lowest scores for overall image quality. DLR improved the quality of abdominal U-HRCT images. Key points: • The potential degradation due to increased noise may prevent implementation of ultra-high-resolution CT in the abdomen. • Image noise and overall image quality for hepatic ultra-high-resolution CT images improved with deep learning reconstruction as compared to hybrid- and model-based iterative reconstruction.
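For readers unfamiliar with the metrics, a small NumPy sketch of how image noise and CNR are computed from ROI masks; the exact CNR definition varies between studies, so this particular form is an assumption.

import numpy as np

def image_noise(img, muscle_mask):
    # Image noise: SD of attenuation inside the paraspinal-muscle ROI.
    return img[muscle_mask].std()

def cnr(img, organ_mask, muscle_mask):
    # One common CNR form: (organ mean - muscle mean) / muscle SD,
    # evaluated here for e.g. aorta, portal vein, or liver ROIs.
    return (img[organ_mask].mean() - img[muscle_mask].mean()) / img[muscle_mask].std()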

Journal ArticleDOI
TL;DR: In this article, the sparse data problem for image reconstruction in photoacoustics is investigated, and a fast and accurate image reconstruction algorithm is proposed for computed tomography with sparse data.
Abstract: The development of fast and accurate image reconstruction algorithms is a central aspect of computed tomography. In this paper, we investigate this issue for the sparse data problem in photoacoustics...

Proceedings ArticleDOI
01 Oct 2019
TL;DR: Pix2Vox, a novel framework for single-view and multi-view 3D reconstruction, introduces a context-aware fusion module to adaptively select high-quality reconstructions for each part from different coarse 3D volumes to obtain a fused 3D volume.
Abstract: Recovering the 3D representation of an object from single-view or multi-view RGB images by deep neural networks has attracted increasing attention in the past few years. Several mainstream works (e.g., 3D-R2N2) use recurrent neural networks (RNNs) to fuse multiple feature maps extracted from input images sequentially. However, when given the same set of input images with different orders, RNN-based approaches are unable to produce consistent reconstruction results. Moreover, due to long-term memory loss, RNNs cannot fully exploit input images to refine reconstruction results. To solve these problems, we propose a novel framework for single-view and multi-view 3D reconstruction, named Pix2Vox. By using a well-designed encoder-decoder, it generates a coarse 3D volume from each input image. Then, a context-aware fusion module is introduced to adaptively select high-quality reconstructions for each part (e.g., table legs) from different coarse 3D volumes to obtain a fused 3D volume. Finally, a refiner further refines the fused 3D volume to generate the final output. Experimental results on the ShapeNet and Pix3D benchmarks indicate that the proposed Pix2Vox outperforms state-of-the-art methods by a large margin. Furthermore, the proposed method is 24 times faster than 3D-R2N2 in terms of backward inference time. The experiments on ShapeNet unseen 3D categories have shown the superior generalization abilities of our method.
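A hedged PyTorch sketch of the context-aware fusion idea: score each coarse volume voxel-wise, softmax the scores across views, and blend, so the best-reconstructed parts of each view dominate. The scoring network below is a plausible shape, not the paper's exact module.

import torch
import torch.nn as nn

class ContextAwareFusion(nn.Module):
    def __init__(self, ch=8):
        super().__init__()
        self.score = nn.Sequential(             # voxel-wise quality scorer
            nn.Conv3d(1, ch, 3, padding=1), nn.ReLU(),
            nn.Conv3d(ch, 1, 3, padding=1),
        )

    def forward(self, volumes):                 # volumes: (B, V, D, H, W), V views
        b, v, d, h, w = volumes.shape
        s = self.score(volumes.reshape(b * v, 1, d, h, w)).reshape(b, v, d, h, w)
        weights = torch.softmax(s, dim=1)       # per-voxel weights across views
        return (weights * volumes).sum(dim=1)   # fused volume

Because the fusion is a symmetric weighted sum over views, the result is invariant to input ordering, addressing the RNN inconsistency the abstract points out.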

Journal ArticleDOI
01 May 2019
TL;DR: This study shows that an end-to-end encoder–decoder network can produce high-quality PET images at a fraction of the time required by conventional methods, and the network was successfully applied to real clinical data.
Abstract: The purpose of this research was to implement a deep learning network to overcome two of the major bottlenecks in improved image reconstruction for clinical positron emission tomography (PET). These are the lack of an automated means for the optimization of advanced image reconstruction algorithms, and the computational expense associated with these state-of-the art methods. We thus present a novel end-to-end PET image reconstruction technique, called DeepPET, based on a deep convolutional encoder–decoder network, which takes PET sinogram data as input and directly and quickly outputs high quality, quantitative PET images. Using simulated data derived from a whole-body digital phantom, we randomly sampled the configurable parameters to generate realistic images, which were each augmented to a total of more than 291,000 reference images. Realistic PET acquisitions of these images were simulated, resulting in noisy sinogram data, used for training, validation, and testing the DeepPET network. We demonstrated that DeepPET generates higher quality images compared to conventional techniques, in terms of relative root mean squared error (11%/53% lower than ordered subset expectation maximization (OSEM)/filtered back-projection (FBP)), structural similarity index (1%/11% higher than OSEM/FBP), and peak signal-to-noise ratio (1.1/3.8 dB higher than OSEM/FBP). In addition, we show that DeepPET reconstructs images 108 and 3 times faster than OSEM and FBP, respectively. Finally, DeepPET was successfully applied to real clinical data. This study shows that an end-to-end encoder–decoder network can produce high quality PET images at a fraction of the time compared to conventional methods.

Journal ArticleDOI
TL;DR: Quantification results show that the proposed iterative neural network method can outperform the neural network denoising and conventional penalized maximum likelihood methods.
Abstract: PET image reconstruction is challenging due to the ill-posedness of the inverse problem and the limited number of detected photons. Recently, deep neural networks have been widely and successfully used in computer vision tasks and have attracted growing interest in medical imaging. In this paper, we trained a deep residual convolutional neural network to improve PET image quality by using the existing inter-patient information. An innovative feature of the proposed method is that we embed the neural network in the iterative reconstruction framework for image representation, rather than using it as a post-processing tool. We formulate the objective function as a constrained optimization problem and solve it using the alternating direction method of multipliers algorithm. Both simulation data and hybrid real data are used to evaluate the proposed method. Quantification results show that our proposed iterative neural network method can outperform the neural network denoising and conventional penalized maximum likelihood methods.
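A rough sketch of the constrained formulation above, maximizing the log-likelihood subject to x = f(alpha), split with ADMM. Here f is a pretrained, frozen network whose input alpha is optimized, and em_update is a hypothetical placeholder for the data-fidelity (e.g., EM-like) step on the PET likelihood; both names are assumptions for illustration.

import torch

def admm_net_recon(x0, alpha0, net, em_update, rho=0.1, outer=20, inner=50):
    for p in net.parameters():
        p.requires_grad_(False)                 # network weights stay fixed
    x = x0.clone()
    alpha = alpha0.clone().requires_grad_(True)
    u = torch.zeros_like(x)                     # scaled dual variable
    opt = torch.optim.Adam([alpha], lr=1e-2)
    for _ in range(outer):
        x = em_update(x, net(alpha).detach() - u, rho)   # data-fidelity step
        for _ in range(inner):                           # fit f(alpha) to x + u
            loss = ((net(alpha) - (x + u)) ** 2).mean()
            opt.zero_grad(); loss.backward(); opt.step()
        u = u + x - net(alpha).detach()                  # dual ascent on x = f(alpha)
    return net(alpha).detach()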

Journal ArticleDOI
TL;DR: Quantification results based on simulation and real data show that the proposed reconstruction framework can outperform Gaussian post-smoothing and anatomically guided reconstructions using the kernel method or the neural-network penalty.
Abstract: Recently, deep neural networks have been widely and successfully applied in computer vision tasks and have attracted growing interest in medical imaging. One barrier for the application of deep neural networks to medical imaging is the need for large amounts of prior training pairs, which is not always feasible in clinical practice. This is especially true for medical image reconstruction problems, where raw data are needed. Inspired by the deep image prior framework, in this paper, we proposed a personalized network training method where no prior training pairs are needed, but only the patient’s own prior information. The network is updated during the iterative reconstruction process using the patient-specific prior information and measured data. We formulated the maximum-likelihood estimation as a constrained optimization problem and solved it using the alternating direction method of multipliers algorithm. Magnetic resonance imaging guided positron emission tomography reconstruction was employed as an example to demonstrate the effectiveness of the proposed framework. Quantification results based on simulation and real data show that the proposed reconstruction framework can outperform Gaussian post-smoothing and anatomically guided reconstructions using the kernel method or the neural-network penalty.
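A toy sketch of the deep-image-prior-style personalization described above: the patient's own MR image is the only network input, and the network is fitted to this patient's data with no prior training pairs. The tiny 2-D network and the squared-error data term are stand-ins (the paper uses the PET likelihood within ADMM).

import torch
import torch.nn as nn

torch.manual_seed(0)
net = nn.Sequential(                           # stand-in for the paper's network
    nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(),
    nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
    nn.Conv2d(32, 1, 3, padding=1),
)
mr_prior = torch.randn(1, 1, 128, 128)         # patient's own MR image as input
measured = mr_prior + 0.3 * torch.randn_like(mr_prior)  # toy "measured data"

opt = torch.optim.Adam(net.parameters(), lr=1e-3)
for step in range(500):
    pet = net(mr_prior)                        # image estimate = network(prior)
    loss = ((pet - measured) ** 2).mean()      # toy data-fit term
    opt.zero_grad(); loss.backward(); opt.step()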

Journal ArticleDOI
TL;DR: Results show that the proposed CNN method has better reconstruction results than LBP, Tikhonov, and Landweber, and the network has good generalization ability.
Abstract: Image reconstruction is a key problem for electrical resistance tomography (ERT). Because of the soft-field nature of ERT and the ill-posedness of the inverse problem, traditional image reconstruction methods cannot achieve high accuracy and are usually time consuming. Since deep learning is good at mapping complicated nonlinear functions, a deep learning method based on a convolutional neural network (CNN) is proposed for image reconstruction of ERT. To establish the database, 41,122 samples were generated with numerical simulations. 10-fold cross validation was used to divide all samples into a training set and a validation set. The network structure was based on LeNet and refined by applying a dropout layer and moving averages. After 346 training epochs, the image correlation coefficient (ICC) on the validation set was 0.95. When white Gaussian noise with a signal-to-noise ratio of 30, 40, and 50 was added to the validation set, the ICC was 0.79, 0.89, and 0.93, respectively, which demonstrates the anti-noise capability of the network. Reconstruction results on samples with more inclusions, different conductivities, and other shapes show that the network has good generalization ability. Furthermore, experimental data from a 16-electrode industrial ERT system were used to compare the accuracy of the proposed model with some typical reconstruction methods. Results show that the proposed CNN method has better reconstruction results than LBP, Tikhonov, and Landweber.
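The ICC quoted above is, in its common usage for ERT evaluation, the Pearson correlation between the reconstructed and reference conductivity images; a minimal NumPy version:

import numpy as np

def icc(recon, truth):
    # Image correlation coefficient: Pearson correlation of the two images.
    r = recon.ravel() - recon.mean()
    t = truth.ravel() - truth.mean()
    return float((r @ t) / (np.linalg.norm(r) * np.linalg.norm(t) + 1e-12))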

Journal ArticleDOI
01 Mar 2019
TL;DR: A deep convolutional neural network was trained to improve PET image quality by employing a perceptual loss based on features derived from a pretrained VGG network, instead of the conventional mean squared error, to preserve image details.
Abstract: Positron emission tomography (PET) is a functional imaging modality widely used in clinical diagnosis. In this paper, we trained a deep convolutional neural network to improve PET image quality. Perceptual loss based on features derived from a pretrained VGG network, instead of the conventional mean squared error, was employed as the training loss function to preserve image details. As the number of real patient data sets available for training is limited, we propose to pretrain the network using simulation data and fine-tune the last few layers of the network using real data sets. Results from simulation, real brain, and lung data sets show that the proposed method is more effective in removing noise than the traditional Gaussian filtering method.
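A hedged sketch of a VGG perceptual loss like the one described above; the choice of layer (here up to relu3_3) and the channel handling for grayscale PET slices are assumptions, and the weights string requires torchvision >= 0.13.

import torch
import torchvision.models as models

vgg = models.vgg16(weights="IMAGENET1K_V1").features[:16].eval()
for p in vgg.parameters():
    p.requires_grad_(False)                    # feature extractor stays frozen

def to_vgg_input(gray):
    # VGG expects 3-channel inputs; replicate the grayscale slice.
    return gray.repeat(1, 3, 1, 1)

def perceptual_loss(pred, target):
    # Compare feature maps instead of raw pixels: preserves structure and
    # detail that plain MSE training tends to blur away.
    return torch.nn.functional.mse_loss(
        vgg(to_vgg_input(pred)), vgg(to_vgg_input(target)))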

Journal ArticleDOI
TL;DR: A method combining affinity prediction with region agglomeration is presented, which improves significantly upon the state of the art of neuron segmentation from electron microscopy (EM) in accuracy and scalability, suggesting that a single method can be applied to both nearly isotropic block-face EM data and anisotropic serial sectioned EM data.
Abstract: We present a method combining affinity prediction with region agglomeration, which improves significantly upon the state of the art of neuron segmentation from electron microscopy (EM) in accuracy and scalability. Our method consists of a 3D U-Net, trained to predict affinities between voxels, followed by iterative region agglomeration. We train using a structured loss based on Malis, encouraging topologically correct segmentations obtained from affinity thresholding. Our extension consists of two parts: First, we present a quasi-linear method to compute the loss gradient, improving over the original quadratic algorithm. Second, we compute the gradient in two separate passes to avoid spurious gradient contributions in early training stages. Our predictions are accurate enough that simple learning-free percentile-based agglomeration outperforms more involved methods used earlier on inferior predictions. We present results on three diverse EM datasets, achieving relative improvements over previous results of 27, 15, and 250 percent. Our findings suggest that a single method can be applied to both nearly isotropic block-face EM data and anisotropic serial sectioned EM data. The runtime of our method scales linearly with the size of the volume and achieves a throughput of ~2.6 seconds per megavoxel, qualifying our method for the processing of very large datasets.
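To make "segmentations obtained from affinity thresholding" concrete, here is a small NumPy sketch that links voxels whose predicted affinity exceeds a threshold and labels the connected components with union-find; it is suitable for small volumes only, and the paper's percentile-based agglomeration would then merge the resulting fragments.

import numpy as np

def segment_from_affinities(aff, thr=0.9):
    # aff: (3, D, H, W) affinities to the previous voxel along z, y, x.
    d, h, w = aff.shape[1:]
    parent = np.arange(d * h * w)

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]      # path halving
            a = parent[a]
        return a

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb

    idx = np.arange(d * h * w).reshape(d, h, w)
    for c, (dz, dy, dx) in enumerate([(1, 0, 0), (0, 1, 0), (0, 0, 1)]):
        src = idx[dz:, dy:, dx:].ravel()                 # voxel
        dst = idx[:d - dz, :h - dy, :w - dx].ravel()     # its neighbor
        keep = aff[c, dz:, dy:, dx:].ravel() > thr
        for a, b in zip(src[keep], dst[keep]):
            union(a, b)                                  # link high-affinity pairs
    return np.array([find(a) for a in range(d * h * w)]).reshape(d, h, w)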

Journal ArticleDOI
TL;DR: A new framework for efficient recovery of image quality from sparse optoacoustic data based on a deep convolutional neural network is proposed and its performance with whole body mouse imaging in vivo is demonstrated.
Abstract: The rapidly evolving field of optoacoustic (photoacoustic) imaging and tomography is driven by a constant need for better imaging performance in terms of resolution, speed, sensitivity, depth and contrast. In practice, data acquisition strategies commonly involve sub-optimal sampling of the tomographic data, resulting in inevitable performance trade-offs and diminished image quality. We propose a new framework for efficient recovery of image quality from sparse optoacoustic data based on a deep convolutional neural network and demonstrate its performance with whole body mouse imaging in vivo. To generate accurate high-resolution reference images for optimal training, a full-view tomographic scanner capable of attaining superior cross-sectional image quality from living mice was devised. When provided with images reconstructed from substantially undersampled data or limited-view scans, the trained network was capable of enhancing the visibility of arbitrarily oriented structures and restoring the expected image quality. Notably, the network also eliminated some reconstruction artefacts present in reference images rendered from densely sampled data. No comparable gains were achieved when the training was performed with synthetic or phantom data, underlining the importance of training with high-quality in vivo images acquired by full-view scanners. The new method can benefit numerous optoacoustic imaging applications by mitigating common image artefacts, enhancing anatomical contrast and image quantification capacities, accelerating data acquisition and image reconstruction approaches, while also facilitating the development of practical and affordable imaging systems. The suggested approach operates solely on image-domain data and thus can be seamlessly applied to artefactual images reconstructed with other modalities.

Journal ArticleDOI
TL;DR: In this paper, the authors evaluated the generalization ability of learned image reconstruction with respect to deviations in the acquisition settings between training and testing, and provided an outlook for the potential of transfer learning to fine-tune trainings to a particular target application using only a small number of training cases.
Abstract: Purpose: Although deep learning has shown great promise for MR image reconstruction, an open question regarding the success of this approach is the robustness in the case of deviations between training and test data. The goal of this study is to assess the influence of image contrast, SNR, and image content on the generalization of learned image reconstruction, and to demonstrate the potential for transfer learning. Methods: Reconstructions were trained from undersampled data using data sets with varying SNR, sampling pattern, image contrast, and synthetic data generated from a public image database. The performance of the trained reconstructions was evaluated on 10 in vivo patient knee MRI acquisitions from 2 different pulse sequences that were not used during training. Transfer learning was evaluated by fine-tuning baseline trainings from synthetic data with a small subset of in vivo MR training data. Results: Deviations in SNR between training and testing led to substantial decreases in reconstruction image quality, whereas image contrast was less relevant. Trainings from heterogeneous training data generalized well toward the test data with a range of acquisition parameters. Trainings from synthetic, non-MR image data showed residual aliasing artifacts, which could be removed by transfer learning-inspired fine-tuning. Conclusion: This study presents insights into the generalization ability of learned image reconstruction with respect to deviations in the acquisition settings between training and testing. It also provides an outlook for the potential of transfer learning to fine-tune trainings to a particular target application using only a small number of training cases.
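A minimal sketch of the fine-tuning step described in the conclusion: freeze a baseline trained on synthetic data and update only the final layers with a few in vivo cases. The two-part model is a hypothetical stand-in for the trained reconstruction network.

import torch
import torch.nn as nn

class ReconNet(nn.Module):                     # hypothetical stand-in network
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(nn.Conv2d(1, 32, 3, padding=1), nn.ReLU())
        self.head = nn.Conv2d(32, 1, 3, padding=1)
    def forward(self, x):
        return self.head(self.backbone(x))

model = ReconNet()                             # in practice: load the synthetic baseline
for p in model.backbone.parameters():
    p.requires_grad_(False)                    # freeze early layers
opt = torch.optim.Adam(model.head.parameters(), lr=1e-4)  # fine-tune the head only
# ...then run a short training loop on the small in vivo training subset.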

Journal ArticleDOI
TL;DR: A new online PnP algorithm based on the proximal gradient method (PGM) is introduced; it uses only a subset of measurements at every iteration, which makes it scalable to very large datasets.
Abstract: Plug-and-play priors (PnP) is a powerful framework for regularizing imaging inverse problems by using advanced denoisers within an iterative algorithm. Recent experimental evidence suggests that PnP algorithms achieve the state-of-the-art performance in a range of imaging applications. In this paper, we introduce a new online PnP algorithm based on the proximal gradient method (PGM). The proposed algorithm uses only a subset of measurements at every iteration, which makes it scalable to very large datasets. We present a new theoretical convergence analysis, for both batch and online variants of PnP-PGM, for denoisers that do not necessarily correspond to proximal operators. We also present simulations illustrating the applicability of the algorithm to image reconstruction in diffraction tomography. The results in this paper have the potential to expand the applicability of the PnP framework to very large datasets.
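A hedged NumPy sketch of the online PnP-PGM idea for a linear model y = A x: each iteration takes a gradient step using only a random minibatch of measurements, then applies the plug-in denoiser in place of a proximal operator.

import numpy as np

def online_pnp_pgm(y, A, denoise, x0, step=1e-3, iters=100, batch=8, seed=0):
    rng = np.random.default_rng(seed)
    x, m = x0.copy(), len(y)
    for _ in range(iters):
        idx = rng.choice(m, size=batch, replace=False)          # measurement subset
        grad = (m / batch) * A[idx].T @ (A[idx] @ x - y[idx])   # unbiased gradient estimate
        x = denoise(x - step * grad)                            # denoiser as implicit prior
    return x

Because only `batch` rows of A are touched per iteration, the per-iteration cost is independent of the total number of measurements, which is what makes the method scalable to very large datasets.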

Journal ArticleDOI
TL;DR: A domain-progressive 3D residual convolutional network (DP-ResNet) is proposed for the LDCT imaging procedure; it contains three stages: a sinogram domain network (SD-net), filtered back projection (FBP), and an image domain network (ID-net).
Abstract: The wide applications of X-ray computed tomography (CT) make low-dose CT (LDCT) a clinical prerequisite, but reducing the radiation exposure in CT often leads to significantly increased noise and artifacts, which might lower the judgment accuracy of radiologists. In this paper, we put forward a domain-progressive 3D residual convolutional network (DP-ResNet) for the LDCT imaging procedure that contains three stages: the sinogram domain network (SD-net), filtered back projection (FBP), and the image domain network (ID-net). Though both are based on the residual network structure, the SD-net and ID-net provide complementary effects on improving the final LDCT quality. The experimental results with both simulated and real projection data show that this domain-progressive deep-learning network achieves significantly improved performance by combining the network processing in the two domains.

Journal ArticleDOI
01 Mar 2019
TL;DR: A deep-neural-network-enabled sinogram synthesis method for sparse-view CT is introduced and shown to outperform existing interpolation methods as well as the iterative image reconstruction approach.
Abstract: Recently, a number of approaches to low-dose computed tomography (CT) have been developed and deployed in commercialized CT scanners. Tube current reduction is perhaps the most actively explored technology with advanced image reconstruction algorithms. Sparse data sampling is another viable option for low-dose CT, and sparse-view CT has been of particular interest among researchers in the CT community. Since analytic image reconstruction algorithms would lead to severe image artifacts, various iterative algorithms have been developed for reconstructing images from sparsely view-sampled projection data. However, iterative algorithms take much longer computation time than the analytic algorithms, and images are usually prone to different types of image artifacts that heavily depend on the reconstruction parameters. Interpolation methods have also been utilized to fill the missing data in the sinogram of sparse-view CT, thus providing synthetically full data for analytic image reconstruction. In this paper, we introduce a deep-neural-network-enabled sinogram synthesis method for sparse-view CT, and show that it outperforms the existing interpolation methods and also the iterative image reconstruction approach.
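For contrast with the learned synthesis, here is the kind of simple view-interpolation baseline the paper compares against: missing projection views are filled by linear interpolation along the angle axis before analytic reconstruction (the paper's network replaces this step).

import numpy as np

def interpolate_sinogram(sino_sparse, factor):
    # sino_sparse: (n_views_sparse, n_detectors); factor: view downsampling factor.
    n_sparse, n_det = sino_sparse.shape
    full = np.empty(((n_sparse - 1) * factor + 1, n_det))
    for i in range(n_sparse - 1):
        for j in range(factor):
            w = j / factor                     # linear weight between adjacent views
            full[i * factor + j] = (1 - w) * sino_sparse[i] + w * sino_sparse[i + 1]
    full[-1] = sino_sparse[-1]
    return full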

Journal ArticleDOI
TL;DR: A coarse-to-fine learning framework consisting of three convolutional networks is proposed for real-time detailed 3D face reconstruction from monocular video as well as from a single image.
Abstract: With the power of convolutional neural networks (CNNs), CNN-based face reconstruction has recently shown promising performance in reconstructing detailed face shapes from 2D face images. The success of CNN-based methods relies on a large amount of labeled data. The state of the art synthesizes such data using a coarse morphable face model, which, however, has difficulty generating detailed photo-realistic images of faces (with wrinkles). This paper presents a novel face data generation method. Specifically, we render a large number of photo-realistic face images with different attributes based on inverse rendering. Furthermore, we construct a fine-detailed face image dataset by transferring different scales of details from one image to another. We also construct a large number of video-type adjacent frame pairs by simulating the distribution of real video data. (All these coarse-scale and fine-scale photo-realistic face image datasets can be downloaded from https://github.com/Juyong/3DFace.) With these nicely constructed datasets, we propose a coarse-to-fine learning framework consisting of three convolutional networks. The networks are trained for real-time detailed 3D face reconstruction from monocular video as well as from a single image. Extensive experimental results demonstrate that our framework can produce high-quality reconstructions with much less computation time compared to the state of the art. Moreover, our method is robust to pose, expression, and lighting due to the diversity of data.

Proceedings ArticleDOI
01 Oct 2019
TL;DR: The λ-net, which reconstructs hyperspectral images from a single-shot measurement, can finish the reconstruction task in under a second instead of the hours taken by the most recently proposed DeSCI algorithm, thus speeding up the reconstruction >1000 times.
Abstract: We propose the λ-net, which reconstructs hyperspectral images (e.g., with 24 spectral channels) from a single shot measurement. This task is usually termed snapshot compressive-spectral imaging (SCI), which enjoys low cost, low bandwidth, and a high-speed sensing rate via capturing the three-dimensional (3D) signal, i.e., (x, y, λ), using a 2D snapshot. Though proposed more than a decade ago, the poor quality and low speed of reconstruction algorithms preclude wide applications of SCI. To address this challenge, in this paper, we develop a dual-stage generative model to reconstruct the desired 3D signal in SCI, dubbed λ-net. Results on both simulation and real datasets demonstrate the significant advantages of λ-net, which leads to >4 dB improvement in PSNR for real-mask-in-the-loop simulation data compared to the current state-of-the-art. Furthermore, λ-net can finish the reconstruction task within sub-seconds instead of the hours taken by the most recently proposed DeSCI algorithm, thus speeding up the reconstruction >1000 times.

Journal ArticleDOI
TL;DR: This paper proposes a neighborly de-convolutional neural network (NbNet) to reconstruct face images from their deep templates and demonstrates the need to secure deep templates in face recognition systems.
Abstract: State-of-the-art face recognition systems are based on deep (convolutional) neural networks. Therefore, it is imperative to determine to what extent face templates derived from deep networks can be inverted to obtain the original face image. In this paper, we study the vulnerabilities of a state-of-the-art face recognition system based on template reconstruction attack. We propose a neighborly de-convolutional neural network (NbNet) to reconstruct face images from their deep templates. In our experiments, we assumed that no knowledge about the target subject and the deep network are available. To train the NbNet reconstruction models, we augmented two benchmark face datasets (VGG-Face and Multi-PIE) with a large collection of images synthesized using a face generator. The proposed reconstruction was evaluated using type-I (comparing the reconstructed images against the original face images used to generate the deep template) and type-II (comparing the reconstructed images against a different face image of the same subject) attacks. Given the images reconstructed from NbNets, we show that for verification, we achieve a TAR of 95.20 percent (58.05 percent) on LFW under type-I (type-II) attacks @ FAR of 0.1 percent. Besides, 96.58 percent (92.84 percent) of the images reconstructed from templates of partition fa (fb) can be identified from partition fa in color FERET. Our study demonstrates the need to secure deep templates in face recognition systems.

Journal ArticleDOI
TL;DR: Experimental results obtained on clinical patient datasets demonstrate that the proposed deep learning-based strategy for MBIR can achieve promising gains over existing algorithms for LdCT image reconstruction in terms of noise-induced artifact suppression and edge detail preservation.
Abstract: Reducing the exposure to X-ray radiation while maintaining a clinically acceptable image quality is desirable in various CT applications. To realize low-dose CT (LdCT) imaging, model-based iterative reconstruction (MBIR) algorithms are widely adopted, but they require proper prior knowledge assumptions in the sinogram and/or image domains and involve tedious manual optimization of multiple parameters. In this paper, we propose a deep learning (DL)-based strategy for MBIR to simultaneously address prior knowledge design and MBIR parameter selection in one optimization framework. Specifically, a parameterized plug-and-play alternating direction method of multipliers (3pADMM) is proposed for the general penalized weighted least-squares model, and then, by adopting the basic idea of DL, the parameterized plug-and-play (3p) prior and the related parameters are optimized simultaneously in a single framework using a large number of training data. The main contribution of this paper is that the 3p prior and the related parameters in the proposed 3pADMM framework can be supervised and optimized simultaneously to achieve robust LdCT reconstruction performance. Experimental results obtained on clinical patient datasets demonstrate that the proposed method can achieve promising gains over existing algorithms for LdCT image reconstruction in terms of noise-induced artifact suppression and edge detail preservation.