---
title: Photoacoustic imaging with conditional priors from normalizing flows
author:
  - name: Rafael Orozco
    email: rorozco@gatech.edu
    affiliation: Georgia Institute of Technology
  - name: Ali Siahkoohi
    email: alisk@gatech.edu
    affiliation: Georgia Institute of Technology
  - name: Gabrio Rizzuti
    email: g.rizzuti@uu.nl
    affiliation: Utrecht University
  - name: Tristan van Leeuwen
    email: t.van.leeuwen@cwi.nl
    affiliation: Centrum Wiskunde & Informatica
  - name: Felix J. Herrmann
    email: felix.herrmann@gatech.edu
    affiliation: Georgia Institute of Technology
runninghead: Conditional flows for photoacoustic imaging
bibliography:
  - deep_inverse_2021.bib
---

## Abstract:
For many ill-posed inverse problems, such as photoacoustic imaging, the uncertainty of the solution is strongly affected by measurement noise and data incompleteness (due, for example, to limited aperture). For these problems, the choice of prior information is a crucial aspect of a computationally effective solution scheme. We propose a regularization scheme for photoacoustic imaging that leverages prior information learned by a generative network. We train conditional normalizing flows on pairs of photoacoustic sources (the unknowns of the problem) and the associated data, in order to learn the posterior distribution of the solution. The learned prior is combined with physics-based optimization (enforced by partial differential equations), according to the deep prior framework, in order to achieve robustness with respect to out-of-distribution data. Numerical evidence suggests the superiority of this approach over non-conditional deep priors, and its ability to retrieve features of the unknowns that are typically challenging for limited-view photoacoustics.

## Introduction
Photoacoustic imaging is a biomedical imaging technique based on the so-called photoacoustic effect [@xu2006photoacoustic]. It leverages the interplay between optics and acoustics as a means to circumvent the limitations of imaging modalities that rely on a single type of physics. Light beams generated by a pulsed laser can penetrate biological tissues by several centimeters, and are absorbed at rates determined by oxygen saturation or hemoglobin concentration. While optical absorption is in principle an ideal parameter for medical imaging (e.g., for the detection of cancerous tissue), strong scattering imposes severe limitations on its imaging resolution. Ultrasound, on the other hand, can theoretically provide resolution of diagnostic value, but it produces images of mechanical properties whose contrasts are not sensitive enough for detecting early-stage tumors. In photoacoustics, optical and acoustic effects are combined to get the best of both worlds. Under conditions of thermal and stress confinement, where the light pulse duration is negligible with respect to the heat diffusion and stress propagation time scales, thermal energy can efficiently build up in biological tissues, which in turn undergo thermal expansion and effectively act as a spatially distributed acoustic source. In photoacoustic imaging, the actual object of interest is this induced source: it is directly related to optical absorption, and it can be recovered from the acquired ultrasonic data at a higher resolution than purely optical imaging allows. Photoacoustic imaging thus boils down to an initial value problem.
The pressure field ``p``, a function of space and time, obeys the acoustic wave equation:
```math #eq:wave-equation
\left\{
\begin{array}{rcl}
c^{-2}\partial_{tt}p-\Delta p & = & 0,\\
\partial_t p|_{t=0} & = & 0,\\
p|_{t=0} & = & p_0.
\end{array}
\right.
```
We assume that the velocity parameter ``c`` is a positive constant (though this can be easily extended to the inhomogeneous case). We are interested in retrieving the initial pressure distribution ``p_0`` from acoustic data. However, due to the intrinsic limitations of optics, the viable imaging depth for human tissue is limited to roughly 1-4 cm, so that the receiver geometry typically has a one-sided view of the object [as for planar sensors, @jathoul2015deep]. This restricted view poses serious challenges for photoacoustic imaging.

Due to measurement incompleteness, one has to resort to prior information. In a Bayesian setting, we posit a data likelihood ``\mathbf{d}=A\mathbf{x}+\pmb{\varepsilon}``, where the Gaussian white noise ``\pmb{\varepsilon}`` is independent of ``\mathbf{x}``, which represents the initial pressure distribution ``p_0`` appearing in equation (#eq:wave-equation). The linear operator ``A`` comprises the solution of equation (#eq:wave-equation) followed by restriction to the receiver positions. The prior information is encapsulated by a probability density ``p_{\mathrm{prior}}``. The solution is estimated by solving the problem:
```math #eq:minimization
\min_{\mathbf{x}}\dfrac{1}{2}\left\lVert A\mathbf{x}-\mathbf{d}\right\rVert^2-\lambda\log p_{\mathrm{prior}}(\mathbf{x}),
```
for a given weighting parameter ``\lambda``, to be properly tuned. For example, the compressed sensing paradigm enforces sparsity of the unknown in some transformed domain via ``\ell^1``-norm regularization. This class of "hand-crafted" priors is well studied from a theoretical point of view [@candes2006stable], but such priors typically do not take advantage of domain-specific knowledge of the problem. In recent years, many applications of deep learning to linear inverse problems have dealt with priors implicitly defined by neural networks, e.g. by pre-training a generative network on a collection of candidate solutions [@bora2017compressed], or, without training, by directly exploiting the network inductive bias that favors natural-looking images [@ulyanov2018deep].

Our proposal is a deep prior approach to photoacoustic imaging, similar to @bora2017compressed\, where the generative model is now trained on joint pairs of photoacoustic sources and associated measurements. The network is structured as a conditional normalizing flow [@ardizzone2019guided;@winkler2019learning], so that it is able to generate samples from the posterior distribution of ``\mathbf{x}`` given data ``\mathbf{y}``. For out-of-distribution data ``\mathbf{y}``, however, the learned posterior distribution does not adequately represent the analytical posterior. We then resort to the physics-based formulation in equation (#eq:minimization), in combination with the learned conditional prior, to compute the maximum a posteriori (MAP) estimate. Our approach takes advantage of the favorable trade-off between representation error and overfitting for invertible networks when compared to other generative models [@asim2020invertible;@kelkar2021compressible].
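To make the role of the prior in equation (#eq:minimization) concrete, the following minimal sketch solves it by gradient descent on a toy problem in Julia, the language of our software stack. The random matrix ``A``, the problem sizes, and the Gaussian choice ``-\log p_{\mathrm{prior}}(\mathbf{x})=\lVert\mathbf{x}\rVert^2/2`` are illustrative stand-ins, not the wave-equation operator or the learned priors discussed below:

```julia
# Toy version of equation (#eq:minimization): a random underdetermined matrix A
# stands in for wave propagation + receiver restriction, and the prior is Gaussian.
using LinearAlgebra, Random

Random.seed!(0)
n, m = 64, 32                      # unknowns and (incomplete) measurements
A = randn(m, n)                    # stand-in for the wave-equation operator
x_true = randn(n)
d = A * x_true + 0.1 * randn(m)    # noisy data, d = Ax + ε

λ = 0.1
∇J(x) = A' * (A * x - d) + λ * x   # gradient of ½‖Ax − d‖² + (λ/2)‖x‖²

x = zeros(n)
η = 1.0 / (opnorm(A)^2 + λ)        # step size from a Lipschitz bound
for k in 1:500
    x .-= η .* ∇J(x)               # in-place update (safe at global scope)
end
@show norm(x - x_true) / norm(x_true)
```

With a Gaussian prior the objective is strongly convex, so plain gradient descent converges; the interesting case, addressed next, is when the quadratic penalty is replaced by a learned prior.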
## Related work
The regularization scheme described in this paper follows the deep prior literature that originated with the work of @bora2017compressed\, which consists of reparameterizing the inverse problem unknowns by the input of a pre-trained generator. A closely related method is the deep image prior of @ulyanov2018deep\, which aims to counteract the potential representation error that might affect a pre-trained network. Contrary to @bora2017compressed\, the unknowns are now the network parameters, randomly initialized, while the input remains fixed. This method must resort to an early-stopping strategy to avoid overfitting, and relies solely on the regularization effect of the network inductive bias. The complementary strengths of these two approaches can be combined in a hybrid scheme, e.g. by using image-adaptive priors [@hussein2020image]. Our proposal is similar to the deep prior application to magnetic resonance imaging discussed in @kelkar2021compressible\, which uses invertible networks and further regularization of the latent-space variables. The advantages of using normalizing flows in the deep prior context were originally advocated by @asim2020invertible\. We extend the normalizing flow approach by using conditional architectures, as described in @ardizzone2019guided and @winkler2019learning\. Normalizing flows have been the subject of a great deal of research; we refer to @kobyzev2020normalizing and @papamakarios2019normalizing for general overviews. The network architecture analyzed in this paper is HINT, discussed in @ardizzone2018analyzing (and later generalized to the conditional setting in @ardizzone2019guided\). An application of conditional normalizing flows in variational inference was recently discussed in @siahkoohi2021AABIpto\.

## Method
The proposed method aims at regularizing the inverse problem in equation (#eq:minimization) with a learned prior. The first stage of the algorithm consists of training a generator given access to a dataset of joint pairs ``\mathbf{x},\mathbf{y}\sim p_{X,Y}(\mathbf{x},\mathbf{y})``, where ``\mathbf{y}=A^T\mathbf{d}`` is time-reversed data (referred to as the adjoint solution). We train a normalizing flow ``f_{\pmb{\theta}}:X\times Y\to Z_x\times Z_y`` by minimizing the Kullback-Leibler divergence between the target density ``p_{X,Y}`` and the pull-back of the standard Gaussian distribution defined on the latent space ``Z_x\times Z_y`` via ``f_{\pmb{\theta}}`` [@kobyzev2020normalizing;@papamakarios2019normalizing]:
```math #eq:KLmin
\min_{\pmb{\theta}}\mathrm{KL}(p_{X,Y}\,\lVert\,p_{\pmb{\theta}})=\mathbb{E}_{\mathbf{x},\mathbf{y}\sim p_{X,Y}(\mathbf{x},\mathbf{y})}\left[\dfrac{1}{2}\left\lVert f_{\pmb{\theta}}(\mathbf{x},\mathbf{y})\right\rVert^2-\log\left|\det\mathrm{J}_{f_{\pmb{\theta}}}(\mathbf{x},\mathbf{y})\right|\right].
```
Here, ``\mathrm{J}_{f_{\pmb{\theta}}}`` denotes the Jacobian of ``f_{\pmb{\theta}}`` with respect to the input. Following @ardizzone2019guided\, we impose a triangular structure on the normalizing flow, i.e. ``f_{\pmb{\theta}}(\mathbf{x},\mathbf{y})=(f^{z_x}_{\pmb{\theta}}(\mathbf{x},\mathbf{y}),f^{z_y}_{\pmb{\theta}}(\mathbf{y}))``. We choose the conditional architecture of the HINT invertible network, whose original (non-conditional) form was proposed in @ardizzone2018analyzing\.
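To illustrate the objective in equation (#eq:KLmin), the sketch below trains a deliberately simple triangular affine flow, ``\mathbf{z}_y=e^{s_y}\odot\mathbf{y}+b_y`` and ``\mathbf{z}_x=e^{s_x}\odot\mathbf{x}+W\mathbf{y}+b_x``, in place of the conditional HINT network, so that ``\log|\det\mathrm{J}|`` is available in closed form as ``\sum s_x+\sum s_y``. All names, sizes, and the synthetic pairs are assumptions made for the example; automatic differentiation is delegated to Zygote:

```julia
# Toy instance of the KL objective: a triangular affine flow with analytic logdet.
using Zygote, Random

Random.seed!(0)
nx = ny = 8; nbatch = 256

# synthetic correlated pairs standing in for (source x, adjoint data y)
X = randn(nx, nbatch)
Y = 0.5 .* X .+ 0.1 .* randn(ny, nbatch)

function kl_loss(sx, bx, W, sy, by)
    Zx = exp.(sx) .* X .+ W * Y .+ bx   # x-branch, conditioned on y
    Zy = exp.(sy) .* Y .+ by            # y-branch, depends on y only
    # E[½‖f(x,y)‖²] − log|det J|, estimated over the batch
    (sum(abs2, Zx) + sum(abs2, Zy)) / (2 * nbatch) - sum(sx) - sum(sy)
end

function train(θ; steps = 500, lr = 0.05)
    for _ in 1:steps
        g = gradient(kl_loss, θ...)
        θ = map((p, gp) -> p .- lr .* gp, θ, g)   # plain gradient step
    end
    return θ
end

sx, bx, W, sy, by = train((zeros(nx), zeros(nx), zeros(nx, ny), zeros(ny), zeros(ny)))
```

In our actual experiments the flow is the conditional HINT network provided by InvertibleNetworks.jl, but the training loop has the same structure: evaluate the Gaussian misfit of the latent codes, subtract the log-determinant, and update the weights.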
After an ideal optimization of equation (#eq:KLmin), sampling from the posterior distribution ``\mathbf{x}\sim p_{X|Y}(\mathbf{x}|\mathbf{y})`` amounts to computing the latent data code ``\mathbf{z}_y=f^{z_y}_{\pmb{\theta}}(\mathbf{y})`` and evaluating ``f^{-1}_{\pmb{\theta}}(\mathbf{z}_x,\mathbf{z}_y)`` for random draws of ``\mathbf{z}_x``. This first phase does not require repeated evaluation of the physical model ``A`` appearing in equation (#eq:minimization), which helps relieve the computational burden of solving partial differential equations at scale.

When faced with out-of-distribution data, we can reasonably assume that the results obtained in this way are not as reliable as for in-distribution data. The second stage of the proposed method then relies on the physics model in equation (#eq:minimization) and uses the pre-trained generator as an implicit deep prior. Formally, we end up solving the problem:
```math #eq:deepprior
\min_{\mathbf{z}_x}\dfrac{1}{2}\left\lVert A\,f^{-1}_{\pmb{\theta}}(\mathbf{z}_x,\mathbf{z}_y)-\mathbf{d}\right\rVert^2+\dfrac{\lambda^2}{2}\left\lVert\mathbf{z}_x\right\rVert^2,\quad\mathbf{z}_y=f^{z_y}_{\pmb{\theta}}(A^T\mathbf{d}).
```
Note that the minimization is carried out over the latent variables associated with the unknowns ``\mathbf{x}``. The computational effort of evaluating ``A`` and its adjoint is manageable when a single measurement vector ``\mathbf{d}`` is considered. A sketch of both stages on the toy example introduced above is given below.
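Continuing the toy flow from the previous sketch, the following lines illustrate both stages: posterior sampling by inverting the flow for random ``\mathbf{z}_x``, and the latent-space optimization of equation (#eq:deepprior). As before, ``A`` is a small random matrix standing in for the wave-equation operator, and all names are illustrative:

```julia
using Zygote

# the flow is triangular, so conditioning on y is equivalent to fixing z_y = f(y);
# we invert the x-branch of the toy flow directly
finv(zx, y) = exp.(-sx) .* (zx .- W * y .- bx)

# stage one: posterior sampling for one observation y, drawing z_x ~ N(0, I)
y = Y[:, 1]
posterior_samples = [finv(randn(nx), y) for _ in 1:100]

# stage two: latent-space MAP estimate with the physics operator in the loop
A = randn(4, nx)                      # stand-in for wave propagation + restriction
d = A * X[:, 1] + 0.05 * randn(4)
λ = 0.1
J(zx) = sum(abs2, A * finv(zx, y) - d) / 2 + λ^2 * sum(abs2, zx) / 2

zx = zeros(nx)
for _ in 1:300
    zx .-= 0.05 .* gradient(J, zx)[1]   # in-place gradient descent step
end
x_map = finv(zx, y)
```

Note that each gradient step of the second stage requires one application of ``A`` and one of its adjoint, which becomes the dominant cost when ``A`` involves a wave-equation solve.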
## Results
The results related to the training of the generative model are collected in Figure #posterior-samples\. The training set consists of images of blood vessel systems ``\mathbf{x}`` and time-reversed data ``\mathbf{y}=A^T\mathbf{d}`` measured by receivers located at the top of the image [@dataset]. The measurements are generated synthetically by solving the wave equation and adding Gaussian white noise. We qualitatively compare samples drawn from the learned posterior distribution for in- and out-of-distribution data.

### Figure: {#posterior-samples}
![](figs/dist=in_sample_i=1hint3_samples.png){width=50% #posterior-samples-in}
![](figs/dist=out_sample_i=1hint3_samples.png){width=50% #posterior-samples-out}
:Samples from the posterior distribution learned in the pre-training phase. Here, we display the results for: (a) in-distribution data (e.g. test data); (b) out-of-distribution data. Due to limited aperture (the receivers are located at ``z=0``), the areas of highest (relative) uncertainty correspond to the vertical features of the blood vessel system.

In Figure #reconstruction-results\, we compare the reconstruction results for out-of-distribution data obtained by solving the problem in equation (#eq:minimization) with different choices of prior. In particular, we consider a non-informative Gaussian prior, a learned prior trained on samples ``\mathbf{x}`` alone, and a learned prior trained on joint pairs ``(\mathbf{x},\mathbf{y})`` (our proposal). A qualitative inspection and quality metrics indicate superior results for the conditional prior approach. A further comparison of the results in terms of data fit is shown in Figure #reconstruction-results-data\.

### Figure: {#reconstruction-results}
![](figs/gt_sol.png){width=25% #gtruth}
![](figs/gauss_sol.png){width=25% #gausspr}
![](figs/noncond_sol.png){width=25% #noncond}
![](figs/cond_sol.png){width=25% #cond}
:Reconstruction results for photoacoustic imaging. We compare the ground truth (a) with MAP estimates obtained with different priors: (b) Gaussian prior, (c) learned (marginal) prior ``p_X(\mathbf{x})``, (d) learned conditional prior ``p_{X,Y}(\mathbf{x},\mathbf{y})``.

### Figure: {#reconstruction-results-data}
![](figs/data_map_gauss_comp.png){width=33% #data-cm}
![](figs/data_map_noncond_comp.png){width=33% #data-cm-res}
![](figs/data_map_cond_comp.png){width=33% #data-map}
:Data fit for some of the reconstruction results shown in Figure #reconstruction-results\: (a) Gaussian prior, (b) learned (marginal) prior ``p_X(\mathbf{x})``, (c) learned conditional prior ``p_{X,Y}(\mathbf{x},\mathbf{y})``.

## Discussion and conclusions
In limited-view photoacoustic imaging, the resolution of deep near-vertical structures in a blood vessel system is a fundamental challenge due to their relatively weak imprint on the measurements. Note that similar issues are common in other wave-equation-based imaging applications, such as geophysical prospecting [@tarantola2005inverse]. Conventional regularization methods based on sparsity in some transformed domain are typically too generic and practically unable to fill this sensitivity gap. A straightforward alternative is to use problem-specific information.

We proposed a regularization scheme for photoacoustic imaging that employs deep priors learned in an offline phase. The generative model is trained on a dataset containing pairs of candidate solutions and associated measurements; the goal is to learn the posterior distribution of the solution given some data. The primary scope of this work is to compare the regularization effect of conditional deep priors with the "marginal" deep priors proposed in the recent past for many imaging applications. By "marginal" priors, we refer to generative models trained on the marginal distribution of the unknowns (i.e. without considering the associated data). The results presented in this paper suggest that the MAP estimate based on conditional deep priors is potentially able to recover the vertical features of a blood vessel image. Our results also show that the marginal deep prior is more prone to misplacing features, for both in- and out-of-distribution data. In our view, learned prior regularization can be beneficial for highly ill-posed problems such as limited-view photoacoustic imaging. In order to avoid misleading reconstructions affected by "hallucinations", however, we must further assess the strong bias imposed by this type of learned prior.

## Data and software
The dataset used in this work is a derivative of the ELCAP lung dataset prepared by @dataset\. The results presented here use a Julia implementation of invertible network architectures: [InvertibleNetworks.jl](https://github.com/slimgroup/InvertibleNetworks.jl) [@invnet].