--- title: Compressive time-lapse seismic monitoring of carbon storage and sequestration with the joint recovery model author: | Ziyi Yin^1^, Mathias Louboutin^2^, Felix J. Herrmann^1,2^ \ ^1^ School of Computational Science and Engineering, Georgia Institute of Technology \ ^2^ School of Earth and Atmospheric Sciences, Georgia Institute of Technology \ bibliography: - yin2021SEGcts.bib --- ## Summary \vspace*{-0.3cm} Time-lapse seismic monitoring of carbon storage and sequestration is often challenging because the time-lapse signature of the growth of CO2 plumes is weak in amplitude and therefore difficult to detect seismically. This situation is compounded by the fact that the surveys are often coarsely sampled and not replicated to reduce costs. As a result, images obtained for different vintages (baseline and monitor surveys) often contain artifacts that may be attributed wrongly to time-lapse changes. To address these issues, we propose to invert the baseline and monitor surveys jointly. By using the joint recovery model, we exploit information shared between multiple time-lapse surveys. Contrary to other time-lapse methods, our approach does not rely on replicating the surveys to detect time-lapse changes. To illustrate this advantage, we present a numerical sensitivity study where CO2 is injected in a realistic synthetic model. This model is representative of the geology in the southeast of the North Sea, an area currently considered for carbon sequestration. Our example demonstrates that the joint recovery model improves the quality of time-lapse images allowing us to monitor the CO2 plume seismically. \vspace*{-0.45cm} ## Introduction \vspace*{-0.3cm} Time-lapse seismic imaging technology has recently been deployed by Carbon Capture and Storage (CCS) practitioners to monitor CO2 dynamics in porous media over long periods of time, with the Sleipner project in Norway as a pioneer [@ARTS2003347;@FURRE20173916]. There have been several major developments to adapt compressive sensing [@janiszewski2014improvements;@herrmann2008GJInps] and machine learning [@bharadwaj2020symae;@kaur2020time] to improve the productivity of (time-lapse) seismic data acquisition [@oghenekohwo2016GEOPctl;@wason2016GEOPctl] and imaging [@oghenekohwo2015CSEGctl] by exploiting similarities between baseline and monitor surveys. Recent work by @li2020coupled indicates the potential benefits of a coupled-physics framework where the slow-time subsurface CO2 flow, a rock physics model, and the fast-time wave physics are combined to better understand the dynamics of CO2 plumes and their effect on time-lapse seismic data for the duration of a CCS project. Because the time-lapse signal of CO2 plumes is weak, it continues to pose challenges on the robustness of time-lapse imaging [@lumley1997assessing]\. To address some of these challenges, @oghenekohwo2016GEOPctl;@wason2016GEOPctl proposed the joint recovery model (JRM) designed to reap the benefits of low-cost randomized non-replicated acquisition. It takes advantage of the fact that time-lapse seismic data and subsurface structure undergoing localized changes share information among the different vintages. In this extended abstract, we present a synthetic case study to make the case that the dynamics of CO2 plumes can indeed be monitored with active-source surface seismic. The presented work can be considered as a continuation of recent contributions by @li2020coupled\, who considered seismic monitoring in a much simpler geological setting involving cross-well tomography. In our study, we consider a 2D subsection of the BG Compass model with a geology representative of the southeast of the North Sea. Blunt sandstones in this area are considered as possible reservoirs for injection of CO2. Aside from assessing the sensitivity of seismic imaging towards monitoring CO2 plumes, we also study the effects of changes in the acquisition and the presence of noise, and how the JRM can be deployed to mitigate these effects of incomplete and non-replicated data. Our contributions are organized as follows. First, we briefly discuss full-waveform inversion and its linearization as part of time-lapse seismic imaging. Next, we introduce our joint inversion framework that explores commonalities between seismic images associated with the different vintages collected for the duration of a CCS project. We conclude by discussing a realistic time-lapse imaging experiment juxtaposing images obtained by inverting the vintages independently or jointly with JRM. \vspace*{-0.45cm} ## Methodology \vspace*{-0.3cm} We first briefly introduce the theoretical framework on which our wave-equation based monitoring framework for CCS is based. While we understand that more complex wave physics, e.g. elastic [@thomsen1986weak] and nonlinear inversions [@Li11TRfrfwi], may eventually be needed, we begin by deriving a model based on the scalar wave equation parameterized by the squared slowness. We start by introducing the objective for full-waveform inversion and its linearization with respect to an assumed given smooth background model. Based on this linearization, we propose a joint recovery model, which we solve computationally efficiently with linearized Bregman iterations [@yin2008bregman;@witte2018cls]\. \vspace*{-0.15cm} ### Full-waveform inversion (FWI) and its linearization \vspace*{-0.15cm} To derive a joint wave-based seismic monitoring system, we start by introducing the objective of full-waveform inversion for the baseline (``j=1``) and multiple monitor surveys (``2 \leq j \leq n_v``) with ``n_v\geq 2`` as the number of vintages. For each vintage, this data misfit objective reads ```math #eqfwi \underset{\mathbf{m}_j}{\operatorname{min}}\quad \|\mathbf{d}_j-\mathcal{F}_j(\mathbf{m}_j)\|_2^2 \quad \text{for}\quad j=\{1,2,\cdots,n_v\}. ``` In this expression, ``\mathcal{F}_j`` is the nonlinear forward modeling operator, which for given sources generates synthetic data at the receiver locations from the discretized time-lapse model parameters ``\mathbf{m}_j``. The observed data is, for each vintage, collected in the vector ``\mathbf{d}_j``. When provided with background velocity models ``\mathbf{\overline{m}}_j`` for each vintage, the above forward model can be linearized yielding ```math #eqgaussnewton \underset{\mathbf{\delta m}_j}{\operatorname{min}} \quad \|\mathbf{\delta d}_j-\nabla \mathcal{F}_j(\mathbf{\overline{m}}_j)\mathbf{\delta m}_j\|_2^2, ``` where ``\mathbf{\delta d}_j=\mathbf{d}_j-\mathcal{F}_j(\mathbf{\overline{m}}_j)`` are the linearized datasets for each vintage and ``\nabla \mathcal{F}_j`` the linearized forward operator relating perturbations in the squared slowness for each vintage, ``\mathbf{\delta m}_j``, to linearized data ``\mathbf{\delta d}_j``\. \vspace*{-0.15cm} ### Time-lapse monitoring with the joint recovery model \vspace*{-0.15cm} There exists an extensive literature on wave-based time-lapse imaging aimed at producing images that show time-lapse differences in the image space by carrying out inversion rather than imaging [@qu2017simultaneous;@yang2016time;@queisser2013full;@maharramov2019integrated]. A prominent example of such inversion method is formed by time-lapse monitoring via double differences [@zhang2013double;@yang2015double] where differences are obtained by inverting differences between the monitor and baseline residuals rather than between observed and synthetic monitoring data. While this type of approach can lead to good results, it assumes the baseline and monitor surveys to be relatively noise free, well sampled, and above all replicated---i.e. the acquisition geometries between different surveys need to be identical. In our approach, we build on a low-cost formulation for time-lapse seismic monitoring designed to handle low-cost non-replicated sparsely sampled surveys. The basic idea of this approach is to focus on what is shared amongst the different vintages rather than focusing on what is different. We argue that time-lapse monitoring benefits from such an approach if the Earth model being monitored undergoes relatively localized changes, an assumption that likely holds during CCS. To arrive at a successful joint inversion scheme, we augment the block diagonal linear system undergirding independent imaging (by minimizing Equation #eqgaussnewton independently for each vintage) with an extra column and a corresponding unknown common component. With these steps, we write ```math #eqjrm \mathbf{A} = \begin{bmatrix} \frac{1}{\gamma}\nabla\mathcal{F}_1(\mathbf{\overline{m}}_1) & \nabla\mathcal{F}_1(\mathbf{\overline{m}}_1) & \mathbf{0} &\mathbf{0} \\ \cdots & \mathbf{0} & \cdots & \mathbf{0} \\ \frac{1}{\gamma}\nabla\mathcal{F}_{n_v}(\mathbf{\overline{m}}_{n_v}) & \mathbf{0} & \mathbf{0} & \nabla\mathcal{F}_{n_v}(\mathbf{\overline{m}}_{n_v}) \end{bmatrix}. ``` This matrix ``\mathbf{A}`` relates the linearized data for each vintage ``\{\mathbf{\delta d}_j\}_{j=1}^{n_v}``, collected in the vector ``\mathbf{b} = \left[\mathbf{\delta d}_1^\top, \cdots, \mathbf{\delta d}_{n_v}^\top\right]^\top``, to the common component ``\mathbf{z}_0`` and innovations with respect to this common component ``\{\mathbf{z}_j\}_{i=1}^{n_v}`` all collected in the vector ``\mathbf{z} = \left[\mathbf{z}_0^\top,\cdots,\mathbf{z}_{n_v}^\top\right]^\top``. This formulation emphasizes what is common amongst the vintages when ``\gamma`` is close to ``0``, given that ``0<\gamma 170``mD) Bunter Sandstone of about ``300-500``m thick and that serves as the CO2 reservoir (red area in Figure #fig:perm, which includes location of the injection well, denoted by the white ``\times``, and the production well denoted by the yellow ``\bullet``); *(ii)* the primary seal (permeability ``10^{-4}-10^{-2}``mD) made of the Rot Halite Member, which is ``50``m thick and is typically continuous (black layer in Figure #fig:perm); *(iii)* the secondary seal made of the Haisborough group, which is ``>300``m thick and consists of low-permeable (permeability ``15-18``mD) mudstones. By assuming a linear relationship between the compressional wavespeed and permeability in each stratigraphic section, we arrive at the synthetic permeability model plotted in Figure #fig:perm\. To convert wavespeed to permeability, we assume for each section that an increase of ``1``km/s in the compressional wavespeed corresponds to an increase of ``1.63``mD in the permeability. Given these values for the permeability, we derive values for the porosity using the Kozeny-Carman equation [@costa2006permeability] ``K = \mathbf{\phi}^3 \left(\frac{1.527}{0.0314*(1-\mathbf{\phi})}\right)^2``, where ``K`` and ``\phi`` denote permeability (mD) and porosity (%) with constants taken from the Strategic UK CCS Storage Appraisal Project report\. ### Figure: {#fig:perm} ![](figs/permeability.png){width=80% #fig:permeability} :Permeability of 2D slice of BG Compass model including injection (white ``\times``) and production well (yellow ``\bullet``). \vspace*{-0.15cm} ### Simulation of CO2 dynamics \vspace*{-0.15cm} We use the open-source software [FwiFlow](https://github.com/lidongzh/FwiFlow.jl) [@li2020coupled] to numerically simulate the growth of the CO2 plume by solving the partial differential equations for two-phase flow for a period of ``60`` years with a time step of ``20`` days and a grid spacing of ``25``m. While our numerical simulations are in 2D, we assume the permeability model to extend by ``1.6`` km in the third (perpendicular) direction. The reservoir is initially filled with saline water, with a density of ``1.053\mathrm{g/cm}^3`` and a viscosity of ``1.0`` centipoise (cP). We inject CO2 at a constant rate of ``7`` Mt/y for ``60`` years, amounting to a total of ``420`` million metric tons. During the injection, we assume the supercritical CO2 to have a density of ``776.6 \mathrm{g/cm}^3`` and a viscosity of ``0.1``cP suggested by [@li2020coupled] and the Strategic UK CCS Storage Appraisal Project. This two-phase flow simulation provides us with a map of the CO2 concentration (in percentage) at the aformentioned time steps. We adopt the patchy saturation model [@avseth2010quantitative;@li2020coupled] to convert these CO2 concentrations to decreases in bulk moduli and then to decreases in compressional wavespeeds, which are included in Figure #fig:plume\. From this figure, we can see how the plume develops over time under influence of buoyancy and the production well to the right. We also observe that the plume gives rise to a difference in p-wavespeed between ``50``m/s to ``300``m/s, which should in principle be detectable via seismic monitoring. ### Figure: {#fig:plume} ![](figs/plume.png){width=100%} :Simulated time-lapse compressional velocity decrease after ``\{15,30,45,60\}`` years of CO2 injection. \vspace*{-0.15cm} ### Time-lapse data acquisition \vspace*{-0.15cm} We collect five surveys over a period of ``60`` years, with a baseline survey conducted before CO2 injection starts, followed by ``4`` monitor surveys taken after ``\{15,30,45,60\}`` years of injection. We propose a low-cost sparse non-replicated acquisition scheme with compressive sensing, capable of producing time-lapse seismic data with a high degree of repeatability without relying on replicating the surveys. To keep the costs down, we work with relatively sparsely sampled ocean bottom hydrophones connected to underwater buoys located above the ocean bottom. Compared to ocean bottom nodes, this type of acquisition avoids picking up coherent noise related to interface waves while it shares the advantage of being static and therefore highly replicable. To avoid aliasing, we jitter sample [@herrmann2008GJInps] the receiver positions of ``64`` hydrophones located at a depth of ``120``m with an average spacing of ``250``m horizontally. To reduce the costs on the source side, we propose non-replicated continuous simultaneous source shooting, which after deblending [@li2013joint;@xuan2020deblending] yields ``1272`` sources with a dense source sampling of ``12.5``m horizontally. To mimic the challenges related to active-source acquisition, we vary for each survey the tow-depth of sources uniformly randomly from ``5``m to ``15``m. We also apply random horizontal shifts varying uniformly in a range of ``-6``m to ``6``m, between sources in the different surveys. Since the receivers are coarsely sampled, we employ source-receiver reciprocity during imaging to further reduce computational costs. \vspace*{-0.15cm} ### Time-lapse monitoring with the joint recovery model \vspace*{-0.15cm} As stated earlier, we assume to have access to kinematically correct background velocity models ``\{\mathbf{\overline{m}}\}_{j=1}^n``. To demonstrate what is in principle achievable under ideal wave-physics, we generate linear data for each vintage via ``\mathbf{\delta d}_j=\nabla\mathcal{F}_j(\mathbf{\overline{m}_j}),\, j=1\cdots n_v``. We implemented both inversion schemes with [JUDI](https://github.com/slimgroup/JUDI.jl) -- the Julia Devito Inversion framework [@witte2018alf]\. This open-source package uses the highly optimized time-domain finite-difference propagators of [Devito](https://github.com/devitocodes/devito) [@luporini2018aap;@louboutin2018dae]\. We use a grid spacing of ``10``m during the wave-equation simulations and a Ricker wavelet with a peak frequency of ``25``Hz. Despite committing the inversion crime by generating data with the demigration operator, our monitoring problem is complicated by a rather complex but realistic geology (shown in Figure #fig:perm) and by the fact that the sampling is coarse on the receiver site and not replicated at the source site. To increase realism, we also use a random trace estimation technique during imaging. During random trace estimation, memory use is reduced drastically, a prerequisite for scaling our approach to ``3``D. Using the open-source software [TimeProbeSeismic.jl](https://github.com/slimgroup/TimeProbeSeismic.jl)\, we chose ``32`` random probing vectors to approximate our gradient calculations, which leads to noisy imaging artifacts. We refer to @louboutin2021ultra and to the proceedings of this conference for further details. \vspace*{-0.15cm} ### Independent recovery \vspace*{-0.15cm} We first recover the velocity perturbations in each vintage independently. During each Bregman iteration, we randomly select eight shots without replication during imaging, while we select ``16`` during the first iteration to build up a good starting image. We use the suggested steplength of @lorenz2014linearized and the ``90``th percentile of ``|\mathbf{u}_k|`` at the first iteration as the threshold ``\lambda``. For each vintage, we do ``23`` Bregman iterations, which amounts to three data passes. To observe the growth of the CO2 plume seismically, we plot the differences of the recovered images for each monitor survey with respect to the baseline in Figure #fig:LSRTMdiff\. These difference are computed via ``\mathbf{x}_j-\mathbf{x}_1`` for ``j=2,\cdots,n_v`` where ``\mathbf{x}_j`` are the images recovered independently. From these images, we observe that time-lapse signal is visible albeit it is weak while the overall image is plagued by strong artifacts, making it difficult to monitor the CO2 plume. To quantify the degree of repeatability of the time-lapse images, we follow @kragh2002seismic and use the normalized root mean square (NRMS) values defined as ```math #eq:NRMS NRMS(\mathbf{x}_1,\mathbf{x}_j)=\frac{200\times RMS(\mathbf{x}_1-\mathbf{x}_j)}{RMS(\mathbf{x}_1)+RMS(\mathbf{x}_j)}, \quad j=2,\cdots,n_v. ``` By definition, NRMS values range from ``0`` to ``200`` in percentage where a smaller value indicates recovery results are more repeatable, with ``10`` percent as acceptable by today's best 4D practices. These NRMS values are calculated only using the regions without time-lapse difference. ### Figure: {#fig:LSRTMdiff} ![](figs/independent.png){width=100% } :Difference plots w.r.t. the baseline survey yielded by independent recovery. The NRMS values computed for each pair are ``\{18.38\%, 15.74\%, 26.63\%, 16.55\%\}``. \vspace*{-0.15cm} ### Joint recovery \vspace*{-0.15cm} Finally, we recover the common component and innovations jointly via Equation #eqjrm with the same number of data passes (same number of PDE solves) and the same random subsets of shots as during the independent recovery. For each vintage, a different threshold ``\lambda_j`` is chosen based on the ``90``th percentile of the corresponding subset of ``|\mathbf{u}_k|`` at the first iteration. Figure #fig:JRMdiff shows the difference plots. Thanks to the JRM, we observe significantly fewer artifacts except for some artifacts in the middle due to overlapping shots during the early iterations, a problem that can easily be fixed. The improved quality of the difference plots is also reflected in the NMRS values, which are now well within the acceptable range. The scripts to reproduce the experiments in this expanded abstract are made available on the [SLIM](https://slim.gatech.edu) GitHub page [https://github.com/slimgroup/Software.SEG2021](https://github.com/slimgroup/Software.SEG2021). ### Figure: {#fig:JRMdiff} ![](figs/jrm.png){width=100% } :Difference plots w.r.t. the baseline survey yielded by joint recovery. The NRMS values computed for each pair are ``\{9.07\%, 8.83\%, 9.62\%, 11.23\%\}``. \vspace*{-0.45cm} ## Discussion and conclusions \vspace*{-0.3cm} Time-lapse seismic monitoring of carbon storage and sequestration is difficult due to the fact that the velocity changes induced by the growing CO2 plume are relatively small, making it challenging to detect these changes seismically especially when the surveys are poorly sampled and not replicated. By using the joint recovery model, we demonstrably overcome this problem by producing significant improvements in the difference plots associated with a realistic 60 year sequestration scenario. While similar improvements by the joint recovery model have been reported in the literature, the benefit of this approach has not yet been demonstrated for multiple monitor surveys. Neither has this model been used to monitor the development of a CO2 plume in a realistic geological setting in the North Sea that is currently being considered as a site for CO2 sequestration. While these initial results yielding NMRS values of ``10``\% on average are encouraging, many challenges remain in the development of a low-cost 3D seismic monitoring system designed to control and optimize the CO2 sequestration while minimizing its risk. \vspace*{-0.45cm} ## Acknowledgement \vspace*{-0.3cm} We would like to thank Charles Jones for the constructive discussion and thank BG Group for providing the Compass model. The CCS project information is taken from the Strategic UK CCS Storage Appraisal Project, funded by DECC, commissioned by the ETI and delivered by Pale Blue Dot Energy, Axis Well Technology and Costain. The information contains copyright information licensed under [ETI Open Licence](https://s3-eu-west-1.amazonaws.com/assets.eti.co.uk/legacyUploads/2016/04/ETI-licence-v2.1.pdf). This research was carried out with the support of Georgia Research Alliance and partners of the ML4Seismic Center.