Seismic Dataset Curation from UK National Data Repository to Validate SAGE and WISE
| Title | Seismic Dataset Curation from UK National Data Repository to Validate SAGE and WISE |
| Publication Type | Conference |
| Year of Publication | 2025 |
| Authors | Bhar, I, Huseyin Tuna Erdinc, Souza, T, Rafael Orozco, Felix J. Herrmann |
| Conference Name | ML4SEISMIC Partners Meeting |
| Month | 11 |
| Keywords | Bayesian inference, generative model, Imaging, ML4SEISMIC, North-sea data, RTM, SLIM, WISE |
| Abstract | This work presents a curated seismic dataset pipeline developed for data residing in the UK National Data Repository (UK NDR) in support of advanced generative geophysical modeling and machine learning. The workflow integrates checkshots and 3D post-stack seismic data volumes to generate accurate depth-domain seismic sections. Custom interpolation algorithms are used to construct velocity models from checkshot data, which are then used for time-to-depth conversion with [OpendTect]https://dgbes.com/software/opendtect. The converted seismic cubes are subsequently processed in Python using [SegySAK]https://segysak.readthedocs.io/en/latest/, a python package, enabling efficient extraction and visualization of 2D lines. These datasets will be used to further validate the workflow of SAGE (Subsurface foundational model for AI-driven Geostatistical Extraction), which focuses on large-scale representation learning for the geosciences. The trained generative models will be used as input to WISE and ASPIRE that form foundation models for seismic inference designed to characterize the subsurface. This curated dataset pipeline streamlines seismic data preparation, enhances reproducibility, and bridges the gap between conventional geophysical workflows and emerging data-driven inference methods. |
| URL | https://slim.gatech.edu/Publications/Public/Conferences/ML4SEISMIC/2025/bhar2025ML4SEISMICsdc |
| Citation Key | bhar2025ML4SEISMICsdc |
