cff-version: 1.2.0 abstract: "
This dataset contains 4,847 NetCDF files generated with the Delft Advanced Research Terra Simulator (DARTS). Each file represents a distinct high-resolution reservoir simulation, designed for machine learning research in carbon storage and reservoir engineering. The simulations include pressure, temperature, saturations, flow fields, permeability, porosity, and production variables. This is the dataset used on the publication Integrating Score-Based Diffusion Models with Machine Learning-Enhanced Localization for Advanced Data Assimilation in Geological Carbon Storage, which is still preprint.
" authors: - family-names: Serrao Seabra given-names: Gabriel orcid: "https://orcid.org/0009-0002-0558-8117" - family-names: Vossepoel given-names: Femke orcid: "https://orcid.org/0000-0002-3391-6651" - family-names: Voskov given-names: Denis orcid: "https://orcid.org/0000-0002-5399-1755" - family-names: Mücke given-names: Nikolaj orcid: "https://orcid.org/0000-0002-2635-9586" - family-names: Silva given-names: Vinicius orcid: "https://orcid.org/0000-0001-7282-0524" - family-names: Emerick given-names: Alexandre orcid: "https://orcid.org/0000-0002-4921-4902" title: "Dataset for the publication "Integrating Score-Based Diffusion Models with Machine Learning-Enhanced Localization for Advanced Data Assimilation in Geological Carbon Storage"" keywords: version: 1 identifiers: - type: doi value: 10.4121/a8ad7808-b923-4335-ba7a-898c8c1232be.v1 license: CC BY 4.0 date-released: 2025-10-13