Conference Agenda

Overview and details of the sessions of this conference. Please select a date or location to show only sessions at that day or location. Please select a single session for detailed view (with abstracts and downloads if available).

 
 
Session Overview
Session
14.c) Data Meets Earth: AI-Driven Innovations in Geoscience
Time:
Wednesday, 25/Sept/2024:
2:30pm - 4:00pm

Session Chair: Stefan Broda, Federal Institute for Geosciences and Natural Resources (BGR)
Session Chair: Marco Brysch, Bundesanstalt für Geowissenschaften und Rohstoffe
Session Chair: Jewgenij Torizin, Bundesanstalt für Geowissenschaften und Rohstoffe
Session Chair: Simon Müller, Bundesanstalt für Materialforschung und -prüfung
Location: Saal Rotterdam

60 PAX
Session Topics:
14.c) Data Meets Earth: AI-Driven Innovations in Geoscience

Show help for 'Increase or decrease the abstract text size'
Presentations
2:30pm - 2:45pm
ID: 299 / LeS 13 Mi - 14.c: 1
Topics: 14.c) Data Meets Earth: AI-Driven Innovations in Geoscience

Machine Learning Ensembles for Probabilistic Segmentation of Pores in Electron Microscopy

Marco Brysch1, Ben Laurich1, Monika Sester2

1Bundesanstalt für Geowissenschaften und Rohstoffe, Germany; 2Institute of Cartography and Geoinformatics, Gottfried Wilhelm Leibniz University, Hannover

The structural integrity of geological materials are closely tied to their porosity. Accurate knowledge of microspores in potential host rocks such as Opalinus Clay is essential for assessing their physical properties, including permeability and strength. Traditional methods for porosity analysis, such as mercury intrusion porosimetry (MIP) and gas pycnometry, provide valuable quantitative data on porosity and pore size distribution but do not offer insights into pore morphology or spatial distribution.

A methodological advancement comes with the combination of broad ion beam (BIB) milling and scanning electron microscopy (SEM), which allows for the visualization of pores at the nanoscale and facilitates detailed analysis of pore structures. However, segmenting pores from BIB-SEM images poses challenges due to the complexity of the images and the variability in pore shapes and sizes. This task is further complicated by the limited resolution of SEM and the subjective nature of manual pore identification.

To address these challenges, machine learning (ML) has emerged as a useful tool for automating the segmentation of pores from BIB-SEM images. We explore the use of conditional random fields (CRFs) as an ensemble method that improves segmentation by utilizing spatial and contextual information within the images. CRFs enhance segmentation accuracy and offer a robust framework for integrating results from multiple ML-classifiers. This probabilistic approach not only refines the segmentation accuracy but also enables the assessment of uncertainty levels in segmented pores, which is beneficial for accurately interpreting the microstructural properties.



2:45pm - 3:00pm
ID: 179 / LeS 13 Mi - 14.c: 2
Topics: 14.c) Data Meets Earth: AI-Driven Innovations in Geoscience

Denoising of Seismic Waveform Data and its Impact on the Analysis of North Korean Nuclear Tests

Peter Gaebler, Andreas Steinberg, Gernot Hartmann, Johanna Lehr, Christoph Pilger

BGR Hannover, Germany

In the past years numerous machine learning based applications have been introduced to the field of seismology. These applications for example address issues such as earthquake detection, event classification, feature extraction and waveform data analysis.

In this study we focus on the denoising of waveform data by separating the seismic signal from different noise sources. Machine learning models are able to recognize noise patterns and can effectively suppress unwanted noise, enhancing the quality of the waveform signals. A deep learning based denoising autoencoder algorithm is tested on regional and teleseismic seismological and hydroacoustic datasets, which are compiled from the International Monitoring System of the Comprehensive Nuclear-Test-Ban Treaty Organisation. We focus on seismic and hydroacoustic stations which can be relevant to investigate North Korean nuclear tests.

We investigate the performance of different denoising autoencoder models, for short- and long waveform periods, trained on the complete station network as well as on individual stations. We investigate if the denoised waveform signals are useful for seismic source analysis and if the denoised waveforms can reliably be used in further analysis steps, such as the comparison of computed array beams, seismic phase picking and or amplitude estimation.

The declared North Korean nuclear tests are a suitable benchmark test set, as they have been extensively researched and their source type and location can be assumed known. Further the verification of the source type is of particular interest for potential nuclear tests under international law.



3:00pm - 3:15pm
ID: 219 / LeS 13 Mi - 14.c: 3
Topics: 14.c) Data Meets Earth: AI-Driven Innovations in Geoscience

A machine-learning based monitoring system for local seismic events in Germany

Catalina Ramos, Stefanie Donner, Klaus Stammler

BGR Hannover, Germany

Monitoring local seismic events is among the responsibilities of the German Federal Seismic Survey. This entity comprises a seismological subdepartment responsible for overseeing the operations of the German Regional Seismic Network and a data center tasked with collecting, archiving, and distributing continuous seismological and infrasound waveform data. As the amount of recorded seismic data dramatically increases every year, the imperative for an appropriate automatic real-time monitoring system becomes apparent. Leveraging advances in deep-learning methods in seismology, we develop a Python wrapper for the automatic estimation of hypocenter, magnitude and first-motion polarity in real time. To assess the performance of our algorithm, we compare the resulting event locations with catalogs of manually located events, with promising outcomes.



3:15pm - 3:30pm
ID: 170 / LeS 13 Mi - 14.c: 4
Topics: 14.c) Data Meets Earth: AI-Driven Innovations in Geoscience

Enhancing Model Transparency in Geothermal Settings: Clustering to Reduce Aleatoric Uncertainty

Magued Al-Aghbary1,2, Mohamed Sobh3, Christian Gerhards1

1TU Bergakademie Freiberg, Freiberg, Germany; 2Centre d’Etudes et de Recherche de Djibouti, Dschibuti; 3Leibniz Institute for Applied Geophysics, Hannover, Germany

Aleatoric uncertainty, inherent in the variability of data itself, presents a significant challenge in predictive modeling, especially in scenarios with intrinsic randomness and noise. Traditionally viewed as irreducible, this type of uncertainty fundamentally limits the precision of predictions, as it is directly tied to the stochastic nature of the underlying data. However, this research proposes a methodology that combines clustering with subsequent predictive modeling to mitigate the effects of aleatoric uncertainty, thereby enhancing the transparency and reliability of model outputs. Our approach begins with a clustering process, where data points are grouped based on similarity in features to form homogeneous subsets. Following clustering, we employ quantile random forests on each subset rendering the modeling tailored to each cluster's specific characteristics. This strategy allows for the models to not only be more sensitive to the subtle nuances within a group but also more robust against the noise inherent in the dataset. Finally, we estimate heat flow over continental Africa. Through extensive quantitative analysis, this study demonstrates that while aleatoric uncertainty is indeed irreducible from a theoretical standpoint, practical interventions like quality data acquisition combined with clustering can effectively diminish its impact on predictive accuracy.



3:30pm - 3:45pm
ID: 284 / LeS 13 Mi - 14.c: 5
Topics: 14.c) Data Meets Earth: AI-Driven Innovations in Geoscience

Semantic segmentation as a part of geological mapping using artificially blended texture dataset

Jewgenij Torizin1, Nick Schüßler1, Michael Fuchs1, Dirk Kuhn1, Karsten Schütze2, Steffen Prüfer1, Claudia Gunkel1

1Bundesanstalt für Geowissenschaften und Rohstoffe, Germany; 2Landesamt für Umwelt, Naturschutz und Geologie, Mecklenburg-Vorpommern, Germany

Geological mapping is essential for understanding the Earth's surface and subsurface structures, aiding resource exploration, environmental monitoring, and hazard assessment. Semantic segmentation, a computer vision technique, has shown promise in automating geological mapping processes by classifying image pixels into meaningful categories. This study explores the integration of semantic segmentation into geological mapping workflows by leveraging an artificially blended texture dataset.

Traditional geological mapping relies on extensive fieldwork in combination with manual aerial or satellite imagery interpretation, which can be time-consuming and subjective. Semantic segmentation can efficiently classify geological features by learning distinctive patterns and textures from data. However, obtaining high-quality datasets for this purpose is challenging due to the heterogeneous nature of geological formations and limited ground truth data.

We address this challenge by employing an artificially blended texture dataset that combines real-world geological textures. This blended dataset aims to enrich the training data with diverse texture and geological feature combinations, potentially enhancing the model's ability to generalize to unseen terrain conditions. Moreover, it reduces the potential for label bias by eliminating the need for manual delineation of label classes in the image, instead relying on generated borders.

Through experimental evaluation, we explore the effectiveness of semantic segmentation with the blended texture dataset in accurately delineating geological units and structures. We also discuss the implications of incorporating semantic segmentation into geological mapping workflows at the Baltic cliff coast, including its potential for improving mapping efficiency, reducing human bias, and facilitating remote sensing data integration with geological interpretations.



3:45pm - 4:00pm
ID: 463 / LeS 13 Mi - 14.c: 6
Topics: 14.c) Data Meets Earth: AI-Driven Innovations in Geoscience

Advancing Short-Term Groundwater Level Forecasting Using Temporal Fusion Transformer (TFT) and Neural Hierarchical Interpolation for Time Series Forecasting (N-HiITS)

Stefan Kunz1, Alexander Schulz2, Maximilian Nölscher1, Maria Wetzel1, Teodor Chiaburu2, Felix Biessmann2, Stefan Broda1

1Bundesanstalt für Geowissenschaften und Rohstoffe (BGR); 2Berliner Hochschule für Technik (BHT)

Machine learning approaches are increasingly used to predict groundwater levels, with local models for single monitoring wells currently being state of the art. Global models enable training and forecasting at multiple monitoring wells simultaneously, incorporating dynamic (e.g., meteorological) and static (e.g., hydro(geo)logical) features. These models can generalize predictions to wells with similar site characteristics and offer computational scaling benefits by requiring only one model for a larger area.

This study presents two global machine-learning models for short-term groundwater level prediction (up to 12 weeks): the Temporal Fusion Transformer (TFT) and Neural Hierarchical Interpolation for Time Series Forecasting (N-HiTS). The TFT combines recurrent neural networks with an attention mechanism and can determine the significance of individual input variables (feature importance). The N-HiTS model uses a fork architecture with multiple stacks to model different data frequencies, enhancing prediction accuracy.

We used a dataset of approximately 5300 monitoring wells across Germany, with groundwater levels from 1990 to 2016 (around 4.5 million values). Input features included groundwater levels, meteorological parameters, and site-specific environmental features such as hydro(geo)logical, soil, and spatial characteristics.

The TFT model showed a median NSE of 0.34, while the N-HiTS model performed better with a median NSE of 0.5 for the 12-week forecast. Around 25% of the test sites achieved an NSE over 0.68. Key features for forecast quality included historical groundwater levels, precipitation, the standard deviation of groundwater levels, and major hydrogeological districts. The topographical wetness index was the most important static feature, though its impact on model performance was minimal.