Conference on Artificial Intelligence in Materials Science and Engineering - AI MSE 2023
Lecture
22.11.2023
Enhancing Semantic Segmentation in High-Resolution TEM Images through Pretraining on Unlabeled Data
BK

Dr.-Ing. Bashir Kazimi

Forschungszentrum Jülich GmbH

Kazimi, B. (Speaker)¹; Sandfeld, S.²
¹Forschungszentrum Jülich GmbH, Aachen; ²Forschungszentrum Juelich, Aachen
Vorschau
20 Min. Untertitel (CC)

In the field of transmission electron microscopy (TEM), accurate  analysis of acquired data often lags behind the advancements in  acquisition methods. Manual tailoring of image processing techniques to  individual datasets hinders progress in data interpretation. To address  this challenge, we propose a study focused on improving the performance  of deep learning models for semantic segmentation in high-resolution TEM  images by leveraging pretraining on unlabeled images.

Drawing insights from previous works, our study encompasses two key  stages: denoising and semantic segmentation. For denoising, we employ  two distinct approaches. Firstly, an encoder-decoder model is trained to  reconstruct clean TEM images from noisy inputs generated by applying  Poisson noise, as described in [1]. Secondly, we utilize the  conditional Generative Adversarial Network (GAN) model, Pix2Pix [2], to  train a generator capable of producing clean TEM images from noisy  inputs, while a discriminator learns to distinguish between the  generated and original clean images.

The pretrained denoising models, both the encoder-decoder and the  generator, are then fine-tuned on annotated TEM images for semantic  segmentation. Our study reveals several notable findings. Firstly, finetuning semantic segmentation models using pretrained denoising  models accelerates convergence, reducing the time and computational  resources required for training. Secondly, this approach proves to be  effective with smaller datasets, highlighting the potential for  leveraging limited annotated data in TEM image analysis. Additionally,  our approach facilitates comparable results across TEM datasets with  varying magnifications and resolutions, obviating the need for  customizing architectures to accommodate different receptive field  sizes which is extensively studied in [3].

By combining denoising and semantic segmentation through pretraining on  unlabeled data, our study offers a practical solution for enhancing the  analysis of high-resolution TEM images. These findings have significant  implications for the TEM community, as they enable faster and more  accurate interpretation of TEM data, while also reducing the resource  requirements for model training.



References
[1] L., Gambini; T., Mullarkey, L., Jones; S., Sanvito. Machine-learning approach for quantified resolvability enhancement of low-dose STEM data. Machine Learning: Science and Technology, 2023, 4(1), p.015025.
[2] P., Isola; J.Y., Zhu; T., Zhu; A.A., Efros. Image-to-image translation with conditional adversarial networks. CVPR, 2017, pp. 1125-1134
[3] K., Sytwu; C., Groschner; M.C., Scott. Understanding the influence of receptive field and network complexity in neural network-guided TEM image analysis JMicroscopy and Microanalysis, 2022, 28(6), pp. 1896-1904.

Abstract

Abstract

Erwerben Sie einen Zugang, um dieses Dokument anzusehen.

© 2026