(375p) Quantitative Evaluation of Catalyst Shape in SEM Images Using Gan and Semi-Supervised Semantic Segmentation
AIChE Annual Meeting
2024
2024 AIChE Annual Meeting
Computing and Systems Technology Division
Interactive Session: Data and Information Systems
Tuesday, October 29, 2024 - 3:30pm to 5:00pm
The application of AI to unstructured data research faces challenges, particularly in the early stages due to the high costs associated with acquiring large-scale training datasets. In fields that demand specialized knowledge, such as chemical engineering, access to high-quality image data is exceedingly limited. Furthermore, the quantity of SEM images highlighting shape is particularly scarce, posing a significant constraint on dataset construction. To address these challenges, our research employs Generative Adversarial Networks (GAN) to generate high-quality SEM image data from a limited number of original SEM images [3]. The training data for the GAN model consists of actual images obtained through web crawling on Google, figure images extracted from catalysis literature, and synthetic images created using 3D rendering techniques. Various preprocessing techniques were implemented, such as inpainting to restore text or annotations by leveraging surrounding pixel information, mirror extrapolatation for image size adjustment through symmetrical extension, and augmentation to increase dataset volume, were applied [4]. The prepared data were trained on a DCGAN (Deep Convolutional Generative Adversarial Network) model, employing feature matching and minibatch discrimination techniques, to generate an extensive dataset [5]. Subsequently, the quality of the generated dataset was validated using the Frechet Inception Distance (FID) score.
The dataset generated by the GAN model comprises unlabeled data, while the synthetic dataset created using 3D rendering technology is automatically labeled during its creation process, resulting in a dataset partially consisting of unlabeled and labeled data. This data configuration is suitable for semi-supervised learning approaches. In this research, semi-supervised semantic segmentation techniques were applied, utilizing a method that integrates the U-Net with the Mean Teacher method. This approach segmented the catalyst particles in Scanning Electron Microscopy (SEM) images into six distinct 3D shapes: cube, tetrahedron, octahedron, dodecahedron, icosahedron, and sphere. The accuracy of this segmentation was assessed based on pixel accuracy.
This research presents a shape for overcoming the difficulties associated with acquiring unstructured data and quantitatively representing the distribution of particle shapes in SEM images through the integration of GAN and semi-supervised semantic segmentation within the field of catalysis engineering. By enabling rapid and precise segmentation of catalyst shape, this approach is expected to significantly reduce the cost and time associated with researching the properties related to catalyst shapes, thereby maximizing efficiency.
References
- Sun, W., X. Li, C. Sun, Z. Huang, H. Xu, and W. Shen, âInsights into the pyrolysis processes of Ce-MOFs for preparing highly active catalysts of toluene combustion,â Catalysts, 9 (8), (2019).
- Muñoz-Mármol, M., J. Crespo, M.J. Fritts, and V. Maojo, âTowards the taxonomic categorization and recognition of nanoparticle shapes,â Nanomedicine: Nanotechnology, Biology, and Medicine, 11 (2), pp. 457â465 (2015).
- Creswell, A., T. White, V. Dumoulin, K. Arulkumaran, B. Sengupta, and A.A. Bharath, âGenerative Adversarial Networks: An Overview,â IEEE Signal Processing Magazine 35, 2018, 53â65.
- Navab, N., J. Hornegger, W.M. Wells, and A.F. Frangi, âLNCS 9351 - Medical Image Computing and Computer-Assisted Intervention â MICCAI 2015,â (2015).
- Salimans, T., I. Goodfellow, W. Zaremba, V. Cheung, A. Radford, and X. Chen, âImproved Techniques for Training GANs,â (2016).
- Chen, X., Y. Yuan, G. Zeng, and J. Wang, âSemi-Supervised Semantic Segmentation with Cross Pseudo Supervision,â (2021).