(208c) Self-Supervised Learning Methods for Drug Substance and Drug Product Characterization in the Pharmaceutical Industry
AIChE Annual Meeting
2022
2022 Annual Meeting
Pharmaceutical Discovery, Development and Manufacturing Forum
Enabling Technologies: Mechanistic and statistical modeling
Monday, November 14, 2022 - 4:12pm to 4:33pm
Data-driven in nature, these models usually rely on large amounts of data to achieve goals such as classifying subvisible particles in a solution or detecting extraneous matter or impurity crystals in a vessel. However, training these models for such tasks requires labeled data that needs to be prepared by a human user, which can be a tedious task and very time consuming. In this talk, we will discuss how one can leverage a family of self-supervised or weakly supervised learning methods to facilitate performing speedy training tasks and thereby accelerate practical applications. These methods include autoencoder-based and contrastive learning-based approaches. In essence, the methods are built on the idea to invoke the networks to perform a pre-text task in which they learn the most important features of the available data without relying on labels provided by an operator. We will discuss applying such approaches to characterizing different systems from protein aggregates in sterile liquid formulations to impurity particles in small-molecule crystallization processes.