(403f) Data-Driven Discovery of Polymeric Vehicles for Gene Editing: Serendipity-Inspired Design Directions
AIChE Annual Meeting
2020
2020 Virtual AIChE Annual Meeting
Topical Conference: Applications of Data Science to Molecules and Materials
Applications of Data Science to High Throughput Experimentation
Wednesday, November 18, 2020 - 9:15am to 9:30am
Drawing inspiration from the high-throughput experimental approaches that the pharmaceutical industry uses to screen drug libraries for therapeutic activity, we employed parallel polymer synthesis, formulation, and well-plate-based biological assays to rapidly screen gene editing efficiency. We synthesized a chemically diverse library of copolymers combining different ratios of 1) cationic monomers bearing amines spanning a broad range of basicity and 2) neutral monomers of varying hydrophilicity. This combinatorially designed library allows systematic investigation of the effect of amine basicity, through the variations in polymer pKa that result from incorporating 4 cationic monomers at 100, 75, 50, and 25%. Following parallel polymer synthesis, extensive physicochemical characterization was completed using automated tools: composition and molecular weight analysis, pKa, polyplex size distribution, binding assays, and ζ-potential measurements were all acquired in high-throughput mode to generate a rich dataset. Polymers were complexed with ribonucleoprotein (RNP) payloads, and gene editing was quantified by estimating the proportion of mCherry-positive cells. To resolve the trade-off between sensitivity and experimental throughput, we employed image cytometry and developed an image processing algorithm that quantifies mCherry expression robustly and automatically from a bank of images arising from 200 unique formulations. At the end of the high-throughput screening campaign, we obtained a high-performing hit polymer that outperformed state-of-the-art synthetic transfection reagents.
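The abstract does not describe the image processing algorithm itself, so the following is only a minimal sketch of how a proportion of mCherry-positive cells could be estimated from one image. It assumes that cells have already been segmented into an integer label mask (0 for background, 1..N for cells) and that a fixed intensity threshold separates positive from negative cells. The function name `fraction_positive`, the threshold value, and the toy arrays are all hypothetical and are not taken from the authors' pipeline.

```python
import numpy as np


def fraction_positive(red, labels, threshold):
    """Return the fraction of segmented cells whose mean red-channel
    intensity exceeds `threshold`.

    red       : 2D array of red-channel (mCherry) intensities
    labels    : 2D integer array, 0 = background, 1..N = cell ids (assumed given)
    threshold : intensity cutoff (hypothetical; e.g. set from a negative control)
    """
    red = np.asarray(red, dtype=float)
    labels = np.asarray(labels)
    n_cells = int(labels.max())
    if n_cells == 0:
        return float("nan")  # no cells found in this image

    flat_labels = labels.ravel()
    # Sum of intensities and pixel count per label (index 0 is background).
    sums = np.bincount(flat_labels, weights=red.ravel(), minlength=n_cells + 1)
    counts = np.bincount(flat_labels, minlength=n_cells + 1)

    # Ignore label ids that do not occur in the mask.
    present = counts[1:] > 0
    mean_intensity = sums[1:][present] / counts[1:][present]
    return float(np.mean(mean_intensity > threshold))


# Toy 4x4 image with three "cells" (illustrative values only).
toy_labels = np.array([
    [1, 1, 0, 2],
    [1, 1, 0, 2],
    [0, 0, 0, 0],
    [3, 3, 0, 0],
])
toy_red = np.array([
    [10, 12, 1, 200],
    [11, 9, 1, 220],
    [1, 1, 1, 1],
    [150, 170, 1, 1],
])
toy_fraction = fraction_positive(toy_red, toy_labels, threshold=100)
```

In a screening setting, a function like this would be applied to every well image and the per-formulation fractions compared against controls; segmentation and threshold selection are the steps where most of the robustness would have to come from.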
Having identified a hit formulation from our library, the challenge was to 1) unravel the relationship between polymer attributes and gene editing efficiency and 2) build on these structure-activity relationships to guide the design of future polymer libraries that will yield a higher "hit rate". To derive these predictive relationships, we sought to understand how 10 polymer descriptors influenced biological performance. We turned to principal component analysis (PCA) to deal with the dimensionality challenge posed by our dataset. Through PCA, we concluded that no single polymer descriptor can be used in isolation to guide the design of future libraries. Rather, complex non-linear relationships among several molecular attributes were responsible for editing performance. By discarding preconceived notions of how various chemical functional groups influence delivery and screening a large, chemically diverse polymer library, we discovered design guidelines that do not conform to traditional heuristics, as well as a promising hit polymer that may not have been accessible through hypothesis testing. When statistical learning and automated experimental workflows are applied in tandem, we can overcome challenges originating from the complexity of the structure-function relationships governing polymeric gene delivery.
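As an illustration of the PCA step mentioned above, the sketch below standardizes a descriptor matrix and computes principal components via singular value decomposition. The matrix shape (200 formulations by 10 descriptors) mirrors the numbers quoted in the abstract, but the values are random placeholders, not the authors' measurements, and the function name `pca` and its return layout are assumptions made for this example.

```python
import numpy as np


def pca(X, n_components=2):
    """PCA on standardized columns of X using SVD.

    Returns (scores, loadings, explained_variance_ratio), where the
    ratio covers all components and the scores and loadings cover the
    first `n_components`.
    """
    X = np.asarray(X, dtype=float)
    mu = X.mean(axis=0)
    sd = X.std(axis=0, ddof=1)
    sd[sd == 0] = 1.0  # avoid division by zero for constant descriptors
    Z = (X - mu) / sd  # each descriptor now has zero mean, unit variance

    # Rows of Vt are the principal axes; S holds the singular values.
    U, S, Vt = np.linalg.svd(Z, full_matrices=False)
    explained_variance = S**2 / (Z.shape[0] - 1)
    ratio = explained_variance / explained_variance.sum()

    scores = U[:, :n_components] * S[:n_components]  # projections of Z
    loadings = Vt[:n_components]                     # descriptor weights
    return scores, loadings, ratio


# Placeholder data: 200 formulations x 10 descriptors (random, illustrative).
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(200, 10))
scores, loadings, ratio = pca(X_demo, n_components=2)
```

Inspecting `loadings` shows how strongly each descriptor contributes to a component, and coloring a plot of `scores` by editing efficiency is one common way to see whether performance tracks any single descriptor. Because PCA is a linear method, it can reveal that no single descriptor dominates, but modeling the non-linear relationships described above would require a different technique.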