Deep Learning of the Regulatory Grammar of Yeast 5’ Untranslated Regions from 500,000 Random Sequences | AIChE

Deep Learning of the Regulatory Grammar of Yeast 5’ Untranslated Regions from 500,000 Random Sequences

Authors 

Groves, B. - Presenter, University of Washington
Our ability to predict protein expression from DNA sequence alone remains poor, reflecting our limited understanding of cis-regulatory grammar and hampering the design of engineered genes for synthetic biology applications. We have generated a model that predicts the translational efficiency of the 5’ untranslated region (UTR) of mRNAs in the yeast Saccharomyces cerevisiae. We constructed a library of nearly half a million 50 nucleotide-long random 5’ UTRs and assayed them in a single massively parallel growth selection experiment. The resulting data have allowed us to quantify the impact on translation of Kozak sequence composition, upstream open reading frames (uORFs) and secondary structure. With this data, we have trained a convolutional neural network on the random library and validated it by predicting the translational efficiency of 5’ UTRs that natively occur in yeast. The model additionally was used to computationally evolve highly translating 5’ UTRs. We have confirmed experimentally that the great majority of the evolved sequences lead to higher translation rates than the starting sequences, demonstrating the predictive utility of this model.

Checkout

This paper has an Extended Abstract file available; you must purchase the conference proceedings to access it.

Checkout

Do you already own this?

Pricing

Individuals

AIChE Explorer Members $250.00
Non-Members $250.00