1
|
Howard AJ, Rim EY, Garrett OD, Shim Y, Notwell JH, Ronald PC. Combining Directed Evolution with Machine Learning Enables Accurate Genotype-to-Phenotype Predictions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.27.635131. [PMID: 39974914 PMCID: PMC11838293 DOI: 10.1101/2025.01.27.635131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Linking sequence variation to phenotypic effects is critical for efficient exploitation of large genomic datasets. Here we present a novel approach combining directed evolution with protein language modeling to characterize naturally-evolved variants of a rice immune receptor. Using high-throughput directed evolution, we engineered the rice immune receptor Pik-1 to bind and recognize the fungal proteins Avr-PikC and Avr-PikF, which evade detection by currently characterized Pik-1 alleles. A protein language model was fine-tuned on this data to correlate sequence variation with ligand binding behavior. This modeling was then used to characterize Pik-1 variants found in the 3,000 Rice Genomes Project dataset. Two variants scored highly for binding against Avr-PikC, and in vitro analyses confirmed their improved ligand binding over the wild-type Pik-1 receptor. Overall, this machine learning approach identified promising sources of disease resistance in rice and shows potential utility for exploring the phenotypic variation of other proteins of interest.
Collapse
Affiliation(s)
- Alexander J. Howard
- Department of Plant Pathology and the Genome Center, University of California, Davis, CA, 95616, USA
| | - Ellen Y. Rim
- Department of Plant Pathology and the Genome Center, University of California, Davis, CA, 95616, USA
| | - Oscar D. Garrett
- Department of Plant Pathology and the Genome Center, University of California, Davis, CA, 95616, USA
| | - Yejin Shim
- Department of Plant Pathology and the Genome Center, University of California, Davis, CA, 95616, USA
| | - James H. Notwell
- Department of Plant Pathology and the Genome Center, University of California, Davis, CA, 95616, USA
| | - Pamela C. Ronald
- Department of Plant Pathology and the Genome Center, University of California, Davis, CA, 95616, USA
- Joint BioEnergy Institute, Emeryville, CA 94608, USA
- Innovative Genomics Institute (IGI), University of California, Berkeley, CA 94720, USA
| |
Collapse
|