1
|
Boob AG, Zhu Z, Intasian P, Jain M, Petrov V, Lane ST, Tan SI, Xun G, Zhao H. CRISPR-COPIES: an in silico platform for discovery of neutral integration sites for CRISPR/Cas-facilitated gene integration. Nucleic Acids Res 2024; 52:e30. [PMID: 38346683 PMCID: PMC11014336 DOI: 10.1093/nar/gkae062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Revised: 01/09/2024] [Accepted: 01/19/2024] [Indexed: 04/14/2024] Open
Abstract
The CRISPR/Cas system has emerged as a powerful tool for genome editing in metabolic engineering and human gene therapy. However, locating the optimal site on the chromosome to integrate heterologous genes using the CRISPR/Cas system remains an open question. Selecting a suitable site for gene integration involves considering multiple complex criteria, including factors related to CRISPR/Cas-mediated integration, genetic stability, and gene expression. Consequently, identifying such sites on specific or different chromosomal locations typically requires extensive characterization efforts. To address these challenges, we have developed CRISPR-COPIES, a COmputational Pipeline for the Identification of CRISPR/Cas-facilitated intEgration Sites. This tool leverages ScaNN, a state-of-the-art model on the embedding-based nearest neighbor search for fast and accurate off-target search, and can identify genome-wide intergenic sites for most bacterial and fungal genomes within minutes. As a proof of concept, we utilized CRISPR-COPIES to characterize neutral integration sites in three diverse species: Saccharomyces cerevisiae, Cupriavidus necator, and HEK293T cells. In addition, we developed a user-friendly web interface for CRISPR-COPIES (https://biofoundry.web.illinois.edu/copies/). We anticipate that CRISPR-COPIES will serve as a valuable tool for targeted DNA integration and aid in the characterization of synthetic biology toolkits, enable rapid strain construction to produce valuable biochemicals, and support human gene and cell therapy applications.
Collapse
Affiliation(s)
- Aashutosh Girish Boob
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Zhixin Zhu
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Pattarawan Intasian
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- School of Biomolecular Science and Engineering, Vidyasirimedhi Institute of Science and Technology, Wangchan Valley, Rayong 21210, Thailand
| | - Manan Jain
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Vassily Andrew Petrov
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Stephan Thomas Lane
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Shih-I Tan
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Guanhua Xun
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Huimin Zhao
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
2
|
Bi M, Wang Z, Cheng K, Cui Y, He Y, Ma J, Qi M. Construction of transcription factor mutagenesis population in tomato using a pooled CRISPR/Cas9 plasmid library. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2023; 205:108094. [PMID: 37995578 DOI: 10.1016/j.plaphy.2023.108094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 09/25/2023] [Accepted: 10/12/2023] [Indexed: 11/25/2023]
Abstract
Adequate mutant materials are the prerequisite for conducting gene function research or screening novel functional genes in plants. The strategy of constructing a large-scale mutant population using the pooled CRISPR/Cas9-sgRNA library has been implemented in several crops. However, the effective application of this CRISPR/Cas9 large-scale screening strategy to tomato remains to be attempted. Here, we identified 990 transcription factors in the tomato genome, designed and synthesized a CRISPR/Cas9 plasmid library containing 4379 sgRNAs. Using this pooled library, 487 T0 positive plants were obtained, among which 92 plants harbored a single sgRNA sequence, targeting 65 different transcription factors, with a mutation rate of 23%. In the T0 mutant population, the occurrence of homozygous and biallelic mutations was observed at higher frequencies. Additionally, the utilization of a small-scale CRISPR/Cas9 library targeting 30 transcription factors could enhance the efficacy of single sgRNA recognition in positive plants, increasing it from 19% to 42%. Phenotypic characterization of several mutants identified from the mutant population demonstrated the utility of our CRISPR/Cas9 mutant library. Taken together, our study offers insights into the implementation and optimization of CRISPR/Cas9-mediated large-scale knockout library in tomato.
Collapse
Affiliation(s)
- Mengxi Bi
- College of Horticulture, Shenyang Agricultural University, Shenyang, China; National & Local Joint Engineering Research Center of Northern Horticultural Facilities Design & Application Technology (Liaoning), Shenyang, China; Key Laboratory of Protected Horticulture (Shenyang Agricultural University), Ministry of Education, Shenyang, China; Key Laboratory of Horticultural Equipment, Ministry of Agriculture and Rural Affairs, Shenyang, China
| | - Zhijun Wang
- College of Horticulture, Shenyang Agricultural University, Shenyang, China; National & Local Joint Engineering Research Center of Northern Horticultural Facilities Design & Application Technology (Liaoning), Shenyang, China; Key Laboratory of Protected Horticulture (Shenyang Agricultural University), Ministry of Education, Shenyang, China; Key Laboratory of Horticultural Equipment, Ministry of Agriculture and Rural Affairs, Shenyang, China
| | - Keyan Cheng
- College of Horticulture, Shenyang Agricultural University, Shenyang, China; National & Local Joint Engineering Research Center of Northern Horticultural Facilities Design & Application Technology (Liaoning), Shenyang, China; Key Laboratory of Protected Horticulture (Shenyang Agricultural University), Ministry of Education, Shenyang, China; Key Laboratory of Horticultural Equipment, Ministry of Agriculture and Rural Affairs, Shenyang, China
| | - Yiqing Cui
- National & Local Joint Engineering Research Center of Northern Horticultural Facilities Design & Application Technology (Liaoning), Shenyang, China; Key Laboratory of Protected Horticulture (Shenyang Agricultural University), Ministry of Education, Shenyang, China
| | - Yi He
- National & Local Joint Engineering Research Center of Northern Horticultural Facilities Design & Application Technology (Liaoning), Shenyang, China; Key Laboratory of Protected Horticulture (Shenyang Agricultural University), Ministry of Education, Shenyang, China
| | - Jian Ma
- College of Horticulture, Shenyang Agricultural University, Shenyang, China; National & Local Joint Engineering Research Center of Northern Horticultural Facilities Design & Application Technology (Liaoning), Shenyang, China; Key Laboratory of Protected Horticulture (Shenyang Agricultural University), Ministry of Education, Shenyang, China; Key Laboratory of Horticultural Equipment, Ministry of Agriculture and Rural Affairs, Shenyang, China
| | - Mingfang Qi
- College of Horticulture, Shenyang Agricultural University, Shenyang, China; National & Local Joint Engineering Research Center of Northern Horticultural Facilities Design & Application Technology (Liaoning), Shenyang, China; Key Laboratory of Protected Horticulture (Shenyang Agricultural University), Ministry of Education, Shenyang, China; Key Laboratory of Horticultural Equipment, Ministry of Agriculture and Rural Affairs, Shenyang, China.
| |
Collapse
|
3
|
Develtere W, Waegneer E, Debray K, De Saeger J, Van Glabeke S, Maere S, Ruttink T, Jacobs TB. SMAP design: a multiplex PCR amplicon and gRNA design tool to screen for natural and CRISPR-induced genetic variation. Nucleic Acids Res 2023; 51:e37. [PMID: 36718951 PMCID: PMC10123101 DOI: 10.1093/nar/gkad036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 12/14/2022] [Accepted: 01/12/2023] [Indexed: 02/01/2023] Open
Abstract
Multiplex amplicon sequencing is a versatile method to identify genetic variation in natural or mutagenized populations through eco-tilling or multiplex CRISPR screens. Such genotyping screens require reliable and specific primer designs, combined with simultaneous gRNA design for CRISPR screens. Unfortunately, current tools are unable to combine multiplex gRNA and primer design in a high-throughput and easy-to-use manner with high design flexibility. Here, we report the development of a bioinformatics tool called SMAP design to overcome these limitations. We tested SMAP design on several plant and non-plant genomes and obtained designs for more than 80-90% of the target genes, depending on the genome and gene family. We validated the designs with Illumina multiplex amplicon sequencing and Sanger sequencing in Arabidopsis, soybean, and maize. We also used SMAP design to perform eco-tilling by tilling PCR amplicons across nine candidate genes putatively associated with haploid induction in Cichorium intybus. We screened 60 accessions of chicory and witloof and identified thirteen knockout haplotypes and their carriers. SMAP design is an easy-to-use command-line tool that generates highly specific gRNA and/or primer designs for any number of loci for CRISPR or natural variation screens and is compatible with other SMAP modules for seamless downstream analysis.
Collapse
Affiliation(s)
- Ward Develtere
- Department of Plant Biotechnology and Bioinformatics, Ghent University, (Technologiepark-Zwijnaarde 71) 9052, Ghent, Belgium
- VIB Center for Plant Systems Biology, (Technologiepark-Zwijnaarde 71), 9052, Ghent, Belgium
| | - Evelien Waegneer
- ILVO, Flanders Research Institute for Agriculture, Fisheries and Food, Plant Sciences Unit, (Caritasstraat 39), 9090, Melle, Belgium
- Laboratory for Plant Genetics and Crop Improvement, Division of Crop Biotechnics, Department of Biosystems, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Kevin Debray
- Department of Plant Biotechnology and Bioinformatics, Ghent University, (Technologiepark-Zwijnaarde 71) 9052, Ghent, Belgium
- VIB Center for Plant Systems Biology, (Technologiepark-Zwijnaarde 71), 9052, Ghent, Belgium
- ILVO, Flanders Research Institute for Agriculture, Fisheries and Food, Plant Sciences Unit, (Caritasstraat 39), 9090, Melle, Belgium
| | - Jonas De Saeger
- Department of Plant Biotechnology and Bioinformatics, Ghent University, (Technologiepark-Zwijnaarde 71) 9052, Ghent, Belgium
- VIB Center for Plant Systems Biology, (Technologiepark-Zwijnaarde 71), 9052, Ghent, Belgium
| | - Sabine Van Glabeke
- ILVO, Flanders Research Institute for Agriculture, Fisheries and Food, Plant Sciences Unit, (Caritasstraat 39), 9090, Melle, Belgium
| | - Steven Maere
- Department of Plant Biotechnology and Bioinformatics, Ghent University, (Technologiepark-Zwijnaarde 71) 9052, Ghent, Belgium
- VIB Center for Plant Systems Biology, (Technologiepark-Zwijnaarde 71), 9052, Ghent, Belgium
| | - Tom Ruttink
- Department of Plant Biotechnology and Bioinformatics, Ghent University, (Technologiepark-Zwijnaarde 71) 9052, Ghent, Belgium
- ILVO, Flanders Research Institute for Agriculture, Fisheries and Food, Plant Sciences Unit, (Caritasstraat 39), 9090, Melle, Belgium
| | - Thomas B Jacobs
- Department of Plant Biotechnology and Bioinformatics, Ghent University, (Technologiepark-Zwijnaarde 71) 9052, Ghent, Belgium
- VIB Center for Plant Systems Biology, (Technologiepark-Zwijnaarde 71), 9052, Ghent, Belgium
| |
Collapse
|
4
|
Devi R, Chauhan S, Dhillon TS. Genome editing for vegetable crop improvement: Challenges and future prospects. Front Genet 2022; 13:1037091. [DOI: 10.3389/fgene.2022.1037091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 10/28/2022] [Indexed: 11/23/2022] Open
Abstract
Vegetable crops are known as protective foods due to their potential role in a balanced human diet, especially for vegetarians as they are a rich source of vitamins and minerals along with dietary fibers. Many biotic and abiotic stresses threaten the crop growth, yield and quality of these crops. These crops are annual, biennial and perennial in breeding behavior. Traditional breeding strategies pose many challenges in improving economic crop traits. As in most of the cases the large number of backcrosses and stringent selection pressure is required for the introgression of the useful traits into the germplasm, which is time and labour-intensive process. Plant scientists have improved economic traits like yield, quality, biotic stress resistance, abiotic stress tolerance, and improved nutritional quality of crops more precisely and accurately through the use of the revolutionary breeding method known as clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated protein-9 (Cas9). The high mutation efficiency, less off-target consequences and simplicity of this technique has made it possible to attain novel germplasm resources through gene-directed mutation. It facilitates mutagenic response even in complicated genomes which are difficult to breed using traditional approaches. The revelation of functions of important genes with the advancement of whole-genome sequencing has facilitated the CRISPR-Cas9 editing to mutate the desired target genes. This technology speeds up the creation of new germplasm resources having better agro-economical traits. This review entails a detailed description of CRISPR-Cas9 gene editing technology along with its potential applications in olericulture, challenges faced and future prospects.
Collapse
|
5
|
A systematic mapping study on machine learning techniques for the prediction of CRISPR/Cas9 sgRNA target cleavage. Comput Struct Biotechnol J 2022; 20:5813-5823. [PMID: 36382194 PMCID: PMC9630617 DOI: 10.1016/j.csbj.2022.10.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 09/21/2022] [Accepted: 10/08/2022] [Indexed: 11/30/2022] Open
Abstract
CRISPR/Cas9 technology has greatly accelerated genome engineering research. The CRISPR/Cas9 complex, a bacterial immune response system, is widely adopted for RNA-driven targeted genome editing. The systematic mapping study presented in this paper examines the literature on machine learning (ML) techniques employed in the prediction of CRISPR/Cas9 sgRNA on/off-target cleavage, focusing on improving support in sgRNA design activities and identifying areas currently being researched. This area of research has greatly expanded recently, and we found it appropriate to work on a Systematic Mapping Study (SMS), an investigation that has proven to be an effective secondary study method. Unlike a classic review, in an SMS, no comparison of methods or results is made, while this task can instead be the subject of a systematic literature review that chooses one theme among those highlighted in this SMS. The study is illustrated in this paper. To the best of the authors' knowledge, no other SMS studies have been published on this topic. Fifty-seven papers published in the period 2017–2022 (April, 30) were analyzed. This study reveals that the most widely used ML model is the convolutional neural network (CNN), followed by the feedforward neural network (FNN), while the use of other models is marginal. Other interesting information has emerged, such as the wide availability of both open code and platforms dedicated to supporting the activity of researchers or the fact that there is a clear prevalence of public funds that finance research on this topic.
Collapse
|