1
|
Feussner M, Migur A, Mitrofanov A, Alkhnbashi OS, Backofen R, Beisel CL, Weinberg Z. Disparate mechanisms counteract extraneous CRISPR RNA production in type II-C CRISPR-Cas systems. MICROLIFE 2025; 6:uqaf007. [PMID: 40376300 PMCID: PMC12080349 DOI: 10.1093/femsml/uqaf007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2025] [Revised: 03/24/2025] [Accepted: 05/13/2025] [Indexed: 05/18/2025]
Abstract
CRISPR-Cas adaptive immune systems in bacteria and archaea enable precise targeting and elimination of invading genetic elements. An inherent feature of these systems is the 'extraneous' CRISPR RNA (ecrRNA), which is produced via the extra repeat in a CRISPR array lacking a corresponding spacer. As ecrRNAs would interact with the Cas machinery yet not direct acquired immunity, they pose a potential barrier to defence. Type II-A CRISPR-Cas systems resolve this barrier through the leader sequence upstream of a CRISPR array, which forms a hairpin structure with the extra repeat that inhibits ecrRNA production. However, the fate of ecrRNAs in other CRISPR types and subtypes remains to be explored. Here, we report that II-C systems likely employ disparate strategies to resolve the ecrRNA due to their distinct configuration in comparison to II-A. Applying bioinformatics analyses to over 650 II-C systems followed by experimental validation, we identified three strategies applicable to these systems: formation of an upstream Rho-independent terminator, formation of a hairpin that sequesters the ecrRNA guide, and mutations in the repeat expected to disrupt ecrRNA formation. These findings expand the list of mechanisms in CRISPR-Cas systems that could resolve the ecrRNA to optimize immune response.
Collapse
Affiliation(s)
- Maximilian Feussner
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Centre for Bioinformatics, Leipzig University, D-04107 Leipzig, Germany
| | - Angela Migur
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz-Centre for Infection Research (HZI), D-97080 Würzburg, Germany
| | - Alexander Mitrofanov
- Bioinformatics Group, Department of Computer Science, University of Freiburg, D-79110 Freiburg, Germany
| | - Omer S Alkhnbashi
- Center for Applied and Translational Genomics (CATG), Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai Healthcare City, Dubai P.O. Box 505055, United Arab Emirates
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai Healthcare City, Dubai P.O. Box 505055, United Arab Emirates
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, D-79110 Freiburg, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, D-79104 Freiburg, Germany
| | - Chase L Beisel
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz-Centre for Infection Research (HZI), D-97080 Würzburg, Germany
- Medical Faculty, University of Würzburg, D-97070 Würzburg, Germany
| | - Zasha Weinberg
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Centre for Bioinformatics, Leipzig University, D-04107 Leipzig, Germany
| |
Collapse
|
2
|
Qi C, Shen X, Li B, Liu C, Huang L, Lan H, Chen D, Jiang Y, Wang D. PAMPHLET: PAM Prediction HomoLogous-Enhancement Toolkit for precise PAM prediction in CRISPR-Cas systems. J Genet Genomics 2025; 52:258-268. [PMID: 39522681 DOI: 10.1016/j.jgg.2024.10.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2024] [Revised: 10/24/2024] [Accepted: 10/26/2024] [Indexed: 11/16/2024]
Abstract
CRISPR-Cas technology has revolutionized our ability to understand and engineer organisms, evolving from a singular Cas9 model to a diverse CRISPR toolbox. A critical bottleneck in developing new Cas proteins is identifying protospacer adjacent motif (PAM) sequences. Due to the limitations of experimental methods, bioinformatics approaches have become essential. However, existing PAM prediction programs are limited by the small number of spacers in CRISPR-Cas systems, resulting in low accuracy. To address this, we develop PAMPHLET, a pipeline that uses homology searches to identify additional spacers, significantly increasing the number of spacers up to 18-fold. PAMPHLET is validated on 20 CRISPR-Cas systems and successfully predicts PAM sequences for 18 protospacers. These predictions are further validated using the DocMF platform, which characterizes protein-DNA recognition patterns via next-generation sequencing. The high consistency between PAMPHLET predictions and DocMF results for Cas proteins demonstrates the potential of PAMPHLET to enhance PAM sequence prediction accuracy, expedite the discovery process, and accelerate the development of CRISPR tools.
Collapse
Affiliation(s)
- Chen Qi
- Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai, Guangdong 519087, China
| | - Xuechun Shen
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou, Guangdong 510006, China; BGI Research, Hangzhou, Zhejiang 310030, China
| | - Baitao Li
- BGI Research, Shenzhen, Guangdong 518083, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Chuan Liu
- Laboratory of Integrative Biomedicine, Department of Biology, Faculty of Science, University of Copenhagen, Copenhagen, Denmark; Qingdao-Europe Advanced Institute for Life Sciences, BGI Research, Qingdao, Shandong 266555, China; BGI Research, Shenzhen, Guangdong 518083, China
| | - Lei Huang
- School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, Zhejiang 310024, China
| | - Hongxia Lan
- BGI Research, Shenzhen, Guangdong 518083, China
| | - Donglong Chen
- Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai, Guangdong 519087, China
| | - Yuan Jiang
- STOmics Americas Ltd., 2904 Orchard Pkwy, San Jose CA 95134, USA
| | - Dan Wang
- Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai, Guangdong 519087, China.
| |
Collapse
|
3
|
Lee H, Rho WY, Kim YH, Chang H, Jun BH. CRISPR-Cas9 Gene Therapy: Non-Viral Delivery and Stimuli-Responsive Nanoformulations. Molecules 2025; 30:542. [PMID: 39942646 PMCID: PMC11820414 DOI: 10.3390/molecules30030542] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2024] [Revised: 01/07/2025] [Accepted: 01/17/2025] [Indexed: 02/16/2025] Open
Abstract
The CRISPR-Cas9 technology, one of the groundbreaking genome editing methods for addressing genetic disorders, has emerged as a powerful, precise, and efficient tool. However, its clinical translation remains hindered by challenges in delivery efficiency and targeting specificity. This review provides a comprehensive analysis of the structural features, advantages, and potential applications of various non-viral and stimuli-responsive systems, examining recent progress to emphasize the potential to address these limitations and advance CRISPR-Cas9 therapeutics. We describe how recent reports emphasize that nonviral vectors, including lipid-based nanoparticles, extracellular vesicles, polymeric nanoparticles, gold nanoparticles, and mesoporous silica nanoparticles, can offer diverse advantages to enhance stability, cellular uptake, and biocompatibility, based on their structures and physio-chemical stability. We also summarize recent progress on stimuli-responsive nanoformulations, a type of non-viral vector, to introduce precision and control in CRISPR-Cas9 delivery. Stimuli-responsive nanoformulations are designed to respond to pH, redox states, and external triggers, facilitate controlled and targeted delivery, and minimize off-target effects. The insights in our review suggest future challenges for clinical applications of gene therapy technologies and highlight the potential of delivery systems to enhance CRISPR-Cas9's clinical efficacy, positioning them as pivotal tools for future gene-editing therapies.
Collapse
Affiliation(s)
- Hyunwoo Lee
- Department of Bioscience and Biotechnology, Konkuk University, Seoul 05029, Republic of Korea; (H.L.); (Y.-H.K.)
| | - Won-Yeop Rho
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea;
| | - Yoon-Hee Kim
- Department of Bioscience and Biotechnology, Konkuk University, Seoul 05029, Republic of Korea; (H.L.); (Y.-H.K.)
| | - Hyejin Chang
- Division of Science Education, Kangwon National University, 1 Gangwondaehakgil, Chuncheon-si 24341, Republic of Korea
| | - Bong-Hyun Jun
- Department of Bioscience and Biotechnology, Konkuk University, Seoul 05029, Republic of Korea; (H.L.); (Y.-H.K.)
| |
Collapse
|
4
|
Nguyen E, Poli M, Durrant MG, Kang B, Katrekar D, Li DB, Bartie LJ, Thomas AW, King SH, Brixi G, Sullivan J, Ng MY, Lewis A, Lou A, Ermon S, Baccus SA, Hernandez-Boussard T, Ré C, Hsu PD, Hie BL. Sequence modeling and design from molecular to genome scale with Evo. Science 2024; 386:eado9336. [PMID: 39541441 PMCID: PMC12057570 DOI: 10.1126/science.ado9336] [Citation(s) in RCA: 34] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 09/09/2024] [Indexed: 11/16/2024]
Abstract
The genome is a sequence that encodes the DNA, RNA, and proteins that orchestrate an organism's function. We present Evo, a long-context genomic foundation model with a frontier architecture trained on millions of prokaryotic and phage genomes, and report scaling laws on DNA to complement observations in language and vision. Evo generalizes across DNA, RNA, and proteins, enabling zero-shot function prediction competitive with domain-specific language models and the generation of functional CRISPR-Cas and transposon systems, representing the first examples of protein-RNA and protein-DNA codesign with a language model. Evo also learns how small mutations affect whole-organism fitness and generates megabase-scale sequences with plausible genomic architecture. These prediction and generation capabilities span molecular to genomic scales of complexity, advancing our understanding and control of biology.
Collapse
Affiliation(s)
- Eric Nguyen
- Arc Institute, Palo Alto, CA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Michael Poli
- Department of Computer Science, Stanford University, Stanford, CA, USA
- TogetherAI, San Francisco, CA, USA
| | | | - Brian Kang
- Arc Institute, Palo Alto, CA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | | | - David B. Li
- Arc Institute, Palo Alto, CA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | | | - Armin W. Thomas
- Stanford Data Science, Stanford University, Stanford, CA, USA
| | - Samuel H. King
- Arc Institute, Palo Alto, CA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Garyk Brixi
- Arc Institute, Palo Alto, CA, USA
- Department of Genetics, Stanford University, Stanford, CA, USA
| | | | - Madelena Y. Ng
- Stanford Center for Biomedical Informatics Research, Stanford, CA, USA
| | - Ashley Lewis
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | - Aaron Lou
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Stefano Ermon
- Department of Computer Science, Stanford University, Stanford, CA, USA
- CZ Biohub, San Francisco, CA, USA
| | | | | | - Christopher Ré
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Patrick D. Hsu
- Arc Institute, Palo Alto, CA, USA
- Department of Bioengineering and Center for Computational Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Brian L. Hie
- Arc Institute, Palo Alto, CA, USA
- Stanford Data Science, Stanford University, Stanford, CA, USA
- Department of Chemical Engineering, Stanford University, Stanford, CA, USA
| |
Collapse
|
5
|
Hauns S, Alkhnbashi OS, Backofen R. Deepdefense: annotation of immune systems in prokaryotes using deep learning. Gigascience 2024; 13:giae062. [PMID: 39388605 PMCID: PMC11959188 DOI: 10.1093/gigascience/giae062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 02/19/2024] [Accepted: 08/01/2024] [Indexed: 10/12/2024] Open
Abstract
BACKGROUND Due to a constant evolutionary arms race, archaea and bacteria have evolved an abundance and diversity of immune responses to protect themselves against phages. Since the discovery and application of CRISPR-Cas adaptive immune systems, numerous novel candidates for immune systems have been identified. Previous approaches to identifying these new immune systems rely on hidden Markov model (HMM)-based homolog searches or use labor-intensive and costly wet-lab experiments. To aid in finding and classifying immune systems genomes, we use machine learning to classify already known immune system proteins and discover potential candidates in the genome. Neural networks have shown promising results in classifying and predicting protein functionality in recent years. However, these methods often operate under the closed-world assumption, where it is presumed that all potential outcomes or classes are already known and included in the training dataset. This assumption does not always hold true in real-world scenarios, such as in genomics, where new samples can emerge that were not previously accounted for in the training phase. RESULTS In this work, we explore neural networks for immune protein classification, deal with different methods for rejecting unrelated proteins in a genome-wide search, and establish a benchmark. Then, we optimize our approach for accuracy. Based on this, we develop an algorithm called Deepdefense to predict immune cassette classes based on a genome. This design facilitates the differentiation between immune system-related and unrelated proteins by analyzing variations in model-predicted confidence values, aiding in the identification of both known and potentially novel immune system proteins. Finally, we test our approach for detecting immune systems in the genome against an HMM-based method. CONCLUSIONS Deepdefense can automatically detect genes and define cassette annotations and classifications using 2 model classifications. This is achieved by creating an optimized deep learning model to annotate immune systems, in combination with calibration methods, and a second model to enable the scanning of an entire genome.
Collapse
Affiliation(s)
- Sven Hauns
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg 79110, Germany
| | - Omer S Alkhnbashi
- Center for Applied and Translational Genomics (CATG), Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai Healthcare City, Al Razi St. P.O 505055, Dubai, United Arab Emirates
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai Healthcare City, Al Razi St. 505055, Dubai, United Arab Emirates
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg 79110, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Freiburg 79104, Germany
| |
Collapse
|
6
|
Backofen R, Gorodkin J, Hofacker IL, Stadler PF. Comparative RNA Genomics. Methods Mol Biol 2024; 2802:347-393. [PMID: 38819565 DOI: 10.1007/978-1-0716-3838-5_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Over the last quarter of a century it has become clear that RNA is much more than just a boring intermediate in protein expression. Ancient RNAs still appear in the core information metabolism and comprise a surprisingly large component in bacterial gene regulation. A common theme with these types of mostly small RNAs is their reliance of conserved secondary structures. Large-scale sequencing projects, on the other hand, have profoundly changed our understanding of eukaryotic genomes. Pervasively transcribed, they give rise to a plethora of large and evolutionarily extremely flexible non-coding RNAs that exert a vastly diverse array of molecule functions. In this chapter we provide a-necessarily incomplete-overview of the current state of comparative analysis of non-coding RNAs, emphasizing computational approaches as a means to gain a global picture of the modern RNA world.
Collapse
Affiliation(s)
- Rolf Backofen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg, Germany
- Center for Non-coding RNA in Technology and Health, University of Copenhagen, Frederiksberg, Denmark
| | - Jan Gorodkin
- Center for Non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, Frederiksberg, Denmark
| | - Ivo L Hofacker
- Institute for Theoretical Chemistry, University of Vienna, Wien, Austria
- Bioinformatics and Computational Biology research group, University of Vienna, Vienna, Austria
- Center for Non-coding RNA in Technology and Health, University of Copenhagen, Frederiksberg, Denmark
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Leipzig, Germany.
- Interdisciplinary Center for Bioinformatics, University of Leipzig, Leipzig, Germany.
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany.
- Universidad National de Colombia, Bogotá, Colombia.
- Institute for Theoretical Chemistry, University of Vienna, Wien, Austria.
- Center for Non-coding RNA in Technology and Health, University of Copenhagen, Frederiksberg, Denmark.
- Santa Fe Institute, Santa Fe, NM, USA.
| |
Collapse
|
7
|
Ziemann M, Reimann V, Liang Y, Shi Y, Ma H, Xie Y, Li H, Zhu T, Lu X, Hess WR. CvkR is a MerR-type transcriptional repressor of class 2 type V-K CRISPR-associated transposase systems. Nat Commun 2023; 14:924. [PMID: 36801863 PMCID: PMC9938897 DOI: 10.1038/s41467-023-36542-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 02/06/2023] [Indexed: 02/20/2023] Open
Abstract
Certain CRISPR-Cas elements integrate into Tn7-like transposons, forming CRISPR-associated transposon (CAST) systems. How the activity of these systems is controlled in situ has remained largely unknown. Here we characterize the MerR-type transcriptional regulator Alr3614 that is encoded by one of the CAST (AnCAST) system genes in the genome of cyanobacterium Anabaena sp. PCC 7120. We identify a number of Alr3614 homologs across cyanobacteria and suggest naming these regulators CvkR for Cas V-K repressors. Alr3614/CvkR is translated from leaderless mRNA and represses the AnCAST core modules cas12k and tnsB directly, and indirectly the abundance of the tracr-CRISPR RNA. We identify a widely conserved CvkR binding motif 5'-AnnACATnATGTnnT-3'. Crystal structure of CvkR at 1.6 Å resolution reveals that it comprises distinct dimerization and potential effector-binding domains and that it assembles into a homodimer, representing a discrete structural subfamily of MerR regulators. CvkR repressors are at the core of a widely conserved regulatory mechanism that controls type V-K CAST systems.
Collapse
Affiliation(s)
- Marcus Ziemann
- Faculty of Biology, Institute of Biology III, Genetics and Experimental Bioinformatics, University of Freiburg, Schänzlestr. 1, Freiburg, D-79104, Germany
| | - Viktoria Reimann
- Faculty of Biology, Institute of Biology III, Genetics and Experimental Bioinformatics, University of Freiburg, Schänzlestr. 1, Freiburg, D-79104, Germany
| | - Yajing Liang
- Qingdao Institute of Bioenergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, No.189 Songling Road, Qingdao, 266101, China.,Shandong Energy Institute, Qingdao, 266101, China.,Qingdao New Energy Shandong Laboratory, Qingdao, 266101, China
| | - Yue Shi
- Qingdao Institute of Bioenergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, No.189 Songling Road, Qingdao, 266101, China.,Shandong Energy Institute, Qingdao, 266101, China.,Qingdao New Energy Shandong Laboratory, Qingdao, 266101, China
| | - Honglei Ma
- Qingdao Institute of Bioenergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, No.189 Songling Road, Qingdao, 266101, China.,Shandong Energy Institute, Qingdao, 266101, China.,Qingdao New Energy Shandong Laboratory, Qingdao, 266101, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yuman Xie
- Qingdao Institute of Bioenergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, No.189 Songling Road, Qingdao, 266101, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Hui Li
- Qingdao Institute of Bioenergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, No.189 Songling Road, Qingdao, 266101, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Tao Zhu
- Qingdao Institute of Bioenergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, No.189 Songling Road, Qingdao, 266101, China. .,Shandong Energy Institute, Qingdao, 266101, China. .,Qingdao New Energy Shandong Laboratory, Qingdao, 266101, China. .,University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Xuefeng Lu
- Qingdao Institute of Bioenergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, No.189 Songling Road, Qingdao, 266101, China. .,Shandong Energy Institute, Qingdao, 266101, China. .,Qingdao New Energy Shandong Laboratory, Qingdao, 266101, China. .,University of Chinese Academy of Sciences, Beijing, 100049, China. .,Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China.
| | - Wolfgang R Hess
- Faculty of Biology, Institute of Biology III, Genetics and Experimental Bioinformatics, University of Freiburg, Schänzlestr. 1, Freiburg, D-79104, Germany.
| |
Collapse
|
8
|
Maurya A, Szymanski M, Karlowski WM. ARA: a flexible pipeline for automated exploration of NCBI SRA datasets. Gigascience 2022; 12:giad067. [PMID: 37589306 PMCID: PMC10433097 DOI: 10.1093/gigascience/giad067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 07/07/2023] [Accepted: 08/01/2023] [Indexed: 08/18/2023] Open
Abstract
BACKGROUND One of the most effective and useful methods to explore the content of biological databases is searching with nucleotide or protein sequences as a query. However, especially in the case of nucleic acids, due to the large volume of data generated by the next-generation sequencing (NGS) technologies, this approach is often not available. The hierarchical organization of the NGS records is primarily designed for browsing or text-based searches of the information provided in metadata-related keywords, limiting the efficiency of database exploration. FINDINGS We developed an automated pipeline that incorporates the well-established NGS data-processing tools and procedures to allow easy and effective sampling of the NCBI SRA database records. Given a file with query nucleotide sequences, our tool estimates the matching content of SRA accessions by probing only a user-defined fraction of a record's sequences. Based on the selected parameters, it allows performing a full mapping experiment with records that meet the required criteria. The pipeline is designed to be easy to operate-it offers a fully automatic setup procedure and is fixed on tested supporting tools. The modular design and implemented usage modes allow a user to scale up the analyses into complex computational infrastructure. CONCLUSIONS We present an easy-to-operate and automated tool that expands the way a user can access and explore the information contained within the records deposited in the NCBI SRA database.
Collapse
Affiliation(s)
- Anand Maurya
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznan, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland
| | - Maciej Szymanski
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznan, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland
| | - Wojciech M Karlowski
- Department of Computational Biology, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznan, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland
| |
Collapse
|
9
|
Dong M, Liu J, Liu C, Wang H, Sun W, Liu B. CRISPR/CAS9: A promising approach for the research and treatment of cardiovascular diseases. Pharmacol Res 2022; 185:106480. [PMID: 36191879 DOI: 10.1016/j.phrs.2022.106480] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Revised: 09/27/2022] [Accepted: 09/29/2022] [Indexed: 10/31/2022]
Abstract
The development of gene-editing technology has been one of the biggest advances in biomedicine over the past two decades. Not only can it be used as a research tool to build a variety of disease models for the exploration of disease pathogenesis at the genetic level, it can also be used for prevention and treatment. This is done by intervening with the expression of target genes and carrying out precise molecular targeted therapy for diseases. The simple and flexible clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 gene-editing technology overcomes the limitations of zinc finger nucleases (ZFNs) and transcription activator-like effector nucleases (TALENs). For this reason, it has rapidly become a preferred method for gene editing. As a new gene intervention method, CRISPR/Cas9 has been widely used in the clinical treatment of tumours and rare diseases; however, its application in the field of cardiovascular diseases is currently limited. This article reviews the application of the CRISPR/Cas9 editing technology in cardiovascular disease research and treatment, and discusses the limitations and prospects of this technology.
Collapse
Affiliation(s)
- Mengying Dong
- Department of Cardiology, The Second Hospital of Jilin University, 218 Ziqiang Road, Changchun, China, 130041
| | - Jiangen Liu
- Department of Cardiology, The Second Hospital of Jilin University, 218 Ziqiang Road, Changchun, China, 130041
| | - Caixia Liu
- Department of Neurology, The Liaoning Province People's Hospital, 33 Wenyi Road, ShenYang, China, 110016
| | - He Wang
- Department of Cardiology, The Second Hospital of Jilin University, 218 Ziqiang Road, Changchun, China, 130041
| | - Wei Sun
- Department of Cardiology, The Second Hospital of Jilin University, 218 Ziqiang Road, Changchun, China, 130041.
| | - Bin Liu
- Department of Cardiology, The Second Hospital of Jilin University, 218 Ziqiang Road, Changchun, China, 130041.
| |
Collapse
|