1
|
Wei YH, Lin F. Barcodes based on nucleic acid sequences: Applications and challenges (Review). Mol Med Rep 2025; 32:187. [PMID: 40314098 PMCID: PMC12076290 DOI: 10.3892/mmr.2025.13552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Accepted: 03/04/2025] [Indexed: 05/03/2025] Open
Abstract
Cells are the fundamental structural and functional units of living organisms and the study of these entities has remained a central focus throughout the history of biological sciences. Traditional cell research techniques, including fluorescent protein tagging and microscopy, have provided preliminary insights into the lineage history and clonal relationships between progenitor and descendant cells. However, these techniques exhibit inherent limitations in tracking the full developmental trajectory of cells and elucidating their heterogeneity, including sensitivity, stability and barcode drift. In developmental biology, nucleic acid barcode technology has introduced an innovative approach to cell lineage tracing. By assigning unique barcodes to individual cells, researchers can accurately identify and trace the origin and differentiation pathways of cells at various developmental stages, thereby illuminating the dynamic processes underlying tissue development and organogenesis. In cancer research, nucleic acid barcoding has played a pivotal role in analyzing the clonal architecture of tumor cells, exploring their heterogeneity and resistance mechanisms and enhancing our understanding of cancer evolution and inter‑clonal interactions. Furthermore, nucleic acid barcodes play a crucial role in stem cell research, enabling the tracking of stem cells from diverse origins and their derived progeny. This has offered novel perspectives on the mechanisms of stem cell self‑renewal and differentiation. The present review presented a comprehensive examination of the principles, applications and challenges associated with nucleic acid barcode technology.
Collapse
Affiliation(s)
- Ying Hong Wei
- Department of Clinical Laboratory, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi 530021, P.R. China
- State Key Laboratory of Targeting Oncology, National Center for International Research of Bio-targeting Theranostics, Guangxi Key Laboratory of Bio-targeting Theranostics, Collaborative Innovation Center for Targeting Tumor Diagnosis and Therapy, Guangxi Medical University, Nanning, Guangxi 530021, P.R. China
| | - Faquan Lin
- Department of Clinical Laboratory, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi 530021, P.R. China
| |
Collapse
|
2
|
Monge M, Giovanetti SM, Ravishankar A, Sadhu MJ. Highly replicated experiments studying complex genotypes using nested DNA barcodes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.18.643964. [PMID: 40166312 PMCID: PMC11956976 DOI: 10.1101/2025.03.18.643964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Many biological experiments involve studying the differences caused by genetic modifications, including genotypes composed of modifications at more than one locus. However, as the number and complexity of the genotypes increases, independently generating and tracking the necessary number of biological replicate samples becomes a major challenge. We developed a barcode-based method to track large numbers of independent replicates of combinatorial genotypes in a pooled format, enabling robust detection of subtle phenotypic differences. To construct a plasmid library of combinatorial genotypes, we utilized a nested serial cloning process to combine gene variants of interest that have associated DNA barcodes. The final plasmids each contain variants of multiple genes of interest, and a combined barcode that specifies the genotype of all the genes while also encoding a random sequence for tracking individual replicates. Sequencing of the pool of barcodes by next-generation sequencing allows the whole population to be studied in a single flask, enabling a high degree of replication even for complex genotypes. Using this approach, we tested the functionality of combinations of yeast, human, and null orthologs of the nucleotide excision repair factor I (NEF-1) complex and found that cells expressing all three yeast NEF-1 subunits had superior growth in DNA-damaging conditions. We also assessed the sensitivity of our method by simulating downsampling of barcodes across different degrees of phenotypic differentiation. Our results demonstrate the utility of NICR barcodes for high-throughput combinatorial genetic screens and provide a scalable framework for exploring complex genotype-phenotype relationships.
Collapse
Affiliation(s)
- Molly Monge
- Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
- Present address: Weill Medical College, Cornell University, New York, NY 10065, USA
| | - Simone M Giovanetti
- Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Apoorva Ravishankar
- Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
- Present address: MelioLabs Inc., 4655 Old Ironsides Dr Ste. 480, Santa Clara, CA 95054
| | - Meru J Sadhu
- Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
3
|
Saxe R, Stuart H, Marshall A, Abdullahi F, Chen Z, Emiliani F, McKenna A. Hierarchical Lineage Tracing Reveals Diverse Pathways of AML Treatment Resistance. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.27.640600. [PMID: 40093111 PMCID: PMC11908168 DOI: 10.1101/2025.02.27.640600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 03/19/2025]
Abstract
Cancer cells adapt to treatment, leading to the emergence of clones that are more aggressive and resistant to anti-cancer therapies. We have a limited understanding of the development of treatment resistance as we lack technologies to map the evolution of cancer under the selective pressure of treatment. To address this, we developed a hierarchical, dynamic lineage tracing method called FLARE (Following Lineage Adaptation and Resistance Evolution). We use this technique to track the progression of acute myeloid leukemia (AML) cell lines through exposure to Cytarabine (AraC), a front-line treatment in AML, in vitro and in vivo. We map distinct cellular lineages in murine and human AML cell lines predisposed to AraC persistence and/or resistance via the upregulation of cell adhesion and motility pathways. Additionally, we highlight the heritable expression of immunoproteasome 11S regulatory cap subunits as a potential mechanism aiding AML cell survival, proliferation, and immune escape in vivo. Finally, we validate the clinical relevance of these signatures in the TARGET-AML cohort, with a bisected response in blood and bone marrow. Our findings reveal a broad spectrum of resistance signatures attributed to significant cell transcriptional changes. To our knowledge, this is the first application of dynamic lineage tracing to unravel treatment response and resistance in cancer, and we expect FLARE to be a valuable tool in dissecting the evolution of resistance in a wide range of tumor types.
Collapse
Affiliation(s)
- Rachel Saxe
- Molecular and Systems Biology, Dartmouth College, Hanover, NH
- Molecular and Cellular Biology Program, Dartmouth College, Hanover, NH
| | - Hannah Stuart
- Molecular and Systems Biology, Dartmouth College, Hanover, NH
- Quantitative Biomedical Science Program, Dartmouth College, Lebanon, NH
| | - Abigail Marshall
- Molecular and Systems Biology, Dartmouth College, Hanover, NH
- Molecular and Cellular Biology Program, Dartmouth College, Hanover, NH
| | - Fahiima Abdullahi
- The Dartmouth MD-PhD Undergraduate Summer Fellowship Program, Lebanon, NH
| | - Zoë Chen
- Dartmouth Cancer Center, Dartmouth College, Lebanon, NH
| | - Francesco Emiliani
- Molecular and Systems Biology, Dartmouth College, Hanover, NH
- Molecular and Cellular Biology Program, Dartmouth College, Hanover, NH
| | - Aaron McKenna
- Molecular and Systems Biology, Dartmouth College, Hanover, NH
- Dartmouth Cancer Center, Dartmouth College, Lebanon, NH
| |
Collapse
|
4
|
Micallef P, Santamaría ML, Escobar M, Andersson D, Österlund T, Mouhanna P, Filges S, Johansson G, Fagman H, Vannas C, Ståhlberg A. Digital sequencing is improved by using structured unique molecular identifiers. Genome Biol 2025; 26:37. [PMID: 40001095 PMCID: PMC11853513 DOI: 10.1186/s13059-025-03504-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 02/12/2025] [Indexed: 02/27/2025] Open
Abstract
Digital sequencing uses unique molecular identifiers (UMIs) to correct for polymerase induced errors and amplification biases. Here, we design 19 different structured UMIs to minimize the capacity of primers to form non-specific PCR products during library construction using SiMSen-Seq, a PCR-based digital sequencing approach with flexible multiplexing capabilities suitable for tumor-informed mutation analysis. All structured UMI designs demonstrate enhanced assay performance compared with an unstructured reference UMI. The best performing structured UMI design shows significant improvements in all tested aspects of assay and sequencing performance with the ability to reliable detect low variant allele frequencies.
Collapse
Affiliation(s)
- Peter Micallef
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
- Department of Clinical Genetics and Genomics, Sahlgrenska University Hospital, Gothenburg, 413 45, Sweden
| | - Manuel Luna Santamaría
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
- Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg, 413 90, Sweden
| | - Mandy Escobar
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
| | - Daniel Andersson
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
| | - Tobias Österlund
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
- Department of Clinical Genetics and Genomics, Sahlgrenska University Hospital, Gothenburg, 413 45, Sweden
- Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg, 413 90, Sweden
| | - Pia Mouhanna
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
- Department of Oncology, Ryhov County Hospital, Jönköping, 551 85, Sweden
| | - Stefan Filges
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
- Simsen Diagnostics AB, 411 26, Gothenburg, Sweden
| | - Gustav Johansson
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
- Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg, 413 90, Sweden
- Simsen Diagnostics AB, 411 26, Gothenburg, Sweden
| | - Henrik Fagman
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
- Department of Clinical Pathology, Sahlgrenska University Hospital, Gothenburg, 413 45, Sweden
| | - Christoffer Vannas
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden
- Department of Oncology, Sahlgrenska University Hospital, Gothenburg, 413 45, Sweden
| | - Anders Ståhlberg
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Center for Cancer Research, Sahlgrenska Academy, University of Gothenburg, Gothenburg, 413 90, Sweden.
- Department of Clinical Genetics and Genomics, Sahlgrenska University Hospital, Gothenburg, 413 45, Sweden.
- Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg, 413 90, Sweden.
| |
Collapse
|
5
|
Gupta S, Sharma A. NucBalancer: streamlining barcode sequence selection for optimal sample pooling for sequencing. GIGABYTE 2024; 2024:gigabyte138. [PMID: 39430727 PMCID: PMC11488490 DOI: 10.46471/gigabyte.138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 10/08/2024] [Indexed: 10/22/2024] Open
Abstract
Recent advancements in next-generation sequencing (NGS) technologies have brought to the forefront the necessity for versatile, cost-effective tools capable of adapting to a rapidly evolving landscape. The emergence of numerous new sequencing platforms, each with unique sample preparation and sequencing requirements, underscores the importance of efficient barcode balancing for successful pooling and accurate demultiplexing of samples. Recently launched new sequencing systems claiming better affordability comparable to more established platforms further exemplifies these challenges, especially when libraries originally prepared for one platform need conversion to another. In response to this dynamic environment, we introduce NucBalancer, a Shiny app developed for the optimal selection of barcode sequences. While initially tailored to meet the nucleotide, composition challenges specific to G400 and T7 series sequencers, NucBalancer's utility significantly broadens to accommodate the varied demands of these new sequencing technologies. Its application is particularly crucial in single-cell genomics, enabling the adaptation of libraries, such as those prepared for 10x technology, to various sequencers including G400 and T7 series sequencers. NucBalancer efficiently balances nucleotide composition and sample concentrations, reducing biases and enhancing the reliability of NGS data across platforms. Its adaptability makes it invaluable for addressing sequencing challenges, ensuring effective barcode balancing for sample pooling on any platform. Availability and implementation NucBalancer is implemented in R and is available at https://github.com/ersgupta/NucBalancer. Additionally, a shiny interface is available at https://ersgupta.shinyapps.io/NucBalancer/.
Collapse
Affiliation(s)
- Saurabh Gupta
- Harry Perkins Institute of Medical Research, QEII Medical Centre and Centre for Medical Research, 6 Verdun Street, Nedlands, Perth, Western Australia, 6009, Australia
- Curtin Medical School, Curtin Health Innovation Research Institute (CHIRI), Curtin University, Perth, Western Australia, 6102, Australia
| | - Ankur Sharma
- Harry Perkins Institute of Medical Research, QEII Medical Centre and Centre for Medical Research, 6 Verdun Street, Nedlands, Perth, Western Australia, 6009, Australia
- Curtin Medical School, Curtin Health Innovation Research Institute (CHIRI), Curtin University, Perth, Western Australia, 6102, Australia
- Translational Genomics Program, Garvan Institute of Medical Research and Kinghorn Cancer Centre, Darlinghurst, New South Wales, 2010, Australia
| |
Collapse
|
6
|
Ardell S, Martsul A, Johnson MS, Kryazhimskiy S. Environment-independent distribution of mutational effects emerges from microscopic epistasis. Science 2024; 386:87-92. [PMID: 39361740 PMCID: PMC11580693 DOI: 10.1126/science.adn0753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 08/22/2024] [Indexed: 10/05/2024]
Abstract
Predicting how new mutations alter phenotypes is difficult because mutational effects vary across genotypes and environments. Recently discovered global epistasis, in which the fitness effects of mutations scale with the fitness of the background genotype, can improve predictions, but how the environment modulates this scaling is unknown. We measured the fitness effects of ~100 insertion mutations in 42 strains of Saccharomyces cerevisiae in six laboratory environments and found that the global epistasis scaling is nearly invariant across environments. Instead, the environment tunes one global parameter, the background fitness at which most mutations switch sign. As a consequence, the distribution of mutational effects is predictable across genotypes and environments. Our results suggest that the effective dimensionality of genotype-to-phenotype maps across environments is surprisingly low.
Collapse
Affiliation(s)
- Sarah Ardell
- Department of Ecology, Behavior and Evolution, University of California San Diego, La Jolla, CA 92093
| | - Alena Martsul
- Department of Ecology, Behavior and Evolution, University of California San Diego, La Jolla, CA 92093
| | - Milo S. Johnson
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Sergey Kryazhimskiy
- Department of Ecology, Behavior and Evolution, University of California San Diego, La Jolla, CA 92093
| |
Collapse
|
7
|
McGee RS, Kinsler G, Petrov D, Tikhonov M. Improving the Accuracy of Bulk Fitness Assays by Correcting Barcode Processing Biases. Mol Biol Evol 2024; 41:msae152. [PMID: 39041198 PMCID: PMC11316221 DOI: 10.1093/molbev/msae152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 06/28/2024] [Accepted: 07/11/2024] [Indexed: 07/24/2024] Open
Abstract
Measuring the fitnesses of genetic variants is a fundamental objective in evolutionary biology. A standard approach for measuring microbial fitnesses in bulk involves labeling a library of genetic variants with unique sequence barcodes, competing the labeled strains in batch culture, and using deep sequencing to track changes in the barcode abundances over time. However, idiosyncratic properties of barcodes can induce nonuniform amplification or uneven sequencing coverage that causes some barcodes to be over- or under-represented in samples. This systematic bias can result in erroneous read count trajectories and misestimates of fitness. Here, we develop a computational method, named REBAR (Removing the Effects of Bias through Analysis of Residuals), for inferring the effects of barcode processing bias by leveraging the structure of systematic deviations in the data. We illustrate this approach by applying it to two independent data sets, and demonstrate that this method estimates and corrects for bias more accurately than standard proxies, such as GC-based corrections. REBAR mitigates bias and improves fitness estimates in high-throughput assays without introducing additional complexity to the experimental protocols, with potential applications in a range of experimental evolution and mutation screening contexts.
Collapse
Affiliation(s)
| | - Grant Kinsler
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, USA
| | - Dmitri Petrov
- Department of Biology, Stanford University, Palo Alto, CA, USA
| | - Mikhail Tikhonov
- Department of Physics, Washington University, St. Louis, MO, USA
| |
Collapse
|
8
|
Sullivan DK, Pachter L. Flexible parsing, interpretation, and editing of technical sequences with splitcode. Bioinformatics 2024; 40:btae331. [PMID: 38876979 PMCID: PMC11193061 DOI: 10.1093/bioinformatics/btae331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 03/14/2024] [Accepted: 06/12/2024] [Indexed: 06/16/2024] Open
Abstract
MOTIVATION Next-generation sequencing libraries are constructed with numerous synthetic constructs such as sequencing adapters, barcodes, and unique molecular identifiers. Such sequences can be essential for interpreting results of sequencing assays, and when they contain information pertinent to an experiment, they must be processed and analyzed. RESULTS We present a tool called splitcode, that enables flexible and efficient parsing, interpreting, and editing of sequencing reads. This versatile tool facilitates simple, reproducible preprocessing of reads from libraries constructed for a large array of single-cell and bulk sequencing assays. AVAILABILITY AND IMPLEMENTATION The splitcode program is available at http://github.com/pachterlab/splitcode.
Collapse
Affiliation(s)
- Delaney K Sullivan
- UCLA-Caltech Medical Scientist Training Program, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, United States
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, United States
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, United States
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, United States
| |
Collapse
|
9
|
Saunders SH, Ahmed AM. ORBIT for E. coli: kilobase-scale oligonucleotide recombineering at high throughput and high efficiency. Nucleic Acids Res 2024; 52:e43. [PMID: 38587185 PMCID: PMC11077079 DOI: 10.1093/nar/gkae227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 02/28/2024] [Accepted: 03/19/2024] [Indexed: 04/09/2024] Open
Abstract
Microbiology and synthetic biology depend on reverse genetic approaches to manipulate bacterial genomes; however, existing methods require molecular biology to generate genomic homology, suffer from low efficiency, and are not easily scaled to high throughput. To overcome these limitations, we developed a system for creating kilobase-scale genomic modifications that uses DNA oligonucleotides to direct the integration of a non-replicating plasmid. This method, Oligonucleotide Recombineering followed by Bxb-1 Integrase Targeting (ORBIT) was pioneered in Mycobacteria, and here we adapt and expand it for Escherichia coli. Our redesigned plasmid toolkit for oligonucleotide recombineering achieved significantly higher efficiency than λ Red double-stranded DNA recombineering and enabled precise, stable knockouts (≤134 kb) and integrations (≤11 kb) of various sizes. Additionally, we constructed multi-mutants in a single transformation, using orthogonal attachment sites. At high throughput, we used pools of targeting oligonucleotides to knock out nearly all known transcription factor and small RNA genes, yielding accurate, genome-wide, single mutant libraries. By counting genomic barcodes, we also show ORBIT libraries can scale to thousands of unique members (>30k). This work demonstrates that ORBIT for E. coli is a flexible reverse genetic system that facilitates rapid construction of complex strains and readily scales to create sophisticated mutant libraries.
Collapse
Affiliation(s)
- Scott H Saunders
- Green Center for Systems Biology - Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX 75320, USA
| | - Ayesha M Ahmed
- Green Center for Systems Biology - Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX 75320, USA
| |
Collapse
|
10
|
Park S, Kim M, Lee JW. Optimizing Nucleic Acid Delivery Systems through Barcode Technology. ACS Synth Biol 2024; 13:1006-1018. [PMID: 38526308 DOI: 10.1021/acssynbio.3c00602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/26/2024]
Abstract
Conventional biological experiments often focus on in vitro assays because of the inherent limitations when handling multiple variables in vivo, including labor-intensive and time-consuming procedures. Often only a subset of samples demonstrating significant efficacy in the in vitro assays can be evaluated in vivo. Nonetheless, because of the low correlation between the in vitro and in vivo tests, evaluation of the variables under examination in vivo and not solely in vitro is critical. An emerging approach to achieve high-throughput in vivo tests involves using a barcode system consisting of various nucleotide combinations. Unique barcodes for each variant enable the simultaneous testing of multiple entities, eliminating the need for separate individual tests. Subsequently, to identify crucial parameters, samples were collected and analyzed using barcode sequencing. This review explores the development of barcode design and its applications, including the evaluation of nucleic acid delivery systems and the optimization of gene expression in vivo.
Collapse
Affiliation(s)
- Soan Park
- Department of Chemical Engineering, Pohang University of Science and Technology, 77 CheongamRo, Gyeongbuk, 37673 NamGu, Pohang, Republic of Korea
| | - Mibang Kim
- Department of Chemical Engineering, Pohang University of Science and Technology, 77 CheongamRo, Gyeongbuk, 37673 NamGu, Pohang, Republic of Korea
| | - Jeong Wook Lee
- Department of Chemical Engineering, Pohang University of Science and Technology, 77 CheongamRo, Gyeongbuk, 37673 NamGu, Pohang, Republic of Korea
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, 77 CheongamRo, Gyeongbuk, 37673 NamGu, Pohang, Republic of Korea
| |
Collapse
|
11
|
Andersson D, Kebede FT, Escobar M, Österlund T, Ståhlberg A. Principles of digital sequencing using unique molecular identifiers. Mol Aspects Med 2024; 96:101253. [PMID: 38367531 DOI: 10.1016/j.mam.2024.101253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 01/26/2024] [Accepted: 02/03/2024] [Indexed: 02/19/2024]
Abstract
Massively parallel sequencing technologies have long been used in both basic research and clinical routine. The recent introduction of digital sequencing has made previously challenging applications possible by significantly improving sensitivity and specificity to now allow detection of rare sequence variants, even at single molecule level. Digital sequencing utilizes unique molecular identifiers (UMIs) to minimize sequencing-induced errors and quantification biases. Here, we discuss the principles of UMIs and how they are used in digital sequencing. We outline the properties of different UMI types and the consequences of various UMI approaches in relation to experimental protocols and bioinformatics. Finally, we describe how digital sequencing can be applied in specific research fields, focusing on cancer management where it can be used in screening of asymptomatic individuals, diagnosis, treatment prediction, prognostication, monitoring treatment efficacy and early detection of treatment resistance as well as relapse.
Collapse
Affiliation(s)
- Daniel Andersson
- Sahlgrenska Center for Cancer Research, Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, 413 90, Gothenburg, Sweden
| | - Firaol Tamiru Kebede
- Sahlgrenska Center for Cancer Research, Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, 413 90, Gothenburg, Sweden
| | - Mandy Escobar
- Sahlgrenska Center for Cancer Research, Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, 413 90, Gothenburg, Sweden
| | - Tobias Österlund
- Sahlgrenska Center for Cancer Research, Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, 413 90, Gothenburg, Sweden; Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, 413 90, Gothenburg, Sweden; Department of Clinical Genetics and Genomics, Sahlgrenska University Hospital, 413 45, Gothenburg, Sweden
| | - Anders Ståhlberg
- Sahlgrenska Center for Cancer Research, Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, 413 90, Gothenburg, Sweden; Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, 413 90, Gothenburg, Sweden; Department of Clinical Genetics and Genomics, Sahlgrenska University Hospital, 413 45, Gothenburg, Sweden.
| |
Collapse
|
12
|
Daniel BBJ, Steiger Y, Sintsova A, Field CM, Nguyen BD, Schubert C, Cherrak Y, Sunagawa S, Hardt WD, Vorholt JA. Assessing microbiome population dynamics using wild-type isogenic standardized hybrid (WISH)-tags. Nat Microbiol 2024; 9:1103-1116. [PMID: 38503975 PMCID: PMC10994841 DOI: 10.1038/s41564-024-01634-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 02/09/2024] [Indexed: 03/21/2024]
Abstract
Microbiomes feature recurrent compositional structures under given environmental conditions. However, these patterns may conceal diverse underlying population dynamics that require intrastrain resolution. Here we developed a genomic tagging system, termed wild-type isogenic standardized hybrid (WISH)-tags, that can be combined with quantitative polymerase chain reaction and next-generation sequencing for microbial strain enumeration. We experimentally validated the performance of 62 tags and showed that they can be differentiated with high precision. WISH-tags were introduced into model and non-model bacterial members of the mouse and plant microbiota. Intrastrain priority effects were tested using one species of isogenic barcoded bacteria in the murine gut and the Arabidopsis phyllosphere, both with and without microbiota context. We observed colonization resistance against late-arriving strains of Salmonella Typhimurium in the mouse gut, whereas the phyllosphere accommodated Sphingomonas latecomers in a manner proportional to their presence at the late inoculation timepoint. This demonstrates that WISH-tags are a resource for deciphering population dynamics underlying microbiome assembly across biological systems.
Collapse
Affiliation(s)
| | - Yves Steiger
- Institute of Microbiology, ETH Zurich, Zurich, Switzerland
| | - Anna Sintsova
- Institute of Microbiology, ETH Zurich, Zurich, Switzerland
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Sun W, Perkins M, Huyghe M, Faraldo MM, Fre S, Perié L, Lyne AM. Extracting, filtering and simulating cellular barcodes using CellBarcode tools. NATURE COMPUTATIONAL SCIENCE 2024; 4:128-143. [PMID: 38374363 PMCID: PMC10899113 DOI: 10.1038/s43588-024-00595-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 01/16/2024] [Indexed: 02/21/2024]
Abstract
Identifying true DNA cellular barcodes among polymerase chain reaction and sequencing errors is challenging. Current tools are restricted in the diversity of barcode types supported or the analysis strategies implemented. As such, there is a need for more versatile and efficient tools for barcode extraction, as well as for tools to investigate which factors impact barcode detection and which filtering strategies to best apply. Here we introduce the package CellBarcode and its barcode simulation kit, CellBarcodeSim, that allows efficient and versatile barcode extraction and filtering for a range of barcode types from bulk or single-cell sequencing data using a variety of filtering strategies. Using the barcode simulation kit and biological data, we explore the technical and biological factors influencing barcode identification and provide a decision tree on how to optimize barcode identification for different barcode settings. We believe that CellBarcode and CellBarcodeSim have the capability to enhance the reproducibility and interpretation of barcode results across studies.
Collapse
Affiliation(s)
- Wenjie Sun
- Institut Curie, Université PSL, Sorbonne Université, CNRS UMR168, Laboratoire Physico Chimie Curie, Paris, France.
| | - Meghan Perkins
- Institut Curie, Laboratory of Genetics and Developmental Biology, PSL Research University, INSERM U934, CNRS UMR3215, Paris, France
| | - Mathilde Huyghe
- Institut Curie, Laboratory of Genetics and Developmental Biology, PSL Research University, INSERM U934, CNRS UMR3215, Paris, France
| | - Marisa M Faraldo
- Institut Curie, Laboratory of Genetics and Developmental Biology, PSL Research University, INSERM U934, CNRS UMR3215, Paris, France
| | - Silvia Fre
- Institut Curie, Laboratory of Genetics and Developmental Biology, PSL Research University, INSERM U934, CNRS UMR3215, Paris, France
| | - Leïla Perié
- Institut Curie, Université PSL, Sorbonne Université, CNRS UMR168, Laboratoire Physico Chimie Curie, Paris, France.
| | - Anne-Marie Lyne
- Institut Curie, Université PSL, Sorbonne Université, CNRS UMR168, Laboratoire Physico Chimie Curie, Paris, France.
- INSERM U900, Paris, France.
- MINES ParisTech, CBIO-Centre for Computational Biology, PSL Research University, Paris, France.
| |
Collapse
|
14
|
Sullivan DK, Pachter L. Flexible parsing, interpretation, and editing of technical sequences with splitcode. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.20.533521. [PMID: 36993532 PMCID: PMC10055216 DOI: 10.1101/2023.03.20.533521] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/31/2023]
Abstract
Next-generation sequencing libraries are constructed with numerous synthetic constructs such as sequencing adapters, barcodes, and unique molecular identifiers. Such sequences can be essential for interpreting results of sequencing assays, and when they contain information pertinent to an experiment, they must be processed and analyzed. We present a tool called splitcode, that enables flexible and efficient parsing, interpreting, and editing of sequencing reads. This versatile tool facilitates simple, reproducible preprocessing of reads from libraries constructed for a large array of single-cell and bulk sequencing assays.
Collapse
Affiliation(s)
- Delaney K. Sullivan
- UCLA-Caltech Medical Scientist Training Program, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, 91125, USA
| |
Collapse
|
15
|
Theodosiou L, Farr AD, Rainey PB. Barcoding Populations of Pseudomonas fluorescens SBW25. J Mol Evol 2023; 91:254-262. [PMID: 37186220 PMCID: PMC10275814 DOI: 10.1007/s00239-023-10103-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 03/13/2023] [Indexed: 05/17/2023]
Abstract
In recent years, evolutionary biologists have developed an increasing interest in the use of barcoding strategies to study eco-evolutionary dynamics of lineages within evolving populations and communities. Although barcoded populations can deliver unprecedented insight into evolutionary change, barcoding microbes presents specific technical challenges. Here, strategies are described for barcoding populations of the model bacterium Pseudomonas fluorescens SBW25, including the design and cloning of barcoded regions, preparation of libraries for amplicon sequencing, and quantification of resulting barcoded lineages. In so doing, we hope to aid the design and implementation of barcoding methodologies in a broad range of model and non-model organisms.
Collapse
Affiliation(s)
- Loukas Theodosiou
- Department of Microbial Population Biology, Max Planck Institute for Evolutionary Biology, Plön, Germany.
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding, Cologne, Germany.
| | - Andrew D Farr
- Department of Microbial Population Biology, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Paul B Rainey
- Department of Microbial Population Biology, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Laboratory of Biophysics and Evolution, CBI, ESPCI Paris, Université PSL, CNRS, Paris, France
| |
Collapse
|
16
|
Geiler-Samerotte K, Lang GI. Best Practices in Microbial Experimental Evolution. J Mol Evol 2023; 91:237-240. [PMID: 37209159 PMCID: PMC10885815 DOI: 10.1007/s00239-023-10119-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 05/11/2023] [Indexed: 05/22/2023]
Affiliation(s)
- Kerry Geiler-Samerotte
- School of Life Sciences and Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ, 85287, USA.
| | - Gregory I Lang
- Department of Biological Sciences, Lehigh University, Bethlehem, PA, 18015, USA
| |
Collapse
|
17
|
Kinsler G, Schmidlin K, Newell D, Eder R, Apodaca S, Lam G, Petrov D, Geiler-Samerotte K. Extreme Sensitivity of Fitness to Environmental Conditions: Lessons from #1BigBatch. J Mol Evol 2023; 91:293-310. [PMID: 37237236 PMCID: PMC10276131 DOI: 10.1007/s00239-023-10114-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 04/30/2023] [Indexed: 05/28/2023]
Abstract
The phrase "survival of the fittest" has become an iconic descriptor of how natural selection works. And yet, precisely measuring fitness, even for single-celled microbial populations growing in controlled laboratory conditions, remains a challenge. While numerous methods exist to perform these measurements, including recently developed methods utilizing DNA barcodes, all methods are limited in their precision to differentiate strains with small fitness differences. In this study, we rule out some major sources of imprecision, but still find that fitness measurements vary substantially from replicate to replicate. Our data suggest that very subtle and difficult to avoid environmental differences between replicates create systematic variation across fitness measurements. We conclude by discussing how fitness measurements should be interpreted given their extreme environment dependence. This work was inspired by the scientific community who followed us and gave us tips as we live tweeted a high-replicate fitness measurement experiment at #1BigBatch.
Collapse
Affiliation(s)
| | - Kara Schmidlin
- Center for Mechanisms of Evolution, Arizona State University, Tempe, USA
| | - Daphne Newell
- Center for Mechanisms of Evolution, Arizona State University, Tempe, USA
- School of Life Sciences, Arizona State University, Tempe, USA
| | - Rachel Eder
- Center for Mechanisms of Evolution, Arizona State University, Tempe, USA
- School of Life Sciences, Arizona State University, Tempe, USA
| | - Sam Apodaca
- Center for Mechanisms of Evolution, Arizona State University, Tempe, USA
- School of Life Sciences, Arizona State University, Tempe, USA
| | | | | | - Kerry Geiler-Samerotte
- Center for Mechanisms of Evolution, Arizona State University, Tempe, USA.
- School of Life Sciences, Arizona State University, Tempe, USA.
| |
Collapse
|
18
|
Spealman P, De T, Chuong JN, Gresham D. Best Practices in Microbial Experimental Evolution: Using Reporters and Long-Read Sequencing to Identify Copy Number Variation in Experimental Evolution. J Mol Evol 2023; 91:356-368. [PMID: 37012421 PMCID: PMC10275804 DOI: 10.1007/s00239-023-10102-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 02/21/2023] [Indexed: 04/05/2023]
Abstract
Copy number variants (CNVs), comprising gene amplifications and deletions, are a pervasive class of heritable variation. CNVs play a key role in rapid adaptation in both natural, and experimental, evolution. However, despite the advent of new DNA sequencing technologies, detection and quantification of CNVs in heterogeneous populations has remained challenging. Here, we summarize recent advances in the use of CNV reporters that provide a facile means of quantifying de novo CNVs at a specific locus in the genome, and nanopore sequencing, for resolving the often complex structures of CNVs. We provide guidance for the engineering and analysis of CNV reporters and practical guidelines for single-cell analysis of CNVs using flow cytometry. We summarize recent advances in nanopore sequencing, discuss the utility of this technology, and provide guidance for the bioinformatic analysis of these data to define the molecular structure of CNVs. The combination of reporter systems for tracking and isolating CNV lineages and long-read DNA sequencing for characterizing CNV structures enables unprecedented resolution of the mechanisms by which CNVs are generated and their evolutionary dynamics.
Collapse
Affiliation(s)
- Pieter Spealman
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Titir De
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Julie N Chuong
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - David Gresham
- Department of Biology, New York University, New York, NY, 10003, USA.
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA.
| |
Collapse
|