1
|
Syed S, Boland BS, Bourke LT, Chen LA, Churchill L, Dobes A, Greene A, Heller C, Jayson C, Kostiuk B, Moss A, Najdawi F, Plung L, Rioux JD, Rosen MJ, Torres J, Zulqarnain F, Satsangi J. Challenges in IBD Research 2024: Precision Medicine. Inflamm Bowel Dis 2024; 30:S39-S54. [PMID: 38778628 DOI: 10.1093/ibd/izae084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Indexed: 05/25/2024]
Abstract
Precision medicine is part of 5 focus areas of the Challenges in IBD Research 2024 research document, which also includes preclinical human IBD mechanisms, environmental triggers, novel technologies, and pragmatic clinical research. Building on Challenges in IBD Research 2019, the current Challenges aims to provide a comprehensive overview of current gaps in inflammatory bowel diseases (IBDs) research and deliver actionable approaches to address them with a focus on how these gaps can lead to advancements in interception, remission, and restoration for these diseases. The document is the result of multidisciplinary input from scientists, clinicians, patients, and funders, and represents a valuable resource for patient-centric research prioritization. In particular, the precision medicine section is focused on the main research gaps in elucidating how to bring the best care to the individual patient in IBD. Research gaps were identified in biomarker discovery and validation for predicting disease progression and choosing the most appropriate treatment for each patient. Other gaps were identified in making the best use of existing patient biosamples and clinical data, developing new technologies to analyze large datasets, and overcoming regulatory and payer hurdles to enable clinical use of biomarkers. To address these gaps, the Workgroup suggests focusing on thoroughly validating existing candidate biomarkers, using best-in-class data generation and analysis tools, and establishing cross-disciplinary teams to tackle regulatory hurdles as early as possible. Altogether, the precision medicine group recognizes the importance of bringing basic scientific biomarker discovery and translating it into the clinic to help improve the lives of IBD patients.
Collapse
Affiliation(s)
- Sana Syed
- Department of Pediatrics, University of Virginia, Charlottesville, VA, USA
- Patient representative for Crohn's & Colitis Foundation, New York, NY, USA
| | - Brigid S Boland
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Lauren T Bourke
- Precision Medicine Drug Development, Early Respiratory and Immunology, AstraZeneca, Boston, MA, USA
| | - Lea Ann Chen
- Division of Gastroenterology, Department of Medicine, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, USA
| | - Laurie Churchill
- Leona M. and Harry B. Helmsley Charitable Trust, New York, NY, USA
| | | | - Adam Greene
- Department of Pediatrics, University of Virginia, Charlottesville, VA, USA
| | | | | | | | - Alan Moss
- Crohn's & Colitis Foundation, New York, NY, USA
| | | | - Lori Plung
- Patient representative for Crohn's & Colitis Foundation, New York, NY, USA
| | - John D Rioux
- Research Center, Montreal Heart Institute, Université de Montréal, Montréal, Québec, Canada
| | - Michael J Rosen
- Division of Pediatric Gastroenterology, Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Joana Torres
- Division of Gastroenterology, Hospital Beatriz Ângelo, Hospital da Luz, Lisbon, Portugal
- Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Fatima Zulqarnain
- Department of Pediatrics, University of Virginia, Charlottesville, VA, USA
| | - Jack Satsangi
- Translational Gastroenterology Unit, Experimental Medicine Division, Nuffield Department of Medicine, John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
2
|
Aleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Cherry JM, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, dos Santos G, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Gramates LS, Grove CA, Hale P, Harris T, Hayman GT, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Van Auken K, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, Zytkovicz M. Updates to the Alliance of Genome Resources central infrastructure. Genetics 2024; 227:iyae049. [PMID: 38552170 DOI: 10.1093/genetics/iyae049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 02/28/2024] [Accepted: 02/29/2024] [Indexed: 04/09/2024] Open
Abstract
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, Caenorhabditis elegans, Drosophila, zebrafish, frog, laboratory mouse, laboratory rat, and the Gene Ontology Consortium. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and application programming interfaces (APIs). Here, we focus on developments over the last 2 years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse software. We describe our progress toward a central persistent database to support curation, the data modeling that underpins harmonization, and progress toward a state-of-the-art literature curation system with integrated artificial intelligence and machine learning (AI/ML).
Collapse
Affiliation(s)
| | | | | | - Giulia Antonazzo
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Valerio Arnaboldi
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Helen Attrill
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Andrés Becerra
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Susan M Bello
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Olin Blodgett
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | | | - Carol J Bult
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Scott Cain
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Brian R Calvi
- Department of Biology, Indiana University , Bloomington, IN 47408 , USA
| | - Seth Carbon
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory , Berkeley, CA
| | - Juancarlos Chan
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Wen J Chen
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - J Michael Cherry
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Jaehyoung Cho
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Madeline A Crosby
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Jeffrey L De Pons
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | | | - Stavros Diamantakis
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Mary E Dolan
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gilberto dos Santos
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Sarah Dyer
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Dustin Ebert
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Stacia R Engel
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - David Fashena
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Malcolm Fisher
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Saoirse Foley
- Department of Biological Sciences, Carnegie Mellon University , 5000 Forbes Ave, Pittsburgh, PA 15203
| | - Adam C Gibson
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Varun R Gollapally
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - L Sian Gramates
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Christian A Grove
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Paul Hale
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Todd Harris
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - G Thomas Hayman
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Yanhui Hu
- Department of Genetics, Howard Hughes Medical Institute , Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115 , USA
| | - Christina James-Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Kamran Karimi
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Kalpana Karra
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Ranjana Kishore
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Anne E Kwitek
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Stanley J F Laulederkind
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Raymond Lee
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Ian Longden
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Manuel Luypaert
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Nicholas Markarian
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Steven J Marygold
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Beverley Matthews
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Monica S McAndrews
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gillian Millburn
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Stuart Miyasato
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Howie Motenko
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Sierra Moxon
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory , Berkeley, CA
| | - Hans-Michael Muller
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Christopher J Mungall
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory , Berkeley, CA
| | - Anushya Muruganujan
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Tremayne Mushayahama
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Robert S Nash
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Paulo Nuin
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Holly Paddock
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Troy Pells
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Norbert Perrimon
- Department of Genetics, Howard Hughes Medical Institute , Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115 , USA
| | - Christian Pich
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Mark Quinton-Tulloch
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Daniela Raciti
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | | | | | - Susan Russo Gelbart
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Leyla Ruzicka
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Gary Schindelman
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - David R Shaw
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Ajay Shrivatsav
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Amy Singer
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Constance M Smith
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Cynthia L Smith
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Jennifer R Smith
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Lincoln Stein
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Paul W Sternberg
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Christopher J Tabone
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Paul D Thomas
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Ketaki Thorat
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Jyothi Thota
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Monika Tomczuk
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Vitor Trovisco
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Marek A Tutaj
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Jose-Maria Urbano
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Kimberly Van Auken
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Ceri E Van Slyke
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Peter D Vize
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Qinghua Wang
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Shuai Weng
- Department of Genetics, Stanford University , Stanford, CA 94305
| | | | - Laurens G Wilming
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Edith D Wong
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Adam Wright
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Karen Yook
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Pinglei Zhou
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Aaron Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Mark Zytkovicz
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| |
Collapse
|
3
|
Rangwala SH, Rudnev DV, Ananiev VV, Oh DH, Asztalos A, Benica B, Borodin EA, Bouk N, Evgeniev VI, Kodali VK, Lotov V, Mozes E, Omelchenko MV, Savkina S, Sukharnikov E, Virothaisakun J, Murphy TD, Pruitt KD, Schneider VA. The NCBI Comparative Genome Viewer (CGV) is an interactive visualization tool for the analysis of whole-genome eukaryotic alignments. PLoS Biol 2024; 22:e3002405. [PMID: 38713717 PMCID: PMC11101090 DOI: 10.1371/journal.pbio.3002405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 05/17/2024] [Accepted: 04/08/2024] [Indexed: 05/09/2024] Open
Abstract
We report a new visualization tool for analysis of whole-genome assembly-assembly alignments, the Comparative Genome Viewer (CGV) (https://ncbi.nlm.nih.gov/genome/cgv/). CGV visualizes pairwise same-species and cross-species alignments provided by National Center for Biotechnology Information (NCBI) using assembly alignment algorithms developed by us and others. Researchers can examine large structural differences spanning chromosomes, such as inversions or translocations. Users can also navigate to regions of interest, where they can detect and analyze smaller-scale deletions and rearrangements within specific chromosome or gene regions. RefSeq or user-provided gene annotation is displayed where available. CGV currently provides approximately 800 alignments from over 350 animal, plant, and fungal species. CGV and related NCBI viewers are undergoing active development to further meet needs of the research community in comparative genome visualization.
Collapse
Affiliation(s)
- Sanjida H. Rangwala
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Dmitry V. Rudnev
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Victor V. Ananiev
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Dong-Ha Oh
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Andrea Asztalos
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Barrett Benica
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Evgeny A. Borodin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Nathan Bouk
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Vladislav I. Evgeniev
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Vamsi K. Kodali
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Vadim Lotov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Eyal Mozes
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Marina V. Omelchenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Sofya Savkina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Ekaterina Sukharnikov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Joël Virothaisakun
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Terence D. Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Kim D. Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Valerie A. Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| |
Collapse
|
4
|
Astashyn A, Tvedte ES, Sweeney D, Sapojnikov V, Bouk N, Joukov V, Mozes E, Strope PK, Sylla PM, Wagner L, Bidwell SL, Brown LC, Clark K, Davis EW, Smith-White B, Hlavina W, Pruitt KD, Schneider VA, Murphy TD. Rapid and sensitive detection of genome contamination at scale with FCS-GX. Genome Biol 2024; 25:60. [PMID: 38409096 PMCID: PMC10898089 DOI: 10.1186/s13059-024-03198-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 02/15/2024] [Indexed: 02/28/2024] Open
Abstract
Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI's Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1-10 min. Testing FCS-GX on artificially fragmented genomes demonstrates high sensitivity and specificity for diverse contaminant species. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination, comprising 0.16% of total bases, with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/ or https://doi.org/10.5281/zenodo.10651084 .
Collapse
Affiliation(s)
- Alexander Astashyn
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Eric S Tvedte
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Deacon Sweeney
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Victor Sapojnikov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Nathan Bouk
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Victor Joukov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Eyal Mozes
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Pooja K Strope
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Pape M Sylla
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Lukas Wagner
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Shelby L Bidwell
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Larissa C Brown
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Karen Clark
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Emily W Davis
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Brian Smith-White
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Wratko Hlavina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Valerie A Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
5
|
Sayers E, Beck J, Bolton E, Brister J, Chan J, Comeau D, Connor R, DiCuccio M, Farrell C, Feldgarden M, Fine A, Funk K, Hatcher E, Hoeppner M, Kane M, Kannan S, Katz K, Kelly C, Klimke W, Kim S, Kimchi A, Landrum M, Lathrop S, Lu Z, Malheiro A, Marchler-Bauer A, Murphy T, Phan L, Prasad A, Pujar S, Sawyer A, Schmieder E, Schneider V, Schoch C, Sharma S, Thibaud-Nissen F, Trawick B, Venkatapathi T, Wang J, Pruitt K, Sherry S. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2024; 52:D33-D43. [PMID: 37994677 PMCID: PMC10767890 DOI: 10.1093/nar/gkad1044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 10/20/2023] [Accepted: 10/23/2023] [Indexed: 11/24/2023] Open
Abstract
The National Center for Biotechnology Information (NCBI) provides online information resources for biology, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. NCBI provides search and retrieval operations for most of these data from 35 distinct databases. The E-utilities serve as the programming interface for most of these databases. Resources receiving significant updates in the past year include PubMed, PMC, Bookshelf, SciENcv, the NIH Comparative Genomics Resource (CGR), NCBI Virus, SRA, RefSeq, foreign contamination screening tools, Taxonomy, iCn3D, ClinVar, GTR, MedGen, dbSNP, ALFA, ClinicalTrials.gov, Pathogen Detection, antimicrobial resistance resources, and PubChem. These resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.
Collapse
Affiliation(s)
- Eric W Sayers
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Jeff Beck
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Evan E Bolton
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - J Rodney Brister
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Jessica Chan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Donald C Comeau
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Ryan Connor
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Michael DiCuccio
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Catherine M Farrell
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Michael Feldgarden
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Anna M Fine
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Kathryn Funk
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Eneida Hatcher
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Marilu Hoeppner
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Megan Kane
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Sivakumar Kannan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Kenneth S Katz
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Christopher Kelly
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - William Klimke
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Sunghwan Kim
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Avi Kimchi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Melissa Landrum
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Stacy Lathrop
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Zhiyong Lu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Adriana Malheiro
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Aron Marchler-Bauer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Lon Phan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Arjun B Prasad
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Shashikant Pujar
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Amanda Sawyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Erin Schmieder
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Valerie A Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Conrad L Schoch
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Shobha Sharma
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Barton W Trawick
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Thilakam Venkatapathi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Jiyao Wang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Stephen T Sherry
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| |
Collapse
|
6
|
Rangwala SH, Rudnev DV, Ananiev VV, Asztalos A, Benica B, Borodin EA, Bouk N, Evgeniev VI, Kodali VK, Lotov V, Mozes E, Oh DH, Omelchenko MV, Savkina S, Sukharnikov E, Virothaisakun J, Murphy TD, Pruitt KD, Schneider VA. Interactive visualization of whole eukaryote genome alignments using NCBI's Comparative Genome Viewer (CGV). BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.30.564672. [PMID: 38077029 PMCID: PMC10705539 DOI: 10.1101/2023.10.30.564672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
We report a new visualization tool for analysis of whole genome assembly-assembly alignments, the Comparative Genome Viewer (CGV) (https://ncbi.nlm.nih.gov/genome/cgv/). CGV visualizes pairwise same-species and cross-species alignments provided by NCBI using assembly alignment algorithms developed by us and others. Researchers can examine the alignments between the two assemblies using two alternate views: a chromosome ideogram-based view or a 2D genome dotplot. Whole genome alignment views expose large structural differences spanning chromosomes, such as inversions or translocations. Users can also navigate to regions of interest, where they can detect and analyze smaller-scale deletions and rearrangements within specific chromosome or gene regions. RefSeq or user-provided gene annotation is displayed in the ideogram view where available. CGV currently provides approximately 700 alignments from over 300 animal, plant, and fungal species. CGV and related NCBI viewers are undergoing active development to further meet needs of the research community in comparative genome visualization.
Collapse
Affiliation(s)
- Sanjida H Rangwala
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Dmitry V Rudnev
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Victor V Ananiev
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Andrea Asztalos
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Barrett Benica
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Evgeny A Borodin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Nathan Bouk
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Vladislav I Evgeniev
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Vamsi K Kodali
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Vadim Lotov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Eyal Mozes
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Dong-Ha Oh
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Marina V Omelchenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Sofya Savkina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Ekaterina Sukharnikov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Joël Virothaisakun
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Terence D. Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Valerie A. Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD 20894, USA
| |
Collapse
|
7
|
Aleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Michael Cherry J, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, Santos GD, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Sian Gramates L, Grove CA, Hale P, Harris T, Thomas Hayman G, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Auken KV, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, Zytkovicz M. Updates to the Alliance of Genome Resources Central Infrastructure Alliance of Genome Resources Consortium. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.20.567935. [PMID: 38045425 PMCID: PMC10690154 DOI: 10.1101/2023.11.20.567935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively-studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, C. elegans, Drosophila, zebrafish, frog, laboratory mouse, laboratory rat, and the Gene Ontology Consortium. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and APIs. Here we focus on developments over the last two years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse. We describe our progress towards a central persistent database to support curation, the data modeling that underpins harmonization, and progress towards a state-of-the art literature curation system with integrated Artificial Intelligence and Machine Learning (AI/ML).
Collapse
|