Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Morales J, Pujar S, Loveland JE, Astashyn A, Bennett R, Berry A, Cox E, Davidson C, Ermolaeva O, Farrell CM, Fatima R, Gil L, Goldfarb T, Gonzalez JM, Haddad D, Hardy M, Hunt T, Jackson J, Joardar VS, Kay M, Kodali VK, McGarvey KM, McMahon A, Mudge JM, Murphy DN, Murphy MR, Rajput B, Rangwala SH, Riddick LD, Thibaud-Nissen F, Threadgold G, Vatsan AR, Wallin C, Webb D, Flicek P, Birney E, Pruitt KD, Frankish A, Cunningham F, Murphy TD. A joint NCBI and EMBL-EBI transcript set for clinical genomics and research. Nature 2022;604:310-315. [PMID: 35388217 PMCID: PMC9007741 DOI: 10.1038/s41586-022-04558-8] [Citation(s) in RCA: 136] [Impact Index Per Article: 68.0] [Received: 07/13/2021] [Accepted: 02/07/2022] [Indexed: 12/25/2022]

Cited by Other Article(s)

Horste EL, Fansler MM, Cai T, Chen X, Mitschka S, Zhen G, Lee FCY, Ule J, Mayr C. Subcytoplasmic location of translation controls protein output. Mol Cell 2023;83:4509-4523.e11. [PMID: 38134885 PMCID: PMC11146010 DOI: 10.1016/j.molcel.2023.11.025] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 01/30/2023] [Revised: 08/15/2023] [Accepted: 11/21/2023] [Indexed: 12/24/2023]

Ma JG, O’Neill MJ, Richardson E, Thomson KL, Ingles J, Muhammad A, Solus JF, Davogustto G, Anderson KC, Benjamin Shoemaker M, Stergachis AB, Floyd BJ, Dunn K, Parikh VN, Chubb H, Perrin MJ, Roden DM, Vandenberg JI, Ng CA, Glazer AM. Multi-site validation of a functional assay to adjudicate SCN5A Brugada Syndrome-associated variants. medRxiv 2023:2023.12.19.23299592. [PMID: 38196587 PMCID: PMC10775332 DOI: 10.1101/2023.12.19.23299592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/11/2024]

Affiliation(s)

Joanne G. Ma Mark Cowley Lidwill Research Program in Cardiac Electrophysiology, Victor Chang Cardiac Research Institute, Darlinghurst, NSW, Australia School of Clinical Medicine, UNSW Sydney, Darlinghurst, NSW, Australia
Matthew J. O’Neill Vanderbilt University School of Medicine, Nashville, TN, USA
Ebony Richardson Clinical Genomics Laboratory, Centre for Population Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia and Murdoch Children’s Research Institute, Melbourne, Australia
Kate L. Thomson Oxford Genetics Laboratories, Churchill Hospital, Oxford, UK
Jodie Ingles Clinical Genomics Laboratory, Centre for Population Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia and Murdoch Children’s Research Institute, Melbourne, Australia
Ayesha Muhammad Vanderbilt University School of Medicine, Nashville, TN, USA
Joseph F. Solus Vanderbilt Center for Arrhythmia Research and Therapeutics (VanCART), Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Giovanni Davogustto Division of Cardiovascular Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Katherine C. Anderson Division of Cardiovascular Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
M. Benjamin Shoemaker Division of Cardiovascular Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Andrew B. Stergachis University of Washington School of Medicine, Department of Medicine, Seattle, WA, USA
Brendan J. Floyd Stanford Center for Inherited Cardiovascular Disease, Stanford University School of Medicine, Stanford, CA, USA
Kyla Dunn Stanford Center for Inherited Cardiovascular Disease, Stanford University School of Medicine, Stanford, CA, USA
Victoria N. Parikh Stanford Center for Inherited Cardiovascular Disease, Stanford University School of Medicine, Stanford, CA, USA
Henry Chubb Stanford Center for Inherited Cardiovascular Disease, Stanford University School of Medicine, Stanford, CA, USA
Mark J. Perrin Department of Genomic Medicine, Royal Melbourne Hospital, Victoria, Australia
Dan M. Roden Vanderbilt Center for Arrhythmia Research and Therapeutics (VanCART), Departments of Medicine, Pharmacology, and Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
Jamie I. Vandenberg Mark Cowley Lidwill Research Program in Cardiac Electrophysiology, Victor Chang Cardiac Research Institute, Darlinghurst, NSW, Australia School of Clinical Medicine, UNSW Sydney, Darlinghurst, NSW, Australia
Chai-Ann Ng Mark Cowley Lidwill Research Program in Cardiac Electrophysiology, Victor Chang Cardiac Research Institute, Darlinghurst, NSW, Australia School of Clinical Medicine, UNSW Sydney, Darlinghurst, NSW, Australia
Andrew M. Glazer Vanderbilt Center for Arrhythmia Research and Therapeutics (VanCART), Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA

McCarley SC, Murphy DA, Thompson J, Shovlin CL. Pharmacogenomic Considerations for Anticoagulant Prescription in Patients with Hereditary Haemorrhagic Telangiectasia. J Clin Med 2023;12:7710. [PMID: 38137783 PMCID: PMC10744266 DOI: 10.3390/jcm12247710] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 11/15/2023] [Revised: 12/10/2023] [Accepted: 12/12/2023] [Indexed: 12/24/2023]

Radford EJ, Tan HK, Andersson MHL, Stephenson JD, Gardner EJ, Ironfield H, Waters AJ, Gitterman D, Lindsay S, Abascal F, Martincorena I, Kolesnik-Taylor A, Ng-Cordell E, Firth HV, Baker K, Perry JRB, Adams DJ, Gerety SS, Hurles ME. Saturation genome editing of DDX3X clarifies pathogenicity of germline and somatic variation. Nat Commun 2023;14:7702. [PMID: 38057330 PMCID: PMC10700591 DOI: 10.1038/s41467-023-43041-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 09/20/2022] [Accepted: 10/30/2023] [Indexed: 12/08/2023]

Zhang Q, Shao M. Transcript assembly and annotations: Bias and adjustment. PLoS Comput Biol 2023;19:e1011734. [PMID: 38127855 PMCID: PMC10769104 DOI: 10.1371/journal.pcbi.1011734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/07/2023] [Revised: 01/05/2024] [Accepted: 12/04/2023] [Indexed: 12/23/2023]

Dondi A, Lischetti U, Jacob F, Singer F, Borgsmüller N, Coelho R, Heinzelmann-Schwarz V, Beisel C, Beerenwinkel N. Detection of isoforms and genomic alterations by high-throughput full-length single-cell RNA sequencing in ovarian cancer. Nat Commun 2023;14:7780. [PMID: 38012143 PMCID: PMC10682465 DOI: 10.1038/s41467-023-43387-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/23/2023] [Accepted: 11/07/2023] [Indexed: 11/29/2023]

Sanchez-Mete L, Mosciatti L, Casadio M, Vittori L, Martayan A, Stigliano V. MUTYH-associated polyposis: Is it time to change upper gastrointestinal surveillance? A single-center case series and a literature overview. World J Gastrointest Oncol 2023;15:1891-1899. [DOI: 10.4251/wjgo.v15.i11.1891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/13/2023] [Revised: 05/28/2023] [Accepted: 06/13/2023] [Indexed: 11/15/2023]

Zhang P, Chaldebas M, Ogishi M, Al Qureshah F, Ponsin K, Feng Y, Rinchai D, Milisavljevic B, Han JE, Moncada-Vélez M, Keles S, Schröder B, Stenson PD, Cooper DN, Cobat A, Boisson B, Zhang Q, Boisson-Dupuis S, Abel L, Casanova JL. Genome-wide detection of human intronic AG-gain variants located between splicing branchpoints and canonical splice acceptor sites. Proc Natl Acad Sci U S A 2023;120:e2314225120. [PMID: 37931111 PMCID: PMC10655562 DOI: 10.1073/pnas.2314225120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/19/2023] [Accepted: 10/02/2023] [Indexed: 11/08/2023]

Affiliation(s)

Peng Zhang St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Matthieu Chaldebas St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Masato Ogishi St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Fahd Al Qureshah St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Khoren Ponsin St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Yi Feng St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Darawan Rinchai St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Baptiste Milisavljevic St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Ji Eun Han St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Marcela Moncada-Vélez St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065
Sevgi Keles Division of Pediatric Allergy and Immunology, Necmettin Erbakan University, Meram Medical Faculty, Konya 42080, Turkey
Bernd Schröder Institute of Physiological Chemistry, Technische Universität Dresden, Dresden 01307, Germany
Peter D. Stenson Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, United Kingdom
David N. Cooper Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, United Kingdom
Aurélie Cobat Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM UMR1163, Paris 75015, France Paris Cité University, Imagine Institute, Paris 75015, France
Bertrand Boisson St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065 Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM UMR1163, Paris 75015, France Paris Cité University, Imagine Institute, Paris 75015, France
Qian Zhang St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065 Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM UMR1163, Paris 75015, France Paris Cité University, Imagine Institute, Paris 75015, France
Stéphanie Boisson-Dupuis St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065 Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM UMR1163, Paris 75015, France Paris Cité University, Imagine Institute, Paris 75015, France
Laurent Abel St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065 Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM UMR1163, Paris 75015, France Paris Cité University, Imagine Institute, Paris 75015, France
Jean-Laurent Casanova St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065 Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM UMR1163, Paris 75015, France Paris Cité University, Imagine Institute, Paris 75015, France Department of Pediatrics, Necker Hospital for Sick Children, Paris 75015, France HHMI, New York, NY 10065

Shinder I, Hu R, Ji HJ, Chao KH, Pertea M. EASTR: Identifying and eliminating systematic alignment errors in multi-exon genes. Nat Commun 2023;14:7223. [PMID: 37940654 PMCID: PMC10632439 DOI: 10.1038/s41467-023-43017-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/10/2023] [Accepted: 10/30/2023] [Indexed: 11/10/2023]

Sun KY, Bai X, Chen S, Bao S, Kapoor M, Zhang C, Backman J, Joseph T, Maxwell E, Mitra G, Gorovits A, Mansfield A, Boutkov B, Gokhale S, Habegger L, Marcketta A, Locke A, Kessler MD, Sharma D, Staples J, Bovijn J, Gelfman S, Gioia AD, Rajagopal V, Lopez A, Varela JR, Alegre J, Berumen J, Tapia-Conyer R, Kuri-Morales P, Torres J, Emberson J, Collins R, Cantor M, Thornton T, Kang HM, Overton J, Shuldiner AR, Cremona ML, Nafde M, Baras A, Abecasis G, Marchini J, Reid JG, Salerno W, Balasubramanian S. A deep catalog of protein-coding variation in 985,830 individuals. bioRxiv 2023:2023.05.09.539329. [PMID: 37214792 PMCID: PMC10197621 DOI: 10.1101/2023.05.09.539329] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 05/24/2023]

Abstract

Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants. Individuals of African, Admixed American, East Asian, Middle Eastern, and South Asian ancestry account for 20% of this Exome dataset. Our catalog of variants includes approximately 10.5 million missense (54% novel) and 1.1 million predicted loss-of-function (pLOF) variants (65% novel, 53% observed only once). We identified individuals with rare homozygous pLOF variants in 4,874 genes, and for 1,838 of these this work is the first to document at least one pLOF homozygote. Additional insights from the RGC-ME dataset include 1) improved estimates of selection against heterozygous loss-of-function and identification of 3,459 genes intolerant to loss-of-function, 83 of which were previously assessed as tolerant to loss-of-function and 1,241 that lack disease annotations; 2) identification of regions depleted of missense variation in 457 genes that are tolerant to loss-of-function; 3) functional interpretation for 10,708 variants of unknown or conflicting significance reported in ClinVar as cryptic splice sites using splicing score thresholds based on empirical variant deleteriousness scores derived from RGC-ME; and 4) an observation that approximately 3% of sequenced individuals carry a clinically actionable genetic variant in the ACMG SF 3.1 list of genes. We make this important resource of coding variation available to the public through a variant allele frequency browser. 
We anticipate that this report and the RGC-ME dataset will serve as a valuable reference for understanding rare coding variation and help advance precision medicine efforts.

Affiliation(s)

Kathie Y. Sun Regeneron Genetics Center, Tarrytown, NY, USA
Xiaodong Bai Regeneron Genetics Center, Tarrytown, NY, USA
Siying Chen Regeneron Genetics Center, Tarrytown, NY, USA
Suying Bao Regeneron Genetics Center, Tarrytown, NY, USA
Manav Kapoor Regeneron Genetics Center, Tarrytown, NY, USA
Chuanyi Zhang Regeneron Genetics Center, Tarrytown, NY, USA
Joshua Backman Regeneron Genetics Center, Tarrytown, NY, USA
Tyler Joseph Regeneron Genetics Center, Tarrytown, NY, USA
Evan Maxwell Regeneron Genetics Center, Tarrytown, NY, USA
George Mitra Regeneron Genetics Center, Tarrytown, NY, USA
Alexander Gorovits Regeneron Genetics Center, Tarrytown, NY, USA
Adam Mansfield Regeneron Genetics Center, Tarrytown, NY, USA
Boris Boutkov Regeneron Genetics Center, Tarrytown, NY, USA
Sujit Gokhale Regeneron Genetics Center, Tarrytown, NY, USA
Lukas Habegger Regeneron Genetics Center, Tarrytown, NY, USA
Anthony Marcketta Regeneron Genetics Center, Tarrytown, NY, USA
Adam Locke Regeneron Genetics Center, Tarrytown, NY, USA
Michael D. Kessler Regeneron Genetics Center, Tarrytown, NY, USA
Deepika Sharma Regeneron Genetics Center, Tarrytown, NY, USA
Jeffrey Staples Regeneron Genetics Center, Tarrytown, NY, USA
Jonas Bovijn Regeneron Genetics Center, Tarrytown, NY, USA
Sahar Gelfman Regeneron Genetics Center, Tarrytown, NY, USA
Alessandro Di Gioia Regeneron Genetics Center, Tarrytown, NY, USA
Veera Rajagopal Regeneron Genetics Center, Tarrytown, NY, USA
Alexander Lopez Regeneron Genetics Center, Tarrytown, NY, USA
Jennifer Rico Varela Regeneron Genetics Center, Tarrytown, NY, USA
Jesus Alegre Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
Jaime Berumen Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
Roberto Tapia-Conyer Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
Pablo Kuri-Morales Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
Jason Torres Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Jonathan Emberson Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Rory Collins Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Regeneron Genetics Center Regeneron Genetics Center, Tarrytown, NY, USA
RGC-ME Cohort Partners
Michael Cantor Regeneron Genetics Center, Tarrytown, NY, USA
Timothy Thornton Regeneron Genetics Center, Tarrytown, NY, USA
Hyun Min Kang Regeneron Genetics Center, Tarrytown, NY, USA
John Overton Regeneron Genetics Center, Tarrytown, NY, USA
Alan R. Shuldiner Regeneron Genetics Center, Tarrytown, NY, USA
M. Laura Cremona Regeneron Genetics Center, Tarrytown, NY, USA
Mona Nafde Regeneron Genetics Center, Tarrytown, NY, USA
Aris Baras Regeneron Genetics Center, Tarrytown, NY, USA
Goncalo Abecasis Regeneron Genetics Center, Tarrytown, NY, USA
Jonathan Marchini Regeneron Genetics Center, Tarrytown, NY, USA
Jeffrey G. Reid Regeneron Genetics Center, Tarrytown, NY, USA
William Salerno Regeneron Genetics Center, Tarrytown, NY, USA
Suganthi Balasubramanian Regeneron Genetics Center, Tarrytown, NY, USA

Cross NCP, Ernst T, Branford S, Cayuela JM, Deininger M, Fabarius A, Kim DDH, Machova Polakova K, Radich JP, Hehlmann R, Hochhaus A, Apperley JF, Soverini S. European LeukemiaNet laboratory recommendations for the diagnosis and management of chronic myeloid leukemia. Leukemia 2023;37:2150-2167. [PMID: 37794101 PMCID: PMC10624636 DOI: 10.1038/s41375-023-02048-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2023] [Revised: 09/13/2023] [Accepted: 09/20/2023] [Indexed: 10/06/2023]

Varabyou A, Sommer MJ, Erdogdu B, Shinder I, Minkin I, Chao KH, Park S, Heinz J, Pockrandt C, Shumate A, Rincon N, Puiu D, Steinegger M, Salzberg SL, Pertea M. CHESS 3: an improved, comprehensive catalog of human genes and transcripts based on large-scale expression data, phylogenetic analysis, and protein structure. Genome Biol 2023;24:249. [PMID: 37904256 PMCID: PMC10614308 DOI: 10.1186/s13059-023-03088-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 12/21/2022] [Accepted: 10/16/2023] [Indexed: 11/01/2023]

Affiliation(s)

Ales Varabyou Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA. Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA. Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.
Markus J Sommer Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
Beril Erdogdu Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
Ida Shinder Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Cross Disciplinary Graduate Program in Biomedical Sciences, Johns Hopkins School of Medicine, Baltimore, MD, USA
Ilia Minkin Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
Kuan-Hao Chao Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
Sukhwan Park School of Biological Sciences, Seoul National University, Seoul, South Korea Artificial Intelligence Institute, Seoul National University, Seoul, South Korea
Jakob Heinz Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
Christopher Pockrandt Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
Alaina Shumate Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
Natalia Rincon Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
Daniela Puiu Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
Martin Steinegger School of Biological Sciences, Seoul National University, Seoul, South Korea Artificial Intelligence Institute, Seoul National University, Seoul, South Korea Institute of Molecular Biology and Genetics, Seoul National University, Seoul, South Korea
Steven L Salzberg Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA. Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA. Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA. Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA. Department of Biostatistics, Johns Hopkins University, Baltimore, MD, USA.
Mihaela Pertea Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA. Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA. Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA. Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA.

Ljungdahl A, Kohani S, Page NF, Wells ES, Wigdor EM, Dong S, Sanders SJ. AlphaMissense is better correlated with functional assays of missense impact than earlier prediction algorithms. bioRxiv 2023:2023.10.24.562294. [PMID: 37961354 PMCID: PMC10634779 DOI: 10.1101/2023.10.24.562294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/15/2023]

Kubota N, Takeda R, Kobayashi J, Hidaka E, Nishi E, Takano K, Wakui K. Reanalysis of Chromosomal Microarray Data Using a Smaller Copy Number Variant Call Threshold Identifies Four Cases with Heterozygous Multiexon Deletions of ARID1B, EHMT1, and FOXP1 Genes. Mol Syndromol 2023;14:394-404. [PMID: 37901861 PMCID: PMC10601822 DOI: 10.1159/000530252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/22/2022] [Accepted: 03/16/2023] [Indexed: 10/31/2023]

Amaral P, Carbonell-Sala S, De La Vega FM, Faial T, Frankish A, Gingeras T, Guigo R, Harrow JL, Hatzigeorgiou AG, Johnson R, Murphy TD, Pertea M, Pruitt KD, Pujar S, Takahashi H, Ulitsky I, Varabyou A, Wells CA, Yandell M, Carninci P, Salzberg SL. The status of the human gene catalogue. Nature 2023;622:41-47. [PMID: 37794265 PMCID: PMC10575709 DOI: 10.1038/s41586-023-06490-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Received: 03/09/2023] [Accepted: 07/27/2023] [Indexed: 10/06/2023]

Affiliation(s)

Paulo Amaral INSPER Institute of Education and Research, Sao Paulo, Brazil
Silvia Carbonell-Sala Centre for Genomic Regulation (CRG), Barcelona, Spain
Francisco M De La Vega Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA Tempus Labs, Chicago, IL, USA
Tiago Faial Nature Genetics, San Francisco, CA, USA
Adam Frankish European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Thomas Gingeras Department of Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
Roderic Guigo Centre for Genomic Regulation (CRG), Barcelona, Spain Universitat Pompeu Fabra (UPF), Barcelona, Spain
Jennifer L Harrow Centre for Genomics Research, Discovery Sciences, AstraZeneca, Royston, UK
Artemis G Hatzigeorgiou Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece Hellenic Pasteur Institute, Athens, Greece
Rory Johnson School of Biology and Environmental Science, University College Dublin, Dublin, Ireland Conway Institute of Biomedical and Biomolecular Research, University College Dublin, Dublin, Ireland Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland Department for BioMedical Research, University of Bern, Bern, Switzerland
Terence D Murphy National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
Mihaela Pertea Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
Kim D Pruitt National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
Shashikant Pujar National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
Hazuki Takahashi Laboratory for Transcriptome Technology, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Igor Ulitsky Department of Immunology and Regenerative Biology, Weizmann Institute of Science, Rehovot, Israel Department of Molecular Neuroscience, Weizmann Institute of Science, Rehovot, Israel
Ales Varabyou Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
Christine A Wells Stem Cell Systems, Department of Anatomy and Physiology, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Parkville, Victoria, Australia
Mark Yandell Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, UT, USA
Piero Carninci Laboratory for Transcriptome Technology, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan. Human Technopole, Milan, Italy.
Steven L Salzberg Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA. Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA. Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA. Department of Biostatistics, Johns Hopkins University, Baltimore, MD, USA.

Lang M, Kazdal D, Mohr I, Anamaterou C. Differences and similarities of GTF2I mutated thymomas in different Eurasian ethnic groups. Transl Lung Cancer Res 2023;12:1842-1844. [PMID: 37854159 PMCID: PMC10579828 DOI: 10.21037/tlcr-23-396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/16/2023] [Accepted: 09/06/2023] [Indexed: 10/20/2023]

Martin-Geary AC, Blakes AJM, Dawes R, Findlay SD, Lord J, Walker S, Talbot-Martin J, Wieder N, D’Souza EN, Fernandes M, Hilton S, Lahiri N, Campbell C, Jenkinson S, DeGoede CGEL, Anderson ER, Burge CB, Sanders SJ, Ellingford J, Baralle D, Banka S, Whiffin N. Systematic identification of disease-causing promoter and untranslated region variants in 8,040 undiagnosed individuals with rare disease. medRxiv 2023:2023.09.12.23295416. [PMID: 37745552 PMCID: PMC10516070 DOI: 10.1101/2023.09.12.23295416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/26/2023]

Affiliation(s)

Alexandra C Martin-Geary Big Data Institute, University of Oxford, UK Wellcome Centre for Human Genetics, University of Oxford, UK
Alexander J M Blakes Manchester Centre for Genomic Medicine, Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
Ruebena Dawes Big Data Institute, University of Oxford, UK Wellcome Centre for Human Genetics, University of Oxford, UK
Scott D Findlay Department of Biology, Massachusetts Institute of Technology, Cambridge, USA
Jenny Lord Genomics England, UK
Susan Walker Genomics England, UK
Jonathan Talbot-Martin Department of Bioengineering, Imperial College London, UK
Nechama Wieder Big Data Institute, University of Oxford, UK Wellcome Centre for Human Genetics, University of Oxford, UK
Elston N D’Souza Big Data Institute, University of Oxford, UK Wellcome Centre for Human Genetics, University of Oxford, UK
Maria Fernandes Big Data Institute, University of Oxford, UK Wellcome Centre for Human Genetics, University of Oxford, UK
Sarah Hilton Manchester Centre for Genomic Medicine, Manchester University NHS Foundation Trust, Health Innovation Manchester, Manchester M13 9WL, UK
Nayana Lahiri St George’s, University of London & St George’s University Hospitals NHS Foundation Trust, Institute of Molecular and Clinical Sciences, London, SW17 0QT, UK
Christopher Campbell Manchester Centre for Genomic Medicine, Manchester University NHS Foundation Trust, Health Innovation Manchester, Manchester M13 9WL, UK
Sarah Jenkinson Manchester Centre for Genomic Medicine, Manchester University NHS Foundation Trust, Health Innovation Manchester, Manchester M13 9WL, UK
Christian G E L DeGoede Department of Paediatric Neurology, Clinical Research Facility, Lancashire Teaching Hospitals NHS Trust Manchester Metropolitan University
Emily R Anderson Liverpool Centre for Genomic Medicine, Liverpool Women’s Hospital, Liverpool, UK
Christopher B. Burge Department of Biology, Massachusetts Institute of Technology, Cambridge, USA
Stephan J Sanders Institute of Developmental and Regenerative Medicine, Department of Paediatrics, University of Oxford, Oxford, OX3 7TY, UK Department of Psychiatry and Behavioral Sciences, UCSF Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, USA New York Genome Center, New York, NY, USA
Jamie Ellingford Manchester Centre for Genomic Medicine, Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK Manchester Centre for Genomic Medicine, Manchester University NHS Foundation Trust, Health Innovation Manchester, Manchester M13 9WL, UK
Diana Baralle School of Human Development and Health, Faculty of Medicine, University of Southampton, Southampton, United Kingdom
Siddharth Banka Manchester Centre for Genomic Medicine, Manchester University NHS Foundation Trust, Health Innovation Manchester, Manchester M13 9WL, UK
Nicola Whiffin Big Data Institute, University of Oxford, UK Wellcome Centre for Human Genetics, University of Oxford, UK Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA

Ryu J, Barkal S, Yu T, Jankowiak M, Zhou Y, Francoeur M, Phan QV, Li Z, Tognon M, Brown L, Love MI, Lettre G, Ascher DB, Cassa CA, Sherwood RI, Pinello L. Joint genotypic and phenotypic outcome modeling improves base editing variant effect quantification. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.08.23295253. [PMID: 37732177 PMCID: PMC10508837 DOI: 10.1101/2023.09.08.23295253] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/22/2023]

Affiliation(s)

Jayoung Ryu Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA Broad Institute of Harvard and MIT, Cambridge, MA, USA
Sam Barkal Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Tian Yu Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Martin Jankowiak Broad Institute of Harvard and MIT, Cambridge, MA, USA
Yunzhuo Zhou School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
Matthew Francoeur Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Quang Vinh Phan Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Zhijian Li Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA Broad Institute of Harvard and MIT, Cambridge, MA, USA
Manuel Tognon Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA Broad Institute of Harvard and MIT, Cambridge, MA, USA Computer Science Department, University of Verona, Verona, Italy
Lara Brown Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Michael I. Love Department of Genetics, Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
Guillaume Lettre Montreal Heart Institute, Montréal, QC H1T 1C8, Canada Faculté de Médecine, Université de Montréal, Montréal, QC H3T 1J4, Canada
David B. Ascher School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
Christopher A. Cassa Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Richard I. Sherwood Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Luca Pinello Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA Broad Institute of Harvard and MIT, Cambridge, MA, USA Department of Pathology, Harvard Medical School, Boston, MA, USA

Bohn E, Lau TTY, Wagih O, Masud T, Merico D. A curated census of pathogenic and likely pathogenic UTR variants and evaluation of deep learning models for variant effect prediction. Front Mol Biosci 2023;10:1257550. [PMID: 37745687 PMCID: PMC10517338 DOI: 10.3389/fmolb.2023.1257550] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 08/28/2023] [Indexed: 09/26/2023] Open

Abstract

Introduction: Variants in 5' and 3' untranslated regions (UTR) contribute to rare disease. While predictive algorithms to assist in classifying pathogenicity can potentially be highly valuable, the utility of these tools is often unclear, as it depends on carefully selected training and validation conditions. To address this, we developed a high confidence set of pathogenic (P) and likely pathogenic (LP) variants and assessed deep learning (DL) models for predicting their molecular effects. Methods: 3' and 5' UTR variants documented as P or LP (P/LP) were obtained from ClinVar and refined by reviewing the annotated variant effect and reassessing evidence of pathogenicity following published guidelines. Prediction scores from sequence-based DL models were compared between three groups: P/LP variants acting through the mechanism for which the model was designed (model-matched), those operating through other mechanisms (model-mismatched), and putative benign variants. PhyloP was used to compare conservation scores between P/LP and putative benign variants. Results: 295 3' and 188 5' UTR variants were obtained from ClinVar, of which 26 3' and 68 5' UTR variants were classified as P/LP. Predictions by DL models achieved statistically significant differences when comparing model-matched P/LP variants to both putative benign variants and model-mismatched P/LP variants, as well as when comparing all P/LP variants to putative benign variants. PhyloP conservation scores were significantly higher among P/LP compared to putative benign variants for both the 3' and 5' UTR. Discussion: In conclusion, we present a high-confidence set of P/LP 3' and 5' UTR variants spanning a range of mechanisms and supported by detailed pathogenicity and molecular mechanism evidence curation. Predictions from DL models further substantiate these classifications. These datasets will support further development and validation of DL algorithms designed to predict the functional impact of variants that may be implicated in rare disease.

Kerimov N, Tambets R, Hayhurst JD, Rahu I, Kolberg P, Raudvere U, Kuzmin I, Chowdhary A, Vija A, Teras HJ, Kanai M, Ulirsch J, Ryten M, Hardy J, Guelfi S, Trabzuni D, Kim-Hellmuth S, Rayner W, Finucane H, Peterson H, Mosaku A, Parkinson H, Alasoo K. eQTL Catalogue 2023: New datasets, X chromosome QTLs, and improved detection and visualisation of transcript-level QTLs. PLoS Genet 2023;19:e1010932. [PMID: 37721944 PMCID: PMC10538656 DOI: 10.1371/journal.pgen.1010932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 09/28/2023] [Accepted: 08/22/2023] [Indexed: 09/20/2023] Open

Affiliation(s)

Nurlan Kerimov Institute of Computer Science, University of Tartu, Tartu, Estonia Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
Ralf Tambets Institute of Computer Science, University of Tartu, Tartu, Estonia
James D. Hayhurst Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
Ida Rahu Institute of Computer Science, University of Tartu, Tartu, Estonia
Peep Kolberg Institute of Computer Science, University of Tartu, Tartu, Estonia
Uku Raudvere Institute of Computer Science, University of Tartu, Tartu, Estonia
Ivan Kuzmin Institute of Computer Science, University of Tartu, Tartu, Estonia
Anshika Chowdhary Institute of Translational Genomics, Helmholtz Munich, Neuherberg, Germany
Andreas Vija Institute of Computer Science, University of Tartu, Tartu, Estonia
Hans J. Teras Institute of Computer Science, University of Tartu, Tartu, Estonia
Masahiro Kanai Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, United States of America Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Jacob Ulirsch Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, United States of America Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Mina Ryten Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, University College London, London, United Kingdom
John Hardy Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, University College London, London, United Kingdom
Sebastian Guelfi Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, University College London, London, United Kingdom
Daniah Trabzuni Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, University College London, London, United Kingdom
Sarah Kim-Hellmuth Institute of Translational Genomics, Helmholtz Munich, Neuherberg, Germany Department of Pediatrics, Dr. von Hauner Children’s Hospital, University Hospital LMU Munich, Munich, Germany
William Rayner Institute of Translational Genomics, Helmholtz Munich, Neuherberg, Germany
Hilary Finucane Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, United States of America Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Hedi Peterson Institute of Computer Science, University of Tartu, Tartu, Estonia
Abayomi Mosaku Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
Helen Parkinson Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
Kaur Alasoo Institute of Computer Science, University of Tartu, Tartu, Estonia Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom

Hayesmoore JB, Bhuiyan ZA, Coviello DA, du Sart D, Edwards M, Iascone M, Morris-Rosendahl DJ, Sheils K, van Slegtenhorst M, Thomson KL. EMQN: Recommendations for genetic testing in inherited cardiomyopathies and arrhythmias. Eur J Hum Genet 2023;31:1003-1009. [PMID: 37443332 PMCID: PMC10474043 DOI: 10.1038/s41431-023-01421-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 06/21/2023] [Accepted: 06/22/2023] [Indexed: 07/15/2023] Open

Lee H, Greer SU, Pavlichin DS, Zhou B, Urban AE, Weissman T, Ji HP. Pan-conserved segment tags identify ultra-conserved sequences across assemblies in the human pangenome. CELL REPORTS METHODS 2023;3:100543. [PMID: 37671027 PMCID: PMC10475782 DOI: 10.1016/j.crmeth.2023.100543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 04/14/2023] [Accepted: 07/06/2023] [Indexed: 09/07/2023]

Korbecki J, Bosiacki M, Chlubek D, Baranowska-Bosiacka I. Bioinformatic Analysis of the CXCR2 Ligands in Cancer Processes. Int J Mol Sci 2023;24:13287. [PMID: 37686093 PMCID: PMC10487711 DOI: 10.3390/ijms241713287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 08/23/2023] [Accepted: 08/24/2023] [Indexed: 09/10/2023] Open

Warburton PE, Sebra RP. Long-Read DNA Sequencing: Recent Advances and Remaining Challenges. Annu Rev Genomics Hum Genet 2023;24:109-132. [PMID: 37075062 DOI: 10.1146/annurev-genom-101722-103045] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/20/2023]

Foreman J, Perrett D, Mazaika E, Hunt SE, Ware JS, Firth HV. DECIPHER: Improving Genetic Diagnosis Through Dynamic Integration of Genomic and Clinical Data. Annu Rev Genomics Hum Genet 2023;24:151-176. [PMID: 37285546 PMCID: PMC7615097 DOI: 10.1146/annurev-genom-102822-100509] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Brovkina MV, Chapman MA, Holding ML, Clowney EJ. Emergence and influence of sequence bias in evolutionarily malleable, mammalian tandem arrays. BMC Biol 2023;21:179. [PMID: 37612705 PMCID: PMC10463633 DOI: 10.1186/s12915-023-01673-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/01/2023] [Indexed: 08/25/2023] Open

Ahles A, Engelhardt S. Genetic Variants of Adrenoceptors. Handb Exp Pharmacol 2023. [PMID: 37578621 DOI: 10.1007/164_2023_676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]

Guigó R. Genome annotation: From human genetics to biodiversity genomics. CELL GENOMICS 2023;3:100375. [PMID: 37601977 PMCID: PMC10435374 DOI: 10.1016/j.xgen.2023.100375] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/22/2023]

Wang L. Reference-guided search for open reading frames. NATURE COMPUTATIONAL SCIENCE 2023;3:667-668. [PMID: 38177317 DOI: 10.1038/s43588-023-00497-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2024]

Bowler-Barnett EH, Fan J, Luo J, Magrane M, Martin MJ, Orchard S. UniProt and Mass Spectrometry-Based Proteomics-A 2-Way Working Relationship. Mol Cell Proteomics 2023;22:100591. [PMID: 37301379 PMCID: PMC10404557 DOI: 10.1016/j.mcpro.2023.100591] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 05/20/2023] [Accepted: 06/07/2023] [Indexed: 06/12/2023] Open

Varabyou A, Erdogdu B, Salzberg SL, Pertea M. Investigating Open Reading Frames in Known and Novel Transcripts using ORFanage. NATURE COMPUTATIONAL SCIENCE 2023;3:700-708. [PMID: 38098813 PMCID: PMC10718564 DOI: 10.1038/s43588-023-00496-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 07/05/2023] [Indexed: 12/17/2023]

Chao KH, Mao A, Salzberg SL, Pertea M. Splam: a deep-learning-based splice site predictor that improves spliced alignments. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.27.550754. [PMID: 37546880 PMCID: PMC10402160 DOI: 10.1101/2023.07.27.550754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]

Pardo-Palacios FJ, Wang D, Reese F, Diekhans M, Carbonell-Sala S, Williams B, Loveland JE, De María M, Adams MS, Balderrama-Gutierrez G, Behera AK, Gonzalez JM, Hunt T, Lagarde J, Liang CE, Li H, Jerryd Meade M, Moraga Amador DA, Prjibelski AD, Birol I, Bostan H, Brooks AM, Hasan Çelik M, Chen Y, Du MR, Felton C, Göke J, Hafezqorani S, Herwig R, Kawaji H, Lee J, Liang Li J, Lienhard M, Mikheenko A, Mulligan D, Ming Nip K, Pertea M, Ritchie ME, Sim AD, Tang AD, Kei Wan Y, Wang C, Wong BY, Yang C, Barnes I, Berry A, Capella S, Dhillon N, Fernandez-Gonzalez JM, Ferrández-Peral L, Garcia-Reyero N, Goetz S, Hernández-Ferrer C, Kondratova L, Liu T, Martinez-Martin A, Menor C, Mestre-Tomás J, Mudge JM, Panayotova NG, Paniagua A, Repchevsky D, Rouchka E, Saint-John B, Sapena E, Sheynkman L, Laird Smith M, Suner MM, Takahashi H, Youngworth IA, Carninci P, Denslow ND, Guigó R, Hunter ME, Tilgner HU, Wold BJ, Vollmers C, Frankish A, Fai Au K, Sheynkman GM, Mortazavi A, Conesa A, Brooks AN. Systematic assessment of long-read RNA-seq methods for transcript identification and quantification. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.25.550582. [PMID: 37546854 PMCID: PMC10402094 DOI: 10.1101/2023.07.25.550582] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]

Affiliation(s)

Francisco J. Pardo-Palacios Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain These authors contributed equally to this work
Dingjie Wang Department of Biomedical Informatics, The Ohio State University, Columbus, USA Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, USA These authors contributed equally to this work
Fairlie Reese Developmental and Cell Biology, University of California, Irvine, Irvine, USA Center for Complex Biological Systems, University of California, Irvine, Irvine, USA These authors contributed equally to this work
Mark Diekhans UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, USA These authors contributed equally to this work
Sílvia Carbonell-Sala Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain These authors contributed equally to this work
Brian Williams Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, USA These authors contributed equally to this work
Jane E. Loveland European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK These authors contributed equally to this work
Maite De María Department of Physiological Sciences, College of Veterinary Medicine, University of Florida, Gainesville, USA Center for Environmental and Human Toxicology, University of Florida, Gainesville, USA These authors contributed equally to this work
Matthew S. Adams Molecular Cell and Developmental Biology, University of California, Santa Cruz, Santa Cruz, USA These authors contributed equally to this work
Gabriela Balderrama-Gutierrez Developmental and Cell Biology, University of California, Irvine, Irvine, USA Center for Complex Biological Systems, University of California, Irvine, Irvine, USA These authors contributed equally to this work
Amit K. Behera Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, USA These authors contributed equally to this work
Jose M. Gonzalez European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK These authors contributed equally to this work
Toby Hunt European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK These authors contributed equally to this work
Julien Lagarde Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain Flomics Biotech, Dr Aiguader 88, Barcelona 08003, Spain These authors contributed equally to this work
Cindy E. Liang Molecular Cell and Developmental Biology, University of California, Santa Cruz, Santa Cruz, USA These authors contributed equally to this work
Haoran Li Department of Biomedical Informatics, The Ohio State University, Columbus, USA Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, USA These authors contributed equally to this work
Marcus Jerryd Meade Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, USA These authors contributed equally to this work
David A. Moraga Amador Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, USA These authors contributed equally to this work
Andrey D. Prjibelski Department of Computer Science, University of Helsinki, Helsinki, Finland Center for Bioinformatics and Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia These authors contributed equally to this work
Inanc Birol Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, Canada
Hamed Bostan Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, Durham, USA
Ashley M. Brooks Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, Durham, USA
Muhammed Hasan Çelik Developmental and Cell Biology, University of California, Irvine, Irvine, USA Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Ying Chen Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
Mei R.M. Du Walter and Eliza Hall Institute of Medical Research, Parkville, Australia
Colette Felton Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, USA
Jonathan Göke Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore Department of Statistics and Data Science, National University of Singapore, Singapore, Singapore
Saber Hafezqorani Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, Canada
Ralf Herwig Department Computational Molecular Biology, Max-Planck-Institute for Molecular Genetics, Berlin, Germany
Hideya Kawaji Research Center for Genome & Medical Sciences, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
Joseph Lee Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
Jian Liang Li Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, Durham, USA
Matthias Lienhard Department Computational Molecular Biology, Max-Planck-Institute for Molecular Genetics, Berlin, Germany
Alla Mikheenko Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London, UK
Dennis Mulligan Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, USA
Ka Ming Nip Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, Canada
Mihaela Pertea Department of Biomedical Engineering, Johns Hopkins University, Baltimore, USA Center for Computational Biology, Johns Hopkins University, Baltimore, USA
Matthew E. Ritchie Walter and Eliza Hall Institute of Medical Research, Parkville, Australia Department of Medical Biology, The University of Melbourne, Parkville, Australia
Andre D. Sim Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
Alison D. Tang Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, USA
Yuk Kei Wan Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
Changqing Wang Walter and Eliza Hall Institute of Medical Research, Parkville, Australia
Brandon Y. Wong Department of Biomedical Engineering, Johns Hopkins University, Baltimore, USA Center for Computational Biology, Johns Hopkins University, Baltimore, USA
Chen Yang Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, Canada
If Barnes European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Andrew Berry European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Salvador Capella Barcelona Supercomputing Center, Barcelona, Spain
Namrita Dhillon Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, USA
Jose M. Fernandez-Gonzalez Barcelona Supercomputing Center, Barcelona, Spain
Luis Ferrández-Peral Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain
Natàlia Garcia-Reyero Environmental Laboratory, US Army Engineer Research & Development Center, Vicksburg, USA
Stefan Goetz Biobam Bioinformatics SL, Valencia, Spain
Carles Hernández-Ferrer Barcelona Supercomputing Center, Barcelona, Spain
Liudmyla Kondratova Genetics Institute, University of Florida, Gainesville, USA
Tianyuan Liu Cardiff University, Cardiff, UK
Alessandra Martinez-Martin Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain
Carlos Menor Biobam Bioinformatics SL, Valencia, Spain
Jorge Mestre-Tomás Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain
Jonathan M. Mudge European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Nedka G. Panayotova Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, USA
Alejandro Paniagua Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain
Dmitry Repchevsky Barcelona Supercomputing Center, Barcelona, Spain
Eric Rouchka Department of Biochemistry & Molecular Genetics, University of Louisville, Louisville, USA
Brandon Saint-John Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, USA
Enrique Sapena European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Leon Sheynkman Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, USA
Melissa Laird Smith Department of Biochemistry & Molecular Genetics, University of Louisville, Louisville, USA
Marie-Marthe Suner European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Hazuki Takahashi Center for Integrative Medical Sciences, Laboratory for Transcriptome Technology, RIKEN, Yokohama, Japan
Ingrid Ashley Youngworth Department of Genetics, Stanford University, Palo Alto, USA
Piero Carninci Center for Integrative Medical Sciences, Laboratory for Transcriptome Technology, RIKEN, Yokohama, Japan Human Technopole, Milano, Italy
Nancy D. Denslow Department of Physiological Sciences, College of Veterinary Medicine, University of Florida, Gainesville, USA Center for Environmental and Human Toxicology, Department of Physiological Sciences, University of Florida, Gainesville, USA
Roderic Guigó Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Catalonia, Spain Universitat Pompeu Fabra (UPF), Barcelona, Catalonia, Spain
Margaret E. Hunter U.S. Geological Survey, Wetland and Aquatic Research Center, Gainesville, USA
Hagen U. Tilgner Brain and Mind Research Institute and Center for Neurogenetics, Weill Cornell Medicine, New York City, USA
Barbara J. Wold Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, USA
Christopher Vollmers Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, USA
Adam Frankish European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Kin Fai Au Department of Biomedical Informatics, The Ohio State University, Columbus, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, USA
Gloria M. Sheynkman Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, USA; Center for Public Health Genomics; UVA Cancer Center, University of Virginia, Charlottesville, USA
Ali Mortazavi Developmental and Cell Biology, University of California, Irvine, Irvine, USA; Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Ana Conesa Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain; Microbiology and Cell Science Department, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, USA
Angela N. Brooks UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, USA; Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, USA

Sansbury SE, Serebrenik YV, Lapidot T, Burslem GM, Shalem O. Pooled tagging and hydrophobic targeting of endogenous proteins for unbiased mapping of unfolded protein responses. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.13.548611. [PMID: 37503003 PMCID: PMC10370017 DOI: 10.1101/2023.07.13.548611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/29/2023]

Sweatt AJ, Griffiths CD, Paudel BB, Janes KA. Proteome-wide copy-number estimation from transcriptomics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.10.548432. [PMID: 37503057 PMCID: PMC10369941 DOI: 10.1101/2023.07.10.548432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/29/2023]

Abstract

Protein copy numbers constrain systems-level properties of regulatory networks, but absolute proteomic data remain scarce compared to transcriptomics obtained by RNA sequencing. We addressed this persistent gap by relating mRNA to protein statistically using best-available data from quantitative proteomics-transcriptomics for 4366 genes in 369 cell lines. The approach starts with a central estimate of protein copy number and hierarchically appends mRNA-protein and mRNA-mRNA dependencies to define an optimal gene-specific model that links mRNAs to protein. For dozens of independent cell lines and primary prostate samples, these protein inferences from mRNA outmatch stringent null models, a count-based protein-abundance repository, and empirical protein-to-mRNA ratios. The optimal mRNA-to-protein relationships capture biological processes along with hundreds of known protein-protein interaction complexes, suggesting mechanistic relationships are embedded. We use the method to estimate viral-receptor abundances of CD55-CXADR from human heart transcriptomes and build 1489 systems-biology models of coxsackievirus B3 infection susceptibility. When applied to 796 RNA sequencing profiles of breast cancer from The Cancer Genome Atlas, inferred copy-number estimates collectively reclassify 26% of Luminal A and 29% of Luminal B tumors. Protein-based reassignments strongly involve a pharmacologic target for luminal breast cancer (CDK4) and an α-catenin that is often undetectable at the mRNA level (CTNNA2). Thus, by adopting a gene-centered perspective of mRNA-protein covariation across different biological contexts, we achieve accuracies comparable to the technical reproducibility limits of contemporary proteomics. The collection of gene-specific models is assembled as a web tool for users seeking mRNA-guided predictions of absolute protein abundance (http://janeslab.shinyapps.io/Pinferna).

Walker LC, Hoya MDL, Wiggins GAR, Lindy A, Vincent LM, Parsons MT, Canson DM, Bis-Brewer D, Cass A, Tchourbanov A, Zimmermann H, Byrne AB, Pesaran T, Karam R, Harrison SM, Spurdle AB. Using the ACMG/AMP framework to capture evidence related to predicted and observed impact on splicing: Recommendations from the ClinGen SVI Splicing Subgroup. Am J Hum Genet 2023;110:1046-1067. [PMID: 37352859 PMCID: PMC10357475 DOI: 10.1016/j.ajhg.2023.06.002] [Citation(s) in RCA: 29] [Impact Index Per Article: 29.0] [Received: 02/20/2023] [Revised: 06/01/2023] [Accepted: 06/02/2023] [Indexed: 06/25/2023]

Abstract

The American College of Medical Genetics and Genomics (ACMG)/Association for Molecular Pathology (AMP) framework for classifying variants uses six evidence categories related to the splicing potential of variants: PVS1, PS3, PP3, BS3, BP4, and BP7. However, the lack of guidance on how to apply such codes has contributed to variation in the specifications developed by different Clinical Genome Resource (ClinGen) Variant Curation Expert Panels. The ClinGen Sequence Variant Interpretation Splicing Subgroup was established to refine recommendations for applying ACMG/AMP codes relating to splicing data and computational predictions. We utilized empirically derived splicing evidence to (1) determine the evidence weighting of splicing-related data and appropriate criteria code selection for general use, (2) outline a process for integrating splicing-related considerations when developing a gene-specific PVS1 decision tree, and (3) exemplify methodology to calibrate splice prediction tools. We propose repurposing the PVS1_Strength code to capture splicing assay data that provide experimental evidence for variants resulting in RNA transcript(s) with loss of function. Conversely, BP7 may be used to capture RNA results demonstrating no splicing impact for intronic and synonymous variants. We propose that the PS3/BS3 codes are applied only for well-established assays that measure functional impact not directly captured by RNA-splicing assays. We recommend the application of PS1 based on similarity of predicted RNA-splicing effects for a variant under assessment in comparison with a known pathogenic variant. The recommendations and approaches for consideration and evaluation of RNA-assay evidence described aim to help standardize variant pathogenicity classification processes when interpreting splicing-based evidence.

Hamza A, El-Sissy C, Yousfi N, Martins PV, Rafat C, Masliah-Planchon J, Frémeaux-Bacchi V, Mesnard L. The absence of CFHR3 and CFHR1 genes from the T2T-CHM13 assembly can limit the molecular diagnosis of complement-related diseases. Eur J Hum Genet 2023;31:730-732. [PMID: 37032353 PMCID: PMC10325998 DOI: 10.1038/s41431-023-01350-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 09/27/2022] [Revised: 03/14/2023] [Accepted: 03/20/2023] [Indexed: 04/11/2023]

Bucalo A, Conti G, Valentini V, Capalbo C, Bruselles A, Tartaglia M, Bonanni B, Calistri D, Coppa A, Cortesi L, Giannini G, Gismondi V, Manoukian S, Manzella L, Montagna M, Peterlongo P, Radice P, Russo A, Tibiletti MG, Turchetti D, Viel A, Zanna I, Palli D, Silvestri V, Ottini L. Male breast cancer risk associated with pathogenic variants in genes other than BRCA1/2: an Italian case-control study. Eur J Cancer 2023;188:183-191. [PMID: 37262986 DOI: 10.1016/j.ejca.2023.04.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/06/2023] [Revised: 04/24/2023] [Accepted: 04/26/2023] [Indexed: 06/03/2023]

Affiliation(s)

Agostino Bucalo Department of Molecular Medicine, Sapienza University of Rome, Rome, Italy
Giulia Conti Department of Molecular Medicine, Sapienza University of Rome, Rome, Italy
Virginia Valentini Department of Molecular Medicine, Sapienza University of Rome, Rome, Italy
Carlo Capalbo Department of Molecular Medicine, Sapienza University of Rome, Rome, Italy
Alessandro Bruselles Department of Oncology and Molecular Medicine, Istituto Superiore di Sanità, Rome, Italy
Marco Tartaglia Molecular Genetics and Functional Genomics Research Unit, Ospedale Pediatrico Bambino Gesù, IRCCS, Rome, Italy
Bernardo Bonanni Division of Cancer Prevention and Genetics, European Institute of Oncology (IEO), IRCCS, Milan, Italy
Daniele Calistri Istituto Romagnolo per lo Studio dei Tumori "Dino Amadori"-IRST IRCCS, Meldola, Italy
Anna Coppa Department of Experimental Medicine, Sapienza University of Rome, Rome, Italy
Laura Cortesi Department of Oncology and Haematology, University of Modena and Reggio Emilia, Modena, Italy
Giuseppe Giannini Department of Molecular Medicine, Sapienza University of Rome, Rome, Italy; Istituto Pasteur-Fondazione Cenci Bolognetti, Rome, Italy
Viviana Gismondi Hereditary Cancer Unit, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
Siranoush Manoukian Unità di Genetica Medica, Dipartimento di Oncologia Medica ed Ematologia, Fondazione IRCCS Istituto Nazionale dei Tumori (INT), Milan, Italy
Livia Manzella Department of Clinical and Experimental Medicine, University of Catania, Catania, Italy
Marco Montagna Immunology and Molecular Oncology Unit, Veneto Institute of Oncology IOV - IRCCS, Padua, Italy
Paolo Peterlongo Genome Diagnostics Program, IFOM ETS - The AIRC Institute of Molecular Oncology, Milan, Italy
Paolo Radice Unit of Molecular Bases of Genetic Risk and Genetic Testing, Department of Research, Fondazione IRCCS Istituto Nazionale Dei Tumori (INT), Milan, Italy
Antonio Russo Section of Medical Oncology, Department of Surgical and Oncological Sciences, University of Palermo, Palermo, Italy
Maria Grazia Tibiletti Dipartimento di Patologia, ASST Settelaghi and Centro di Ricerca per lo studio dei tumori eredo-familiari, Università dell'Insubria, Varese, Italy
Daniela Turchetti Department of Medical and Surgical Sciences (DIMEC), University of Bologna, Bologna, Italy
Alessandra Viel Unità di Oncogenetica e Oncogenomica Funzionale, Centro di Riferimento Oncologico di Aviano (CRO), IRCCS, Aviano, Italy
Ines Zanna Cancer Risk Factors and Lifestyle Epidemiology Unit, Institute for Cancer Research, Prevention and Clinical Network (ISPRO), Florence, Italy
Domenico Palli Cancer Risk Factors and Lifestyle Epidemiology Unit, Institute for Cancer Research, Prevention and Clinical Network (ISPRO), Florence, Italy
Valentina Silvestri Department of Molecular Medicine, Sapienza University of Rome, Rome, Italy
Laura Ottini Department of Molecular Medicine, Sapienza University of Rome, Rome, Italy.

Kovačević M, Milićević O, Branković M, Janković M, Novaković I, Sokić D, Ristić A, Shamsani J, Vojvodić N. Novel variants in established epilepsy genes in focal epilepsy. Seizure 2023;110:146-152. [PMID: 37390664 DOI: 10.1016/j.seizure.2023.06.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/28/2023] [Revised: 05/30/2023] [Accepted: 06/06/2023] [Indexed: 07/02/2023]

Abstract

INTRODUCTION

Next generation sequencing (NGS) has greatly expanded our understanding of genetic contributors in multiple epilepsy syndromes, including focal epilepsy. Describing the genetic architecture of common syndromes promises to facilitate the diagnostic process as well as aid in the identification of patients who stand to benefit from genetic testing, but most studies to date have been limited to examining children or adults with intellectual disability. Our aim was to determine the yield of targeted sequencing of 5 established epilepsy genes (DEPDC5, LGI1, SCN1A, GRIN2A, and PCDH19) in an extensively phenotyped cohort of focal epilepsy patients with normal intellectual function or mild intellectual disability, as well as describe novel variants and determine the characteristics of variant carriers.

PATIENTS AND METHODS

Targeted panel sequencing was performed on 96 patients with a strong clinical suspicion of genetic focal epilepsy. Patients had previously gone through a comprehensive diagnostic epilepsy evaluation in The Neurology Clinic, University Clinical Center of Serbia. Variants of interest (VOI) were classified using the American College of Medical Genetics and the Association for Molecular Pathology criteria.

RESULTS

Six VOI in eight (8/96, 8.3%) patients were found in our cohort. Four likely pathogenic VOI were determined in six (6/96, 6.2%) patients, two DEPDC5 variants in two patients, one SCN1A variant in two patients and one PCDH19 variant in two patients. One variant of unknown significance (VUS) was found in GRIN2A in one (1/96, 1.0%) patient. Only one VOI in GRIN2A was classified as likely benign. No VOI were detected in LGI1.

CONCLUSION

Sequencing of only five known epilepsy genes yielded a diagnostic result in 6.2% of our cohort and revealed multiple novel variants. Further research is necessary for a better understanding of the genetic basis in common epilepsy syndromes in patients with normal intellectual function or mild intellectual disability.

Florian K, Benet-Pagès A, Berner D, Teubert A, Eck S, Arnold N, Bauer P, Begemann M, Sturm M, Kleinle S, Haack TB, Eggermann T. Quality assurance within the context of genome diagnostics (a German perspective). MED GENET-BERLIN 2023;35:91-104. [PMID: 38840862 PMCID: PMC10842579 DOI: 10.1515/medgen-2023-2028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/07/2024]

Ameratunga R, Edwards ESJ, Lehnert K, Leung E, Woon ST, Lea E, Allan C, Chan L, Steele R, Longhurst H, Bryant VL. The Rapidly Expanding Genetic Spectrum of Common Variable Immunodeficiency-Like Disorders. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. IN PRACTICE 2023;11:1646-1664. [PMID: 36796510 DOI: 10.1016/j.jaip.2023.01.048] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 12/21/2022] [Revised: 01/21/2023] [Accepted: 01/27/2023] [Indexed: 02/16/2023]

Affiliation(s)

Rohan Ameratunga Department of Clinical Immunology, Auckland Hospital, Auckland, New Zealand; Department of Virology and Immunology, Auckland Hospital, Auckland, New Zealand; Department of Molecular Medicine and Pathology, School of Medicine, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
Emily S J Edwards The Jeffrey Modell Diagnostic and Research Centre for Primary Immunodeficiencies, and Allergy and Clinical Immunology Laboratory, Department of Immunology, Monash University, Melbourne, VIC, Australia
Klaus Lehnert Applied Translational Genetics Group, School of Biological Sciences, University of Auckland, Auckland, New Zealand; Maurice Wilkins Centre, School of Biological Sciences, University of Auckland, Auckland, New Zealand
Euphemia Leung Auckland Cancer Society Research Centre, School of Medicine, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
See-Tarn Woon Department of Virology and Immunology, Auckland Hospital, Auckland, New Zealand
Edward Lea Department of Virology and Immunology, Auckland Hospital, Auckland, New Zealand
Caroline Allan Department of Virology and Immunology, Auckland Hospital, Auckland, New Zealand
Lydia Chan Department of Clinical Immunology, Auckland Hospital, Auckland, New Zealand
Richard Steele Department of Virology and Immunology, Auckland Hospital, Auckland, New Zealand; Department of Respiratory Medicine, Wellington Hospital, Wellington, New Zealand
Hilary Longhurst Department of Medicine, School of Medicine, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
Vanessa L Bryant Department of Immunology, Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia; Department of Medical Biology, University of Melbourne, Parkville, VIC, Australia; Department of Clinical Immunology and Allergy, Royal Melbourne Hospital, Parkville, VIC, Australia

Reese F, Williams B, Balderrama-Gutierrez G, Wyman D, Çelik MH, Rebboah E, Rezaie N, Trout D, Razavi-Mohseni M, Jiang Y, Borsari B, Morabito S, Liang HY, McGill CJ, Rahmanian S, Sakr J, Jiang S, Zeng W, Carvalho K, Weimer AK, Dionne LA, McShane A, Bedi K, Elhajjajy SI, Upchurch S, Jou J, Youngworth I, Gabdank I, Sud P, Jolanki O, Strattan JS, Kagda MS, Snyder MP, Hitz BC, Moore JE, Weng Z, Bennett D, Reinholdt L, Ljungman M, Beer MA, Gerstein MB, Pachter L, Guigó R, Wold BJ, Mortazavi A. The ENCODE4 long-read RNA-seq collection reveals distinct classes of transcript structure diversity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.15.540865. [PMID: 37292896 PMCID: PMC10245583 DOI: 10.1101/2023.05.15.540865] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Indexed: 06/10/2023]

Abstract

The majority of mammalian genes encode multiple transcript isoforms that result from differential promoter use, changes in exonic splicing, and alternative 3' end choice. Detecting and quantifying transcript isoforms across tissues, cell types, and species has been extremely challenging because transcripts are much longer than the short reads normally used for RNA-seq. By contrast, long-read RNA-seq (LR-RNA-seq) gives the complete structure of most transcripts. We sequenced 264 LR-RNA-seq PacBio libraries totaling over 1 billion circular consensus reads (CCS) for 81 unique human and mouse samples. We detect at least one full-length transcript from 87.7% of annotated human protein coding genes and a total of 200,000 full-length transcripts, 40% of which have novel exon junction chains. To capture and compute on the three sources of transcript structure diversity, we introduce a gene and transcript annotation framework that uses triplets representing the transcript start site, exon junction chain, and transcript end site of each transcript. Using triplets in a simplex representation demonstrates how promoter selection, splice pattern, and 3' processing are deployed across human tissues, with nearly half of multi-transcript protein coding genes showing a clear bias toward one of the three diversity mechanisms. Evaluated across samples, the predominantly expressed transcript changes for 74% of protein coding genes. In evolution, the human and mouse transcriptomes are globally similar in types of transcript structure diversity, yet among individual orthologous gene pairs, more than half (57.8%) show substantial differences in mechanism of diversification in matching tissues. This initial large-scale survey of human and mouse long-read transcriptomes provides a foundation for further analyses of alternative transcript usage, and is complemented by short-read and microRNA data on the same samples and by epigenome data elsewhere in the ENCODE4 collection.

Affiliation(s)

Fairlie Reese Developmental and Cell Biology, University of California, Irvine, Irvine, USA; Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Brian Williams Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, USA
Gabriela Balderrama-Gutierrez Developmental and Cell Biology, University of California, Irvine, Irvine, USA; Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Dana Wyman Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Muhammed Hasan Çelik Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Elisabeth Rebboah Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Narges Rezaie Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Diane Trout Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, USA
Milad Razavi-Mohseni Department of Biomedical Engineering, Johns Hopkins University, Baltimore, USA; McKusick-Nathans Department of Genetic Medicine, Johns Hopkins University, Baltimore, USA
Yunzhe Jiang Program in Computational Biology and Bioinformatics, Yale University, New Haven, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, USA
Beatrice Borsari Program in Computational Biology and Bioinformatics, Yale University, New Haven, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, USA; Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
Samuel Morabito Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Heidi Yahan Liang Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Cassandra J McGill Developmental and Cell Biology, University of California, Irvine, Irvine, USA; Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Sorena Rahmanian Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Jasmine Sakr Center for Complex Biological Systems, University of California, Irvine, Irvine, USA; Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, USA
Shan Jiang Developmental and Cell Biology, University of California, Irvine, Irvine, USA; Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Weihua Zeng Developmental and Cell Biology, University of California, Irvine, Irvine, USA; Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Klebea Carvalho Center for Complex Biological Systems, University of California, Irvine, Irvine, USA
Annika K Weimer Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
Louise A Dionne The Jackson Laboratory, The Jackson Laboratory, Bar Harbor, USA
Ariel McShane Cellular and Molecular Biology Program, University of Michigan, Ann Arbor, USA; Department of Radiation Oncology, University of Michigan, Ann Arbor, USA
Karan Bedi Department of Biostatistics, University of Michigan, Ann Arbor, USA; Center for RNA Biomedicine and Rogel Cancer Center, University of Michigan, Ann Arbor, USA
Shaimae I Elhajjajy Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, USA
Sean Upchurch Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, USA
Jennifer Jou Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
Ingrid Youngworth Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
Idan Gabdank Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
Paul Sud Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
Otto Jolanki Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
J Seth Strattan Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
Meenakshi S Kagda Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
Michael P Snyder Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
Ben C Hitz Department of Genetics, Stanford University School of Medicine, Palo Alto, USA
Jill E Moore Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, USA
Zhiping Weng Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, USA
David Bennett Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, USA; Department of Neurological Sciences, Rush University Medical Center, Chicago, USA
Laura Reinholdt The Jackson Laboratory, The Jackson Laboratory, Bar Harbor, USA
Mats Ljungman Center for RNA Biomedicine and Rogel Cancer Center, University of Michigan, Ann Arbor, USA; Departments of Radiation Oncology and Environmental Health Sciences, University of Michigan, Ann Arbor, USA
Michael A Beer Department of Biomedical Engineering, Johns Hopkins University, Baltimore, USA; McKusick-Nathans Department of Genetic Medicine, Johns Hopkins University, Baltimore, USA
Mark B Gerstein Program in Computational Biology and Bioinformatics, Yale University, New Haven, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, USA; Section on Biomedical Informatics and Data Science, Yale University, New Haven, USA; Department of Statistics and Data Science, Yale University, New Haven, USA; Department of Computer Science, Yale University, New Haven, USA
Lior Pachter Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, USA; Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, USA
Roderic Guigó Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain; Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona, Spain
Barbara J Wold Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, USA
Ali Mortazavi Developmental and Cell Biology, University of California, Irvine, Irvine, USA; Center for Complex Biological Systems, University of California, Irvine, Irvine, USA

Toomata Z, Leask M, Krishnan M, Cadzow M, Dalbeth N, Stamp LK, de Zoysa J, Merriman T, Wilcox P, Dewes O, Murphy R. Genetic testing for misclassified monogenic diabetes in Māori and Pacific peoples in Aōtearoa New Zealand with early-onset type 2 diabetes. Front Endocrinol (Lausanne) 2023;14:1174699. [PMID: 37234800 PMCID: PMC10206310 DOI: 10.3389/fendo.2023.1174699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/27/2023] [Accepted: 04/20/2023] [Indexed: 05/28/2023]

Weisburd B, Tiao G, Rehm HL. Insights from a genome-wide truth set of tandem repeat variation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.05.539588. [PMID: 37214979 PMCID: PMC10197592 DOI: 10.1101/2023.05.05.539588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/24/2023]

Abstract

Tools for genotyping tandem repeats (TRs) from short read sequencing data have improved significantly over the past decade. Extensive comparisons of these tools to gold standard diagnostic methods like RP-PCR have confirmed their accuracy for tens to hundreds of well-studied loci. However, a scarcity of high-quality orthogonal truth data limited our ability to measure tool accuracy for the millions of other loci throughout the genome. To address this, we developed a TR truth set based on the Synthetic Diploid Benchmark (SynDip). By identifying the subset of insertions and deletions that represent TR expansions or contractions with motifs between 2 and 50 base pairs, we obtained accurate genotypes for 139,795 pure and 6,845 interrupted repeats in a single diploid sample. Our approach did not require running existing genotyping tools on short read or long read sequencing data and provided an alternative, more accurate view of tandem repeat variation. We applied this truth set to compare the strengths and weaknesses of widely-used tools for genotyping TRs, evaluated the completeness of existing genome-wide TR catalogs, and explored the properties of tandem repeat variation throughout the genome. We found that, without filtering, ExpansionHunter had higher accuracy than GangSTR and HipSTR over a wide range of motifs and allele sizes. Also, when errors in allele size occurred, ExpansionHunter tended to overestimate expansion sizes, while GangSTR tended to underestimate them. Additionally, we saw that widely-used TR catalogs miss between 16% and 41% of variant loci in the truth set. These results suggest that genome-wide analyses would benefit from genotyping a larger set of loci as well as further tool development that builds on the strengths of current algorithms. 
To that end, we developed a new catalog of 2.8 million loci that captures 95% of variant loci in the truth set, and created a modified version of ExpansionHunter that runs 2 to 3x faster than the original while producing the same output.

Smith C, Kitzman JO. Benchmarking splice variant prediction algorithms using massively parallel splicing assays. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.04.539398. [PMID: 37205456 PMCID: PMC10187268 DOI: 10.1101/2023.05.04.539398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/21/2023]

Abstract

Background

Variants that disrupt mRNA splicing account for a sizable fraction of the pathogenic burden in many genetic disorders, but identifying splice-disruptive variants (SDVs) beyond the essential splice site dinucleotides remains difficult. Computational predictors are often discordant, compounding the challenge of variant interpretation. Because they are primarily validated using clinical variant sets heavily biased to known canonical splice site mutations, it remains unclear how well their performance generalizes.

Results

We benchmarked eight widely used splicing effect prediction algorithms, leveraging massively parallel splicing assays (MPSAs) as a source of experimentally determined ground-truth. MPSAs simultaneously assay many variants to nominate candidate SDVs. We compared experimentally measured splicing outcomes with bioinformatic predictions for 3,616 variants in five genes. Algorithms' concordance with MPSA measurements, and with each other, was lower for exonic than intronic variants, underscoring the difficulty of identifying missense or synonymous SDVs. Deep learning-based predictors trained on gene model annotations achieved the best overall performance at distinguishing disruptive and neutral variants. Controlling for overall call rate genome-wide, SpliceAI and Pangolin also showed superior overall sensitivity for identifying SDVs. Finally, our results highlight two practical considerations when scoring variants genome-wide: finding an optimal score cutoff, and the substantial variability introduced by differences in gene model annotation, and we suggest strategies for optimal splice effect prediction in the face of these issues.

Conclusion

SpliceAI and Pangolin showed the best overall performance among the predictors tested; however, improvements in splice effect prediction are still needed, especially within exons.

Hofman DA, Ruiz-Orera J, Yannuzzi I, Murugesan R, Brown A, Clauser KR, Condurat AL, van Dinter JT, Engels SA, Goodale A, van der Lugt J, Abid T, Wang L, Zhou KN, Vogelzang J, Ligon KL, Phoenix TN, Roth JA, Root DE, Hubner N, Golub TR, Bandopadhayay P, van Heesch S, Prensner JR. Translation of non-canonical open reading frames as a cancer cell survival mechanism in childhood medulloblastoma. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.04.539399. [PMID: 37205492 PMCID: PMC10187264 DOI: 10.1101/2023.05.04.539399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/21/2023]

Affiliation(s)

Damon A. Hofman Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands (these authors contributed equally)
Jorge Ruiz-Orera Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany (these authors contributed equally)
Ian Yannuzzi Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Rakesh Murugesan Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Adam Brown Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Current address: Arbor Biotechnologies, Cambridge, MA, 02140, USA
Karl R. Clauser Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Alexandra L. Condurat Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
Jip T. van Dinter Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
Sem A.G. Engels Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
Amy Goodale Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Jasper van der Lugt Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
Tanaz Abid Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Li Wang Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Kevin N. Zhou Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA Current address: Kaiser Permanente Bernard J. Tyson School of Medicine, Pasadena, CA, 91101, USA
Jayne Vogelzang Department of Pathology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, 02215, USA Department of Pathology, Brigham and Women’s Hospital, Boston, MA, 02215, USA
Keith L. Ligon Department of Pathology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, 02215, USA Department of Pathology, Brigham and Women’s Hospital, Boston, MA, 02215, USA Department of Pathology, Boston Children’s Hospital, Boston MA 02115
Timothy N. Phoenix Division of Pharmaceutical Sciences, James L. Winkle College of Pharmacy, University of Cincinnati, Cincinnati, OH, 45229, USA
Jennifer A. Roth Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
David E. Root Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Norbert Hubner Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany Charité-Universitätsmedizin, 10117 Berlin, Germany German Centre for Cardiovascular Research, Partner Site Berlin, 13347 Berlin, Germany
Todd R. Golub Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA Division of Pediatric Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115, USA
Pratiti Bandopadhayay Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA Division of Pediatric Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115, USA
Sebastiaan van Heesch Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
John R. Prensner Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA Division of Pediatric Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115, USA Current address: Department of Pediatrics, Division of Pediatric Hematology/Oncology, University of Michigan Medical School, Ann Arbor, MI 48109, USA

Pagni S, Custodio HM, Frankish A, Mudge JM, Mills JD, Sisodiya SM. SCN1A: bioinformatically informed revised boundaries for promoter and enhancer regions. Hum Mol Genet 2023;32:1753-1763. [PMID: 36715146 PMCID: PMC10162429 DOI: 10.1093/hmg/ddad015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/07/2022] [Revised: 01/06/2023] [Accepted: 01/24/2023] [Indexed: 01/31/2023]

Zhang Q, Shao M. Transcript Assembly and Annotations: Bias and Adjustment. bioRxiv 2023:2023.04.20.537700. [PMID: 37131680 PMCID: PMC10153229 DOI: 10.1101/2023.04.20.537700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/04/2023]

Kerimov N, Tambets R, Hayhurst JD, Rahu I, Kolberg P, Raudvere U, Kuzmin I, Chowdhary A, Vija A, Teras HJ, Kanai M, Ulirsch J, Ryten M, Hardy J, Guelfi S, Trabzuni D, Kim-Hellmuth S, Rayner W, Finucane H, Peterson H, Mosaku A, Parkinson H, Alasoo K. Systematic visualisation of molecular QTLs reveals variant mechanisms at GWAS loci. bioRxiv 2023:2023.04.06.535816. [PMID: 37066341 PMCID: PMC10104061 DOI: 10.1101/2023.04.06.535816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/18/2023]

Affiliation(s)

Nurlan Kerimov: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia; Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Ralf Tambets: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia
James D Hayhurst: Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Ida Rahu: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia
Peep Kolberg: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia
Uku Raudvere: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia
Ivan Kuzmin: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia
Anshika Chowdhary: Institute of Translational Genomics, Helmholtz Munich, Neuherberg, Germany
Andreas Vija: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia
Hans J Teras: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia
Masahiro Kanai: Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Jacob Ulirsch: Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Mina Ryten: Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, University College London, London
John Hardy: Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, University College London, London
Sebastian Guelfi: Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, University College London, London
Daniah Trabzuni: Department of Genetics and Genomic Medicine, Great Ormond Street Institute of Child Health, University College London, London
Sarah Kim-Hellmuth: Institute of Translational Genomics, Helmholtz Munich, Neuherberg, Germany; Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital LMU Munich, Munich, Germany
Will Rayner: Institute of Translational Genomics, Helmholtz Munich, Neuherberg, Germany
Hilary Finucane: Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Hedi Peterson: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia
Abayomi Mosaku: Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Helen Parkinson: Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Kaur Alasoo: Institute of Computer Science, University of Tartu, Tartu, 51009, Estonia; Open Targets, South Building, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK

Omenn GS, Lane L, Overall CM, Pineau C, Packer NH, Cristea IM, Lindskog C, Weintraub ST, Orchard S, Roehrl MH, Nice E, Liu S, Bandeira N, Chen YJ, Guo T, Aebersold R, Moritz RL, Deutsch EW. The 2022 Report on the Human Proteome from the HUPO Human Proteome Project. J Proteome Res 2023;22:1024-1042. [PMID: 36318223 PMCID: PMC10081950 DOI: 10.1021/acs.jproteome.2c00498] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Indexed: 11/07/2022]