Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Weghorn D, Balick DJ, Cassa C, Kosmicki JA, Daly MJ, Beier DR, Sunyaev SR. Applicability of the Mutation-Selection Balance Model to Population Genetics of Heterozygous Protein-Truncating Variants in Humans. Mol Biol Evol 2020;36:1701-1710. [PMID: 31004148 DOI: 10.1093/molbev/msz092] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Indexed: 12/29/2022]

Cited by Other Article(s)

Chao KR, Wang L, Panchal R, Liao C, Abderrazzaq H, Ye R, Schultz P, Compitello J, Grant RH, Kosmicki JA, Weisburd B, Phu W, Wilson MW, Laricchia KM, Goodrich JK, Goldstein D, Goldstein JI, Vittal C, Poterba T, Baxter S, Watts NA, Solomonson M, Tiao G, Rehm HL, Neale BM, Talkowski ME, MacArthur DG, O'Donnell-Luria A, Karczewski KJ, Radivojac P, Daly MJ, Samocha KE. The landscape of regional missense mutational intolerance quantified from 125,748 exomes. bioRxiv 2024:2024.04.11.588920. [PMID: 38645134 PMCID: PMC11030311 DOI: 10.1101/2024.04.11.588920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/23/2024]

Fan WTL, Wakeley J. Latent mutations in the ancestries of alleles under selection. Theor Popul Biol 2024:S0040-5809(24)00041-8. [PMID: 38697365 DOI: 10.1016/j.tpb.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/02/2023] [Revised: 04/23/2024] [Accepted: 04/29/2024] [Indexed: 05/05/2024]

Zeng T, Spence JP, Mostafavi H, Pritchard JK. Bayesian estimation of gene constraint from an evolutionary model with gene features. bioRxiv 2024:2023.05.19.541520. [PMID: 37292653 PMCID: PMC10245655 DOI: 10.1101/2023.05.19.541520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/10/2023]

Zhao Y, Zhong G, Hagen J, Pan H, Chung WK, Shen Y. A probabilistic graphical model for estimating selection coefficient of missense variants from human population sequence data. medRxiv 2023:2023.12.11.23299809. [PMID: 38168397 PMCID: PMC10760286 DOI: 10.1101/2023.12.11.23299809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/05/2024]

Seplyarskiy V, Koch EM, Lee DJ, Lichtman JS, Luan HH, Sunyaev SR. A mutation rate model at the basepair resolution identifies the mutagenic effect of polymerase III transcription. Nat Genet 2023;55:2235-2242. [PMID: 38036792 DOI: 10.1038/s41588-023-01562-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/20/2022] [Accepted: 10/06/2023] [Indexed: 12/02/2023]

Sun KY, Bai X, Chen S, Bao S, Kapoor M, Zhang C, Backman J, Joseph T, Maxwell E, Mitra G, Gorovits A, Mansfield A, Boutkov B, Gokhale S, Habegger L, Marcketta A, Locke A, Kessler MD, Sharma D, Staples J, Bovijn J, Gelfman S, Gioia AD, Rajagopal V, Lopez A, Varela JR, Alegre J, Berumen J, Tapia-Conyer R, Kuri-Morales P, Torres J, Emberson J, Collins R, Cantor M, Thornton T, Kang HM, Overton J, Shuldiner AR, Cremona ML, Nafde M, Baras A, Abecasis G, Marchini J, Reid JG, Salerno W, Balasubramanian S. A deep catalog of protein-coding variation in 985,830 individuals. bioRxiv 2023:2023.05.09.539329. [PMID: 37214792 PMCID: PMC10197621 DOI: 10.1101/2023.05.09.539329] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 05/24/2023]

Abstract

Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants. Individuals of African, Admixed American, East Asian, Middle Eastern, and South Asian ancestry account for 20% of this Exome dataset. Our catalog of variants includes approximately 10.5 million missense (54% novel) and 1.1 million predicted loss-of-function (pLOF) variants (65% novel, 53% observed only once). We identified individuals with rare homozygous pLOF variants in 4,874 genes, and for 1,838 of these this work is the first to document at least one pLOF homozygote. Additional insights from the RGC-ME dataset include 1) improved estimates of selection against heterozygous loss-of-function and identification of 3,459 genes intolerant to loss-of-function, 83 of which were previously assessed as tolerant to loss-of-function and 1,241 that lack disease annotations; 2) identification of regions depleted of missense variation in 457 genes that are tolerant to loss-of-function; 3) functional interpretation for 10,708 variants of unknown or conflicting significance reported in ClinVar as cryptic splice sites using splicing score thresholds based on empirical variant deleteriousness scores derived from RGC-ME; and 4) an observation that approximately 3% of sequenced individuals carry a clinically actionable genetic variant in the ACMG SF 3.1 list of genes. We make this important resource of coding variation available to the public through a variant allele frequency browser. 
We anticipate that this report and the RGC-ME dataset will serve as a valuable reference for understanding rare coding variation and help advance precision medicine efforts.

Affiliation(s)

Kathie Y. Sun Regeneron Genetics Center, Tarrytown, NY, USA
Xiaodong Bai Regeneron Genetics Center, Tarrytown, NY, USA
Siying Chen Regeneron Genetics Center, Tarrytown, NY, USA
Suying Bao Regeneron Genetics Center, Tarrytown, NY, USA
Manav Kapoor Regeneron Genetics Center, Tarrytown, NY, USA
Chuanyi Zhang Regeneron Genetics Center, Tarrytown, NY, USA
Joshua Backman Regeneron Genetics Center, Tarrytown, NY, USA
Tyler Joseph Regeneron Genetics Center, Tarrytown, NY, USA
Evan Maxwell Regeneron Genetics Center, Tarrytown, NY, USA
George Mitra Regeneron Genetics Center, Tarrytown, NY, USA
Alexander Gorovits Regeneron Genetics Center, Tarrytown, NY, USA
Adam Mansfield Regeneron Genetics Center, Tarrytown, NY, USA
Boris Boutkov Regeneron Genetics Center, Tarrytown, NY, USA
Sujit Gokhale Regeneron Genetics Center, Tarrytown, NY, USA
Lukas Habegger Regeneron Genetics Center, Tarrytown, NY, USA
Anthony Marcketta Regeneron Genetics Center, Tarrytown, NY, USA
Adam Locke Regeneron Genetics Center, Tarrytown, NY, USA
Michael D. Kessler Regeneron Genetics Center, Tarrytown, NY, USA
Deepika Sharma Regeneron Genetics Center, Tarrytown, NY, USA
Jeffrey Staples Regeneron Genetics Center, Tarrytown, NY, USA
Jonas Bovijn Regeneron Genetics Center, Tarrytown, NY, USA
Sahar Gelfman Regeneron Genetics Center, Tarrytown, NY, USA
Alessandro Di Gioia Regeneron Genetics Center, Tarrytown, NY, USA
Veera Rajagopal Regeneron Genetics Center, Tarrytown, NY, USA
Alexander Lopez Regeneron Genetics Center, Tarrytown, NY, USA
Jennifer Rico Varela Regeneron Genetics Center, Tarrytown, NY, USA
Jesus Alegre Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
Jaime Berumen Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
Roberto Tapia-Conyer Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
Pablo Kuri-Morales Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
Jason Torres Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Jonathan Emberson Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK; MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Rory Collins Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Regeneron Genetics Center Regeneron Genetics Center, Tarrytown, NY, USA
RGC-ME Cohort Partners
Michael Cantor Regeneron Genetics Center, Tarrytown, NY, USA
Timothy Thornton Regeneron Genetics Center, Tarrytown, NY, USA
Hyun Min Kang Regeneron Genetics Center, Tarrytown, NY, USA
John Overton Regeneron Genetics Center, Tarrytown, NY, USA
Alan R. Shuldiner Regeneron Genetics Center, Tarrytown, NY, USA
M. Laura Cremona Regeneron Genetics Center, Tarrytown, NY, USA
Mona Nafde Regeneron Genetics Center, Tarrytown, NY, USA
Aris Baras Regeneron Genetics Center, Tarrytown, NY, USA
Goncalo Abecasis Regeneron Genetics Center, Tarrytown, NY, USA
Jonathan Marchini Regeneron Genetics Center, Tarrytown, NY, USA
Jeffrey G. Reid Regeneron Genetics Center, Tarrytown, NY, USA
William Salerno Regeneron Genetics Center, Tarrytown, NY, USA
Suganthi Balasubramanian Regeneron Genetics Center, Tarrytown, NY, USA

Spence JP, Zeng T, Mostafavi H, Pritchard JK. Scaling the discrete-time Wright-Fisher model to biobank-scale datasets. Genetics 2023;225:iyad168. [PMID: 37724741 PMCID: PMC10627256 DOI: 10.1093/genetics/iyad168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/01/2023] [Revised: 06/01/2023] [Accepted: 09/08/2023] [Indexed: 09/21/2023]

Foreman J, Perrett D, Mazaika E, Hunt SE, Ware JS, Firth HV. DECIPHER: Improving Genetic Diagnosis Through Dynamic Integration of Genomic and Clinical Data. Annu Rev Genomics Hum Genet 2023;24:151-176. [PMID: 37285546 PMCID: PMC7615097 DOI: 10.1146/annurev-genom-102822-100509] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 06/09/2023]

Spence JP, Zeng T, Mostafavi H, Pritchard JK. Scaling the Discrete-time Wright Fisher model to biobank-scale datasets. bioRxiv 2023:2023.05.19.541517. [PMID: 37293115 PMCID: PMC10245735 DOI: 10.1101/2023.05.19.541517] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 06/10/2023]

Abstract

The Discrete-Time Wright Fisher (DTWF) model and its large population diffusion limit are central to population genetics. These models describe the forward-in-time evolution of the frequency of an allele in a population and can include the fundamental forces of genetic drift, mutation, and selection. Computing likelihoods under the diffusion process is feasible, but the diffusion approximation breaks down for large sample sizes or in the presence of strong selection. Unfortunately, existing methods for computing likelihoods under the DTWF model do not scale to current exome sequencing sample sizes in the hundreds of thousands. Here we present an algorithm that approximates the DTWF model with provably bounded error and runs in time linear in the size of the population. Our approach relies on two key observations about Binomial distributions. The first is that Binomial distributions are approximately sparse. The second is that Binomial distributions with similar success probabilities are extremely close as distributions, allowing us to approximate the DTWF Markov transition matrix as a very low rank matrix. Together, these observations enable matrix-vector multiplication in linear (as opposed to the usual quadratic) time. We prove similar properties for Hypergeometric distributions, enabling fast computation of likelihoods for subsamples of the population. We show theoretically and in practice that this approximation is highly accurate and can scale to population sizes in the billions, paving the way for rigorous biobank-scale population genetic inference. Finally, we use our results to estimate how increasing sample sizes will improve the estimation of selection coefficients acting on loss-of-function variants. We find that increasing sample sizes beyond existing large exome sequencing cohorts will provide essentially no additional information except for genes with the most extreme fitness effects.
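
The first observation in this abstract (Binomial rows are approximately sparse) can be illustrated with a small sketch. This is not the authors' algorithm: it only truncates each row of the transition matrix to a window around its mean, and it does not implement the low-rank step or reach linear time. The selection and mutation parameterization in `success_prob`, the function names, and the window settings `width` and `pad` are assumptions chosen for illustration.

```python
import math


def binom_pmf(n, k, p):
    """Binomial(n, p) probability of exactly k successes, via log-gamma."""
    if p <= 0.0:
        return 1.0 if k == 0 else 0.0
    if p >= 1.0:
        return 1.0 if k == n else 0.0
    log_pmf = (
        math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
        + k * math.log(p) + (n - k) * math.log1p(-p)
    )
    return math.exp(log_pmf)


def success_prob(i, n, s, mu):
    """Sampling probability for the next generation given i of n copies now.

    Assumed model (not taken from the paper): genic selection with relative
    fitness 1 + s for the allele, followed by one-way mutation at rate mu.
    """
    x = i / n
    x_sel = x * (1.0 + s) / (1.0 + s * x)
    return x_sel + mu * (1.0 - x_sel)


def dtwf_step_dense(v, s, mu):
    """One generation of the DTWF chain, touching all (n + 1)^2 entries."""
    n = len(v) - 1
    out = [0.0] * (n + 1)
    for i, mass in enumerate(v):
        p = success_prob(i, n, s, mu)
        for j in range(n + 1):
            out[j] += mass * binom_pmf(n, j, p)
    return out


def dtwf_step_truncated(v, s, mu, width=8.0, pad=10):
    """Same update, but each row is restricted to a window around its mean.

    The window (width standard deviations plus a constant pad) is a
    heuristic choice for this sketch, not a bound from the paper.
    """
    n = len(v) - 1
    out = [0.0] * (n + 1)
    for i, mass in enumerate(v):
        if mass == 0.0:
            continue
        p = success_prob(i, n, s, mu)
        mean = n * p
        half = width * math.sqrt(n * p * (1.0 - p)) + pad
        lo = max(0, math.floor(mean - half))
        hi = min(n, math.ceil(mean + half))
        for j in range(lo, hi + 1):
            out[j] += mass * binom_pmf(n, j, p)
    return out
```

Here `v` is a probability vector over allele counts 0 to n. For a rare allele the window covers only a small fraction of the n + 1 possible counts, which is where the saving comes from; the constant pad is there because at very low frequencies the Binomial is closer to a Poisson than to a Gaussian, so a window based on standard deviations alone would be too narrow.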

Barroso GV, Lohmueller KE. Inferring the mode and strength of ongoing selection. Genome Res 2023;33:632-643. [PMID: 37055196 PMCID: PMC10234300 DOI: 10.1101/gr.276386.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/12/2021] [Accepted: 03/29/2023] [Indexed: 04/15/2023]

Agarwal I, Fuller ZL, Myers SR, Przeworski M. Relating pathogenic loss-of-function mutations in humans to their evolutionary fitness costs. eLife 2023;12:83172. [PMID: 36648429 PMCID: PMC9937649 DOI: 10.7554/elife.83172] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Received: 09/01/2022] [Accepted: 01/16/2023] [Indexed: 01/18/2023]

Zug R, Uller T. Evolution and dysfunction of human cognitive and social traits: A transcriptional regulation perspective. Evolutionary Human Sciences 2022;4:e43. [PMID: 37588924 PMCID: PMC10426018 DOI: 10.1017/ehs.2022.42] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/24/2022] [Revised: 08/11/2022] [Accepted: 09/11/2022] [Indexed: 11/07/2022]

Baake E, Cordero F, Hummel S. Lines of descent in the deterministic mutation–selection model with pairwise interaction. Ann Appl Probab 2022. [DOI: 10.1214/21-aap1736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/19/2022]

Extreme purifying selection against point mutations in the human genome. Nat Commun 2022;13:4312. [PMID: 35879308 PMCID: PMC9314448 DOI: 10.1038/s41467-022-31872-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/10/2021] [Accepted: 07/07/2022] [Indexed: 12/13/2022]

Gardner EJ, Neville MDC, Samocha KE, Barclay K, Kolk M, Niemi MEK, Kirov G, Martin HC, Hurles ME. Reduced reproductive success is associated with selective constraint on human genes. Nature 2022;603:858-863. [PMID: 35322230 DOI: 10.1038/s41586-022-04549-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Received: 05/21/2020] [Accepted: 02/07/2022] [Indexed: 12/22/2022]

Balick DJ, Jordan DM, Sunyaev S, Do R. Overcoming constraints on the detection of recessive selection in human genes from population frequency data. Am J Hum Genet 2022;109:33-49. [PMID: 34951958 DOI: 10.1016/j.ajhg.2021.12.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/22/2021] [Accepted: 11/30/2021] [Indexed: 11/01/2022]

Agarwal I, Przeworski M. Mutation saturation for fitness effects at human CpG sites. eLife 2021;10:e71513. [PMID: 34806592 PMCID: PMC8683084 DOI: 10.7554/elife.71513] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 06/22/2021] [Accepted: 11/21/2021] [Indexed: 01/06/2023]

Measuring intolerance to mutation in human genetics. Nat Genet 2019;51:772-776. [PMID: 30962618 DOI: 10.1038/s41588-019-0383-1] [Citation(s) in RCA: 71] [Impact Index Per Article: 14.2] [Received: 07/31/2018] [Accepted: 02/22/2019] [Indexed: 01/07/2023]