Reference Citation Analysis

For: Peng B, Leong MC, Chen HS, Rotunno M, Brignole KR, Clarke J, Mechanic LE. Genetic Simulation Resources and the GSR Certification Program. Bioinformatics 2018;35:709-710. [PMID: 30101297 DOI: 10.1093/bioinformatics/bty666] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Received: 03/27/2018] [Revised: 03/27/2018] [Accepted: 08/06/2018] [Indexed: 11/14/2022] Open

Cited by Other Article(s)

Cui Z, Schumacher FR. Small-group originating model: Optimized individual-level GWAS simulation featured by SLiM and using open-access data. Comput Biol Chem 2024;112:108147. [PMID: 39033733 DOI: 10.1016/j.compbiolchem.2024.108147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/18/2024] [Revised: 05/22/2024] [Accepted: 07/08/2024] [Indexed: 07/23/2024]

Abstract

The development of analytical methods for Genome-wide Association Studies (GWAS) has outpaced the evolution of simulation techniques and pipelines. This disparity underscores the importance of innovative simulation methods that can keep pace with the rapidly increasing scale of GWAS. The median sample size of GWAS over the past ten years has exceeded 50,000 individuals, a trend that emphasizes the need for simulation tools capable of generating data on a similar or larger scale. This paper introduces a novel method, the small-group originating (SGO) model, utilizing the SLiM software for simulating individual-level GWAS data. Our standardized protocol facilitates the generation of tens of thousands of pseudo-individuals with millions of variants from small (30-90) open-access datasets. SGO stands out, especially when compared to the widely used resampling method in HapGen, showcasing superior simulation efficiency for large sample sizes (> 13,000) of unrelated individuals. This capability is particularly relevant given the current trajectory towards larger GWAS, necessitating tools that can simulate datasets reflective of this growth. Additionally, SGO provides customization options and can model dynamic life cycles and mating across generations, positioning it as a highly promising alternative for GWAS simulations. In a case study, sensitivity analyses of chromosome-level principal component analysis and kinship coefficient estimation were conducted. The results highlighted the poor robustness of chromosome-level quality control (QC) indexes and the uneven distribution of population structure across chromosomes and ancestries, which argues for caution against relying solely on chromosome-level QC statistics. With its flexible and efficient approach to generating pseudo GWAS data, our standardized SGO protocol emerges as a crucial asset for method development, power analysis, and benchmarking in GWAS research. It is especially vital in the context of accommodating the demands for large-scale simulations, aligning with the current and future scale of GWAS.

Scandino R, Calabrese F, Romanel A. Synggen: fast and data-driven generation of synthetic heterogeneous NGS cancer data. Bioinformatics 2022;39:6885441. [PMID: 36484701 PMCID: PMC9825741 DOI: 10.1093/bioinformatics/btac792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/15/2022] [Revised: 11/02/2022] [Accepted: 12/08/2022] [Indexed: 12/13/2022] Open

Petrillo M, Fabbri M, Kagkli DM, Querci M, Van den Eede G, Alm E, Aytan-Aktug D, Capella-Gutierrez S, Carrillo C, Cestaro A, Chan KG, Coque T, Endrullat C, Gut I, Hammer P, Kay GL, Madec JY, Mather AE, McHardy AC, Naas T, Paracchini V, Peter S, Pightling A, Raffael B, Rossen J, Ruppé E, Schlaberg R, Vanneste K, Weber LM, Westh H, Angers-Loustau A. A roadmap for the generation of benchmarking resources for antimicrobial resistance detection using next generation sequencing. F1000Res 2022;10:80. [PMID: 35847383 PMCID: PMC9243550 DOI: 10.12688/f1000research.39214.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 03/10/2022] [Indexed: 11/20/2022] Open

Affiliation(s)

Mauro Petrillo: European Commission Joint Research Centre, Ispra, Italy
Marco Fabbri: European Commission Joint Research Centre, Ispra, Italy
Dafni Maria Kagkli: European Commission Joint Research Centre, Ispra, Italy
Maddalena Querci: European Commission Joint Research Centre, Ispra, Italy
Guy Van den Eede: European Commission Joint Research Centre, Ispra, Italy; European Commission Joint Research Centre, Geel, Belgium
Erik Alm: The European Centre for Disease Prevention and Control, Stockholm, Sweden
Derya Aytan-Aktug: National Food Institute, Technical University of Denmark, Lyngby, Denmark
Salvador Capella-Gutierrez: Barcelona Supercomputing Centre (BSC), Barcelona, Spain
Catherine Carrillo: Ottawa Laboratory – Carling, Canadian Food Inspection Agency, Ottawa, Ontario, Canada
Alessandro Cestaro: Fondazione Edmund Mach, San Michele all'Adige (TN), Italy
Kok-Gan Chan: International Genome Centre, Jiangsu University, Zhenjiang, China; Division of Genetics and Molecular Biology, Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
Teresa Coque: Servicio de Microbiología, Hospital Universitario Ramón y Cajal, Instituto Ramón y Cajal de Investigación Sanitaria (IRYCIS), Madrid, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Carlos III Health Institute, Madrid, Spain
Christoph Endrullat: MSD SHARP & DOHME GMBH, Haar, Germany
Ivo Gut: Centro Nacional de Análisis Genómico, Centre for Genomic Regulation (CNAG-CRG), Barcelona Institute of Technology, Barcelona, Spain; Universitat Pompeu Fabra, Barcelona, Spain
Paul Hammer: BIOMES. NGS GmbH c/o Technische Hochschule Wildau, Wildau, Germany
Gemma L. Kay: Quadram Institute Bioscience, Norwich Research Park, Norwich, UK
Jean-Yves Madec: Unité Antibiorésistance et Virulence Bactériennes, ANSES Site de Lyon, Lyon, France
Alison E. Mather: Quadram Institute Bioscience, Norwich Research Park, Norwich, UK; University of East Anglia, Norwich, UK
Alice Carolyn McHardy: Helmholtz Centre for Infection Research, Braunschweig, Germany
Thierry Naas: French-NRC for CPEs, Service de Bactériologie-Hygiène, Hôpital de Bicêtre, Le Kremlin-Bicêtre, France
Valentina Paracchini: European Commission Joint Research Centre, Ispra, Italy
Silke Peter: Institute of Medical Microbiology and Hygiene, University of Tübingen, Tübingen, Germany
Arthur Pightling: Center for Food Safety and Applied Nutrition, US Food and Drug Administration, College Park, MD, USA
Barbara Raffael: European Commission Joint Research Centre, Ispra, Italy
John Rossen: Department of Medical Microbiology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
Etienne Ruppé: IAME, Université de Paris, Paris, France
Robert Schlaberg: Department of Pathology, University of Utah, Salt Lake City, UT, USA
Kevin Vanneste: Transversal activities in Applied Genomics, Sciensano, Brussels, Belgium
Lukas M. Weber: Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland; SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland; Present address: Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Henrik Westh: Hvidovre University Hospital, Hvidovre, Denmark
Alexandre Angers-Loustau: European Commission Publications Office, Luxembourg, Luxembourg

Barajas R, Hair B, Lai G, Rotunno M, Shams-White MM, Gillanders EM, Mechanic LE. Facilitating cancer systems epidemiology research. PLoS One 2022;16:e0255328. [PMID: 34972102 PMCID: PMC8719747 DOI: 10.1371/journal.pone.0255328] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/19/2022] Open

Petrillo M, Fabbri M, Kagkli DM, Querci M, Van den Eede G, Alm E, Aytan-Aktug D, Capella-Gutierrez S, Carrillo C, Cestaro A, Chan KG, Coque T, Endrullat C, Gut I, Hammer P, Kay GL, Madec JY, Mather AE, McHardy AC, Naas T, Paracchini V, Peter S, Pightling A, Raffael B, Rossen J, Ruppé E, Schlaberg R, Vanneste K, Weber LM, Westh H, Angers-Loustau A. A roadmap for the generation of benchmarking resources for antimicrobial resistance detection using next generation sequencing. F1000Res 2021;10:80. [PMID: 35847383 PMCID: PMC9243550 DOI: 10.12688/f1000research.39214.1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Accepted: 03/10/2022] [Indexed: 10/31/2024] Open

Blumenthal DB, Viola L, List M, Baumbach J, Tieri P, Kacprowski T. EpiGEN: an epistasis simulation pipeline. Bioinformatics 2020;36:4957-4959. [DOI: 10.1093/bioinformatics/btaa245] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 11/14/2019] [Revised: 04/03/2020] [Accepted: 04/08/2020] [Indexed: 02/06/2023] Open

Juan L, Wang Y, Jiang J, Yang Q, Jiang Q, Wang Y. PGsim: A Comprehensive and Highly Customizable Personal Genome Simulator. Front Bioeng Biotechnol 2020;8:28. [PMID: 32047747 PMCID: PMC6997238 DOI: 10.3389/fbioe.2020.00028] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 12/16/2019] [Accepted: 01/13/2020] [Indexed: 11/26/2022] Open

Abstract

Although genome sequencing has become increasingly popular, the simulation of individual genomes is still important. This is because sequencing a large number of individual genomes is costly and genome data with extreme and boundary conditions, such as fatal genetic defects, are difficult to obtain. Privacy and legal barriers also prevent many applications of real data. Large sequencing projects in recent years have provided a deeper understanding of the human genome. However, there is a lack of tools to leverage known data to simulate personal genomes as realistically as possible. Here, we designed and developed PGsim, a comprehensive and highly customizable individual genome simulator that makes full use of existing knowledge, such as variant allele frequencies in global or major world populations, mutation probability differences between protein-coding regions and non-coding regions, transition/transversion (Ti/Tv) ratios, Indel incidence, Indel length distribution, structural variation sites, and pathogenic mutation sites. Users can flexibly control the proportion and quantity of known variants, common variants, novel variants in both coding and non-coding regions, and special variants through detailed parameter settings. To ensure that the simulated personal genome has sufficient randomness, PGsim makes the generated variants more real and reliable in terms of variant distribution, proportion, and population characteristics. PGsim is able to employ a very large database as background data to simulate personal genomes and does not require SQL database support. Users can easily change the variant databases used as needed. As a Perl script, PGsim can be run on any version of macOS or Linux, and no libraries, packages, interpreters, compilers, or other dependencies need to be installed in advance. The PGsim tool is publicly available at https://github.com/lrjuan/PGsim.
