1
|
Fuente CMDL, Palomo-Díez S. Do forensic genetic markers disclose more information about us than they should? (A review). Int J Legal Med 2025; 139:935-943. [PMID: 39775034 DOI: 10.1007/s00414-024-03395-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Accepted: 12/06/2024] [Indexed: 01/11/2025]
Abstract
The 20 established STRs that make up the CoDIS package must comply with national and international privacy rights and legal policies. Current research reveals that it is possible that certain genetic markers, used in forensic contexts, may show information about other neighboring markers that could reflect certain private characteristics of individuals. Therefore, we will aim to find out, through a literature review, whether there may indeed be associations between some of the STRs alleles established by CoDIS and medical and phenotypic conditions, with the aim of checking whether this problem has a real basis. To carry out this review, a systematic search has been carried out in different databases, as well as a critical evaluation of the articles collected and a synthesis of all the relevant studies. The results generally show that private phenotypic data of the individual can be known from certain STRs established by CoDIS, due to their association with neighboring genes. However, most of the papers state that practically none of the established associations offer proof of causality, but rather a relationship between some STRs and a phenotype.
Collapse
Affiliation(s)
| | - Sara Palomo-Díez
- Health Legislation, Psychiatry and Pathology Department, Medicine Faculty, The Complutense University of Madrid, Madrid, Spain.
- Forensic Genetics and Human Population Genetics Research Group, Complutense University of Madrid, Madrid, Spain.
| |
Collapse
|
2
|
Tillmar A, Kling D. SNP Genotype Imputation in Forensics-A Performance Study. Genes (Basel) 2024; 15:1386. [PMID: 39596586 PMCID: PMC11593911 DOI: 10.3390/genes15111386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Revised: 10/21/2024] [Accepted: 10/24/2024] [Indexed: 11/28/2024] Open
Abstract
BACKGROUND/OBJECTIVES Emerging forensic genetic applications, such as forensic investigative genetic genealogy (FIGG), advanced DNA phenotyping, and distant kinship inference, increasingly require dense SNP genotype datasets. However, forensic-grade DNA often contains missing genotypes due to its quality and quantity limitations, potentially hindering these applications. Genotype imputation, a method that predicts missing genotypes, is widely used in population and medical genetics, but its utility in forensic genetics has not been thoroughly explored. This study aims to assess the performance of genotype imputation in forensic contexts and determine the conditions under which it can be effectively applied. METHODS We employed a simulation-based approach to generate realistic forensic SNP genotype datasets with varying numbers, densities, and qualities of observed genotypes. Genotype imputation was performed using Beagle software, and the performance was evaluated based on the call rate and imputation accuracy across different datasets and imputation settings. RESULTS The results demonstrate that genotype imputation can significantly increase the number of SNP genotypes. However, imputation accuracy was dependent on factors such as the quality of the original genotype data and the characteristics of the reference population. Higher SNP density and fewer genotype errors generally resulted in improved imputation accuracy. CONCLUSIONS This study highlights the potential of genotype imputation to enhance forensic SNP datasets but underscores the importance of optimizing imputation parameters and understanding the limitations of the original data. These findings will inform the future application of imputation in forensic genetics, supporting its integration into forensic workflows.
Collapse
Affiliation(s)
- Andreas Tillmar
- Department of Forensic Genetics and Forensic Toxicology, National Board of Forensic Medicine, SE-58758 Linköping, Sweden;
- Department of Biomedical and Clinical Sciences, Faculty of Health Sciences, Linköping University, SE-58183 Linköping, Sweden
| | - Daniel Kling
- Department of Forensic Genetics and Forensic Toxicology, National Board of Forensic Medicine, SE-58758 Linköping, Sweden;
- Department of Forensic Sciences, Oslo University Hospital, NO-0424 Oslo, Norway
| |
Collapse
|
3
|
Song M, Zhou Y, Zhao C, Song F, Hou Y. YHP: Y-chromosome Haplogroup Predictor for predicting male lineages based on Y-STRs. Forensic Sci Int 2024; 361:112113. [PMID: 38936202 DOI: 10.1016/j.forsciint.2024.112113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 05/24/2024] [Accepted: 06/16/2024] [Indexed: 06/29/2024]
Abstract
Human Y chromosome reflects the evolutionary process of males. Male lineage tracing by Y chromosome is of great use in evolutionary, forensic, and anthropological studies. Identifying the male lineage based on the specific distribution of Y haplogroups narrows down the investigation scope, which has been used in forensic scenarios. However, existing software aids in familial searching using Y-STRs (Y-chromosome short tandem repeats) to predict Y-SNP (Y-chromosome single nucleotide polymorphism) haplogroups, they often lack resolution. In this study, we developed YHP (Y Haplogroup Predictor), a novel software offering high-resolution haplogroup inference without requiring extensive Y-SNP sequencing. Leveraging existing datasets (219 haplogroups, 4064 samples in total), YHP predicts haplogroups with 0.923 accuracy under the highest haplogroup resolution, employing a random forest algorithm. YHP, available on Github (https://github.com/cissy123/YHP-Y-Haplogroup-Predictor-), facilitates high-resolution haplogroup prediction, haplotype mismatch analysis, and haplotype similarity comparison. Notably, it demonstrates efficacy in East Asian populations, benefiting from training data from eight distinct East Asian ethnic populations. Moreover, it enables seamless integration of additional training sets, extending its utility to diverse populations.
Collapse
Affiliation(s)
- Mengyuan Song
- Department of Forensic Genetics, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China; Department of Laboratory Medicine, West China Hospital, Sichuan University, Chengdu, China
| | - Yuxiang Zhou
- Department of Forensic Genetics, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China
| | - Chenxi Zhao
- College of Computer Science, Sichuan University, Chengdu, China
| | - Feng Song
- Department of Forensic Genetics, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China.
| | - Yiping Hou
- Department of Forensic Genetics, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China.
| |
Collapse
|
4
|
Thomas M, Mackes N, Preuss-Dodhy A, Wieland T, Bundschus M. Assessing Privacy Vulnerabilities in Genetic Data Sets: Scoping Review. JMIR BIOINFORMATICS AND BIOTECHNOLOGY 2024; 5:e54332. [PMID: 38935957 PMCID: PMC11165293 DOI: 10.2196/54332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 03/26/2024] [Accepted: 03/29/2024] [Indexed: 06/29/2024]
Abstract
BACKGROUND Genetic data are widely considered inherently identifiable. However, genetic data sets come in many shapes and sizes, and the feasibility of privacy attacks depends on their specific content. Assessing the reidentification risk of genetic data is complex, yet there is a lack of guidelines or recommendations that support data processors in performing such an evaluation. OBJECTIVE This study aims to gain a comprehensive understanding of the privacy vulnerabilities of genetic data and create a summary that can guide data processors in assessing the privacy risk of genetic data sets. METHODS We conducted a 2-step search, in which we first identified 21 reviews published between 2017 and 2023 on the topic of genomic privacy and then analyzed all references cited in the reviews (n=1645) to identify 42 unique original research studies that demonstrate a privacy attack on genetic data. We then evaluated the type and components of genetic data exploited for these attacks as well as the effort and resources needed for their implementation and their probability of success. RESULTS From our literature review, we derived 9 nonmutually exclusive features of genetic data that are both inherent to any genetic data set and informative about privacy risk: biological modality, experimental assay, data format or level of processing, germline versus somatic variation content, content of single nucleotide polymorphisms, short tandem repeats, aggregated sample measures, structural variants, and rare single nucleotide variants. CONCLUSIONS On the basis of our literature review, the evaluation of these 9 features covers the great majority of privacy-critical aspects of genetic data and thus provides a foundation and guidance for assessing genetic data risk.
Collapse
|
5
|
Lappo E, Rosenberg NA. Solving the Arizona search problem by imputation. iScience 2024; 27:108831. [PMID: 38323008 PMCID: PMC10845060 DOI: 10.1016/j.isci.2024.108831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 11/03/2023] [Accepted: 01/03/2024] [Indexed: 02/08/2024] Open
Abstract
An "Arizona search" is an evaluation of the numbers of pairs of profiles in a forensic-genetic database that possess partial or complete genotypic matches; such a search assists in establishing the extent to which a set of loci provides unique identifications. In forensic genetics, however, the potential for performing Arizona searches is constrained by the limited availability of actual forensic profiles for research purposes. Here, we use genotype imputation to circumvent this problem. From a database of genomes, we impute genotypes of forensic short-tandem-repeat (STR) loci from neighboring single-nucleotide polymorphisms (SNPs), searching for partial STR matches using the imputed profiles. We compare the distributions of the numbers of partial matches in imputed and actual profiles, finding close agreement. Despite limited potential for performing Arizona searches with actual forensic STR profiles, the questions that such searches seek to answer can be posed with imputation-based Arizona searches in increasingly large SNP databases.
Collapse
Affiliation(s)
- Egor Lappo
- Department of Biology, Stanford University, Stanford, CA, USA
| | | |
Collapse
|
6
|
Kim J, Rosenberg NA. Record-matching of STR profiles with fragmentary genomic SNP data. Eur J Hum Genet 2023; 31:1283-1290. [PMID: 37567955 PMCID: PMC10620386 DOI: 10.1038/s41431-023-01430-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 05/30/2023] [Accepted: 07/03/2023] [Indexed: 08/13/2023] Open
Abstract
In many forensic settings, identity of a DNA sample is sought from poor-quality DNA, for which the typical STR loci tabulated in forensic databases are not possible to reliably genotype. Genome-wide SNPs, however, can potentially be genotyped from such samples via next-generation sequencing, so that queries can in principle compare SNP genotypes from DNA samples of interest to STR genotype profiles that represent proposed matches. We use genetic record-matching to evaluate the possibility of testing SNP profiles obtained from poor-quality DNA samples to identify exact and relatedness matches to STR profiles. Using simulations based on whole-genome sequences, we show that in some settings, similar match accuracies to those seen with full coverage of the genome are obtained by genetic record-matching for SNP data that represent 5-10% genomic coverage. Thus, if even a fraction of random genomic SNPs can be genotyped by next-generation sequencing, then the potential may exist to test the resulting genotype profiles for matches to profiles consisting exclusively of nonoverlapping STR loci. The result has implications in relation to criminal justice, mass disasters, missing-person cases, studies of ancient DNA, and genomic privacy.
Collapse
Affiliation(s)
- Jaehee Kim
- Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
| | - Noah A Rosenberg
- Department of Biology, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
7
|
Link V, Zavaleta YJA, Reyes RJ, Ding L, Wang J, Rohlfs RV, Edge MD. Microsatellites used in forensics are in regions enriched for trait-associated variants. iScience 2023; 26:107992. [PMID: 37841589 PMCID: PMC10570123 DOI: 10.1016/j.isci.2023.107992] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 08/10/2023] [Accepted: 09/18/2023] [Indexed: 10/17/2023] Open
Abstract
The 20 short tandem repeat (STR) loci of the combined DNA index system (CODIS) are the basis of the vast majority of forensic genetics in the United States. One argument for permissive rules about the collection of CODIS genotypes is that the CODIS loci are thought to contain little information about ancestry or traits. However, in the past 20 years, a growing field has identified hundreds of thousands of genotype-trait associations. Here, we conduct a survey of the landscape of such associations surrounding the CODIS loci as compared with non-CODIS STRs. Although this study cannot establish or quantify associations between CODIS genotypes and phenotypes, we find that the regions around the CODIS loci are enriched for both known pathogenic variants (> 90th percentile) and for trait-associated SNPs identified in genome-wide association studies (GWAS) (≥ 95th percentile in 10kb and 100kb flanking regions), compared with other random sets of autosomal tetranucleotide-repeat STRs.
Collapse
Affiliation(s)
- Vivian Link
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | | | - Rochelle-Jan Reyes
- Department of Biology, San Francisco State University, San Francisco, CA, USA
| | - Linda Ding
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Judy Wang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Rori V. Rohlfs
- Department of Biology, San Francisco State University, San Francisco, CA, USA
- Department of Data Science and Institute of Ecology and Evolution, University of Oregon, Eugene, OR, USA
| | - Michael D. Edge
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
8
|
Link V, Zavaleta YJA, Reyes RJ, Ding L, Wang J, Rohlfs RV, Edge MD. Microsatellites used in forensics are located in regions unusually rich in trait-associated variants. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.07.531629. [PMID: 36945578 PMCID: PMC10028909 DOI: 10.1101/2023.03.07.531629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2023]
Abstract
The 20 short tandem repeat (STR) markers of the combined DNA index system (CODIS) are the basis of the vast majority of forensic genetics in the United States. One argument for permissive rules about the collection of CODIS genotypes is that the CODIS markers are thought to contain information relevant to identification only (such as a human fingerprint would), with little information about ancestry or traits. However, in the past 20 years, a quickly growing field has identified hundreds of thousands of genotype-trait associations. Here we conduct a survey of the landscape of such associations surrounding the CODIS loci as compared with non-CODIS STRs. We find that the regions around the CODIS markers are enriched for both known pathogenic variants (>90th percentile) and for SNPs identified as trait-associated in genome-wide association studies (GWAS) (≥95th percentile in 10kb and 100kb flanking regions), compared with other random sets of autosomal tetranucleotide-repeat STRs. Although it is not obvious how much phenotypic information CODIS would need to convey to strain the "DNA fingerprint" analogy, the CODIS markers, considered as a set, are in regions unusually dense with variants with known phenotypic associations.
Collapse
Affiliation(s)
- Vivian Link
- Department of Quantitative and Computational Biology, University of Southern California
| | | | | | - Linda Ding
- Department of Quantitative and Computational Biology, University of Southern California
| | - Judy Wang
- Department of Quantitative and Computational Biology, University of Southern California
| | - Rori V. Rohlfs
- Department of Biology, San Francisco State University
- Department of Computer Science and Institute of Ecology and Evolution, University of Oregon
| | - Michael D. Edge
- Department of Quantitative and Computational Biology, University of Southern California
| |
Collapse
|
9
|
Evaluation of a SNP-STR haplotype panel for forensic genotype imputation. Forensic Sci Int Genet 2023; 62:102801. [PMID: 36272212 DOI: 10.1016/j.fsigen.2022.102801] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Revised: 10/08/2022] [Accepted: 10/14/2022] [Indexed: 11/07/2022]
Abstract
Short tandem repeat polymorphism (STR)-based individual identification is a popular and reliable method in many forensic applications. However, STRs still frequently fail to find any matched records. In such cases, if known STRs could provide more information, it would be very helpful to solve specific problems. Genotype imputation has long been used in the study of single nucleotide polymorphisms (SNPs) and has recently been introduced into forensic fields. The idea is that, through a reference haplotype panel containing SNPs and STRs, we can obtain unknown genetic information through genotype imputation based on known STR or SNP genotypes. Several recent studies have already demonstrated this exciting idea, and a 1000 Genomes SNP-STR haplotype panel has also been released. To further study the performance of genotype imputation in forensic fields, we collected STR, microhaplotype (MH) and SNP array genotypes from Chinese Han population individuals and then performed genotype imputation analysis based on the released reference panel. As a result, the average locus imputation accuracy was ∼83 % (or ∼70 %) when SNPs in the SNP array (or MH SNPs) were imputed from STRs, and was ∼30 % when highly polymorphic markers (STRs and MHs) were imputed from each other. When STRs were imputed from SNP array, the average locus imputation accuracy increased to ∼48 %. After analyzing the match scores between real STRs and the STRs imputed from SNPs, ∼80 % of studied STR records can be connected to corresponding SNP records, which may help for individual identification. Our results indicate that genotype imputation has great potential for forensic applications.
Collapse
|
10
|
Ogbunugafor CB, Edge MD. Gattaca as a lens on contemporary genetics: marking 25 years into the film's "not-too-distant" future. Genetics 2022; 222:iyac142. [PMID: 36218390 PMCID: PMC9713434 DOI: 10.1093/genetics/iyac142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 09/05/2022] [Indexed: 11/13/2022] Open
Abstract
The 1997 film Gattaca has emerged as a canonical pop culture reference used to discuss modern controversies in genetics and bioethics. It appeared in theaters a few years prior to the announcement of the "completion" of the human genome (2000), as the science of human genetics was developing a renewed sense of its social implications. The story is set in a near-future world in which parents can, with technological assistance, influence the genetic composition of their offspring on the basis of predicted life outcomes. The current moment-25 years after the film's release-offers an opportunity to reflect on where society currently stands with respect to the ideas explored in Gattaca. Here, we review and discuss several active areas of genetic research-genetic prediction, embryo selection, forensic genetics, and others-that interface directly with scenes and concepts in the film. On its silver anniversary, we argue that Gattaca remains an important reflection of society's expectations and fears with respect to the ways that genetic science has manifested in the real world. In accompanying supplemental material, we offer some thought questions to guide group discussions inside and outside of the classroom.
Collapse
Affiliation(s)
- C Brandon Ogbunugafor
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
- Santa Fe Institute, Santa Fe, NM 87501, USA
- Vermont Complex Systems Center, Burlington, VT 05401, USA
| | - Michael D Edge
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
11
|
Bañuelos MM, Zavaleta YJA, Roldan A, Reyes RJ, Guardado M, Chavez Rojas B, Nyein T, Rodriguez Vega A, Santos M, Huerta-Sanchez E, Rohlfs RV. Associations between forensic loci and expression levels of neighboring genes may compromise medical privacy. Proc Natl Acad Sci U S A 2022; 119:e2121024119. [PMID: 36166477 PMCID: PMC9546536 DOI: 10.1073/pnas.2121024119] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 08/29/2022] [Indexed: 11/18/2022] Open
Abstract
A set of 20 short tandem repeats (STRs) is used by the US criminal justice system to identify suspects and to maintain a database of genetic profiles for individuals who have been previously convicted or arrested. Some of these STRs were identified in the 1990s, with a preference for markers in putative gene deserts to avoid forensic profiles revealing protected medical information. We revisit that assumption, investigating whether forensic genetic profiles reveal information about gene-expression variation or potential medical information. We find six significant correlations (false discovery rate = 0.23) between the forensic STRs and the expression levels of neighboring genes in lymphoblastoid cell lines. We explore possible mechanisms for these associations, showing evidence compatible with forensic STRs causing expression variation or being in linkage disequilibrium with a causal locus in three cases and weaker or potentially spurious associations in the other three cases. Together, these results suggest that forensic genetic loci may reveal expression levels and, perhaps, medical information.
Collapse
Affiliation(s)
- Mayra M. Bañuelos
- Department of Mathematics, San Francisco State University, San Francisco, CA 94132
- Ecology, Evolution and Organismal Biology, Brown University, Providence, RI 02912
- Center for Computational and Molecular Biology, Brown University, Providence, RI 02912
| | | | - Alennie Roldan
- Department of Biology, San Francisco State University, San Francisco, CA 94132
| | - Rochelle-Jan Reyes
- Department of Biology, San Francisco State University, San Francisco, CA 94132
| | - Miguel Guardado
- Department of Mathematics, San Francisco State University, San Francisco, CA 94132
| | | | - Thet Nyein
- Department of Mathematics, San Francisco State University, San Francisco, CA 94132
| | - Ana Rodriguez Vega
- Department of Biology, San Francisco State University, San Francisco, CA 94132
| | - Maribel Santos
- Department of Biology, San Francisco State University, San Francisco, CA 94132
| | - Emilia Huerta-Sanchez
- Ecology, Evolution and Organismal Biology, Brown University, Providence, RI 02912
- Center for Computational and Molecular Biology, Brown University, Providence, RI 02912
| | - Rori V. Rohlfs
- Department of Biology, San Francisco State University, San Francisco, CA 94132
| |
Collapse
|
12
|
Abstract
Genomics data are important for advancing biomedical research, improving clinical care, and informing other disciplines such as forensics and genealogy. However, privacy concerns arise when genomic data are shared. In particular, the identifying nature of genetic information, its direct relationship to health status, and the potential financial harm and stigmatization posed to individuals and their blood relatives call for a survey of the privacy issues related to sharing genetic and related data and potential solutions to overcome these issues. In this work, we provide an overview of the importance of genomic privacy, the information gleaned from genomics data, the sources of potential private information leakages in genomics, and ways to preserve privacy while utilizing the genetic information in research. We discuss the relationship between trust in the scientific community and protecting privacy, illuminating a future roadmap for data sharing and study participation.
Collapse
Affiliation(s)
- Gamze Gürsoy
- Department of Biomedical Informatics, Columbia University, New York, NY, USA; .,New York Genome Center, New York, NY, USA
| |
Collapse
|
13
|
Wan Z, Hazel JW, Clayton EW, Vorobeychik Y, Kantarcioglu M, Malin BA. Sociotechnical safeguards for genomic data privacy. Nat Rev Genet 2022; 23:429-445. [PMID: 35246669 PMCID: PMC8896074 DOI: 10.1038/s41576-022-00455-y] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/24/2022] [Indexed: 12/21/2022]
Abstract
Recent developments in a variety of sectors, including health care, research and the direct-to-consumer industry, have led to a dramatic increase in the amount of genomic data that are collected, used and shared. This state of affairs raises new and challenging concerns for personal privacy, both legally and technically. This Review appraises existing and emerging threats to genomic data privacy and discusses how well current legal frameworks and technical safeguards mitigate these concerns. It concludes with a discussion of remaining and emerging challenges and illustrates possible solutions that can balance protecting privacy and realizing the benefits that result from the sharing of genetic information.
Collapse
Affiliation(s)
- Zhiyu Wan
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN, USA
- Department of Computer Science, Vanderbilt University, Nashville, TN, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - James W Hazel
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN, USA
- Center for Biomedical Ethics and Society, Vanderbilt University, Nashville, TN, USA
| | - Ellen Wright Clayton
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN, USA
- Center for Biomedical Ethics and Society, Vanderbilt University, Nashville, TN, USA
- Vanderbilt University Law School, Nashville, TN, USA
| | - Yevgeniy Vorobeychik
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, MO, USA
| | - Murat Kantarcioglu
- Department of Computer Science, University of Texas at Dallas, Richardson, TX, USA
| | - Bradley A Malin
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN, USA.
- Department of Computer Science, Vanderbilt University, Nashville, TN, USA.
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA.
| |
Collapse
|
14
|
Could routine forensic STR genotyping data leak personal phenotypic information? Forensic Sci Int 2022; 335:111311. [PMID: 35468577 DOI: 10.1016/j.forsciint.2022.111311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 03/19/2022] [Accepted: 04/13/2022] [Indexed: 11/22/2022]
Abstract
The application of forensic genetic markers must comply with privacy rights and legal policies on a premise that the markers do not expose phenotypic information. The most widely-used short tandem repeats (STRs) are generally viewed as 'junk' DNA because most STRs are located in non-coding regions and therefore refrain from leaking phenotypic traits. But with a deepening understanding of phenotypes and underlying genetic structure, whether STRs could potentially reflect any phenotypic information may need re-examining. Therefore, we performed the following analyses. First, we analyzed the association between 15 STRs and three facial characteristics (single or double eyelid, with or without epicanthus, unattached or attached earlobe) on 721 unrelated Han Chinese individuals. Then, we collected 27199 individuals' STRs and geographic data from the literature to investigate the association between STRs and bio-geographic information, and predict geographic information by STRs on additional 1993 unrelated individuals. We found that there was scarcely any association between STRs with studied facial characteristics. Although allele19 in D2S1338 and allele 18 in FGA (P = 0.0032, P = 0.0030, respectively after Bonferroni correction) showed statistical significance, the prediction effectiveness was very low. For the STRs and bio-geographic information, the principal component analysis showed the first three components could explain 87.7% of the variance, but the prediction accuracy only reached 25.2%. We demonstrated that the forensic phenotypes are usually complex traits, it is hardly possible to uncover phenotypic information by testing only dozens of STR loci.
Collapse
|
15
|
Wan Z, Vorobeychik Y, Xia W, Liu Y, Wooders M, Guo J, Yin Z, Clayton EW, Kantarcioglu M, Malin BA. Using game theory to thwart multistage privacy intrusions when sharing data. SCIENCE ADVANCES 2021; 7:eabe9986. [PMID: 34890225 PMCID: PMC8664254 DOI: 10.1126/sciadv.abe9986] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 10/25/2021] [Indexed: 06/13/2023]
Abstract
Person-specific biomedical data are now widely collected, but its sharing raises privacy concerns, specifically about the re-identification of seemingly anonymous records. Formal re-identification risk assessment frameworks can inform decisions about whether and how to share data; current techniques, however, focus on scenarios where the data recipients use only one resource for re-identification purposes. This is a concern because recent attacks show that adversaries can access multiple resources, combining them in a stage-wise manner, to enhance the chance of an attack’s success. In this work, we represent a re-identification game using a two-player Stackelberg game of perfect information, which can be applied to assess risk, and suggest an optimal data sharing strategy based on a privacy-utility tradeoff. We report on experiments with large-scale genomic datasets to show that, using game theoretic models accounting for adversarial capabilities to launch multistage attacks, most data can be effectively shared with low re-identification risk.
Collapse
Affiliation(s)
- Zhiyu Wan
- Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37212, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Yevgeniy Vorobeychik
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Weiyi Xia
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Yongtai Liu
- Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37212, USA
| | - Myrna Wooders
- Department of Economics, Vanderbilt University, Nashville, TN 37235, USA
| | - Jia Guo
- Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37212, USA
| | - Zhijun Yin
- Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37212, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Ellen Wright Clayton
- Center for Biomedical Ethics and Society, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- School of Law, Vanderbilt University, Nashville, TN 37203, USA
- Department of Pediatrics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Murat Kantarcioglu
- Department of Computer Science, University of Texas at Dallas, Richardson, TX 75080, USA
- Institute for Quantitative Social Science, Harvard University, Cambridge, MA 02138, USA
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Bradley A. Malin
- Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37212, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| |
Collapse
|
16
|
Gürsoy G, Emani P, Brannon CM, Jolanki OA, Harmanci A, Strattan JS, Cherry JM, Miranker AD, Gerstein M. Data Sanitization to Reduce Private Information Leakage from Functional Genomics. Cell 2021; 183:905-917.e16. [PMID: 33186529 DOI: 10.1016/j.cell.2020.09.036] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Revised: 07/23/2020] [Accepted: 09/11/2020] [Indexed: 12/30/2022]
Abstract
The generation of functional genomics datasets is surging, because they provide insight into gene regulation and organismal phenotypes (e.g., genes upregulated in cancer). The intent behind functional genomics experiments is not necessarily to study genetic variants, yet they pose privacy concerns due to their use of next-generation sequencing. Moreover, there is a great incentive to broadly share raw reads for better statistical power and general research reproducibility. Thus, we need new modes of sharing beyond traditional controlled-access models. Here, we develop a data-sanitization procedure allowing raw functional genomics reads to be shared while minimizing privacy leakage, enabling principled privacy-utility trade-offs. Our protocol works with traditional Illumina-based assays and newer technologies such as 10x single-cell RNA sequencing. It involves quantifying the privacy leakage in reads by statistically linking study participants to known individuals. We carried out these linkages using data from highly accurate reference genomes and more realistic environmental samples.
Collapse
Affiliation(s)
- Gamze Gürsoy
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Prashant Emani
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Charlotte M Brannon
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Otto A Jolanki
- Stanford University School of Medicine, Department of Genetics, Stanford, CA 94305, USA
| | - Arif Harmanci
- School of Biomedical Informatics, Center for Precision Health, University of Texas Health Sciences Center, Houston, TX 77030, USA
| | - J Seth Strattan
- Stanford University School of Medicine, Department of Genetics, Stanford, CA 94305, USA
| | - J Michael Cherry
- Stanford University School of Medicine, Department of Genetics, Stanford, CA 94305, USA
| | - Andrew D Miranker
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA; Department of Chemical and Environmental Engineering, Yale University, New Haven, CT 06520, USA
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA; Department of Computer Science, Yale University, New Haven, CT 06520, USA; Department of Statistics and Data Science, Yale University, New Haven, CT 06520, USA.
| |
Collapse
|
17
|
Uhlig HH, Charbit-Henrion F, Kotlarz D, Shouval DS, Schwerd T, Strisciuglio C, de Ridder L, van Limbergen J, Macchi M, Snapper SB, Ruemmele FM, Wilson DC, Travis SP, Griffiths AM, Turner D, Klein C, Muise AM, Russell RK. Clinical Genomics for the Diagnosis of Monogenic Forms of Inflammatory Bowel Disease: A Position Paper From the Paediatric IBD Porto Group of European Society of Paediatric Gastroenterology, Hepatology and Nutrition. J Pediatr Gastroenterol Nutr 2021; 72:456-473. [PMID: 33346580 PMCID: PMC8221730 DOI: 10.1097/mpg.0000000000003017] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
BACKGROUND It is important to identify patients with monogenic IBD as management may differ from classical IBD. In this position statement we formulate recommendations for the use of genomics in evaluating potential monogenic causes of IBD across age groups. METHODS The consensus included paediatric IBD specialists from the Paediatric IBD Porto group of the European Society of Paediatric Gastroenterology, Hepatology and Nutrition (ESPGHAN) and specialists from several monogenic IBD research consortia. We defined key topics and performed a systematic literature review to cover indications, technologies (targeted panel, exome and genome sequencing), gene panel setup, cost-effectiveness of genetic screening, and requirements for the clinical care setting. We developed recommendations that were voted upon by all authors and Porto group members (32 voting specialists). RESULTS We recommend next-generation DNA-sequencing technologies to diagnose monogenic causes of IBD in routine clinical practice embedded in a setting of multidisciplinary patient care. Routine genetic screening is not recommended for all IBD patients. Genetic testing should be considered depending on age of IBD-onset (infantile IBD, very early-onset IBD, paediatric or young adult IBD), and further criteria, such as family history, relevant comorbidities, and extraintestinal manifestations. Genetic testing is also recommended in advance of hematopoietic stem cell transplantation. We developed a diagnostic algorithm that includes a gene panel of 75 monogenic IBD genes. Considerations are provided also for low resource countries. CONCLUSIONS Genomic technologies should be considered an integral part of patient care to investigate patients at risk for monogenic forms of IBD.
Collapse
Affiliation(s)
- Holm H. Uhlig
- Translational Gastroenterology Unit, University of Oxford, Oxford, United Kingdom
- Department of Pediatrics, University of Oxford, Oxford, United Kingdom
- Biomedical Research Center, University of Oxford, Oxford, United Kingdom
| | - Fabienne Charbit-Henrion
- Université de Paris, INSERM UMR 1163 Immunité Intestinale, APHP, Hôpital Necker Enfants Malades, Service de Génétique moléculaire, Paris, France
| | - Daniel Kotlarz
- Department of Pediatrics, Dr. von Hauner Children’s Hospital, University Hospital, LMU Munich, Munich, Germany
| | - Dror S. Shouval
- Department of Pediatrics, Dr. von Hauner Children’s Hospital, University Hospital, LMU Munich, Munich, Germany
| | - Tobias Schwerd
- Department of Pediatrics, Dr. von Hauner Children’s Hospital, University Hospital, LMU Munich, Munich, Germany
| | | | - Lissy de Ridder
- Department of Paediatric Gastroenterology, Erasmus University Medical Center Sophia Children’s Hospital, Rotterdam, the Netherlands
| | - Johan van Limbergen
- Amsterdam University Medical Centres, Emma Children’s Hospital, The Netherlands and Tytgat Institute for Liver and Intestinal Research, Amsterdam Gastroenterology and Metabolism, Academic Medical Centre, University of Amsterdam, Amsterdam, The Netherlands
| | - Marina Macchi
- Translational Gastroenterology Unit, University of Oxford, Oxford, United Kingdom
| | - Scott B. Snapper
- Division of Gastroenterology, Hepatology, and Nutrition, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Frank M. Ruemmele
- Université de Paris, APHP, Hôpital Necker Enfants Malades, Service de Gastroentérologie pédiatrique, Paris, France
| | - David C. Wilson
- Child Life and Health, University of Edinburgh, Department of Paediatric Gastroenterology, The Royal Hospital for Sick Children, Edinburgh
| | - Simon P.L. Travis
- Translational Gastroenterology Unit, University of Oxford, Oxford, United Kingdom
- Biomedical Research Center, University of Oxford, Oxford, United Kingdom
| | - Anne M. Griffiths
- The Hospital for Sick Children, University of Toronto
- SickKids Inflammatory Bowel Disease Centre and Cell Biology Program, Research Institute, The Hospital for Sick Children
- Division of Gastroenterology, Hepatology, and Nutrition, Department of Pediatrics, Toronto, Ontario, Canada
| | - Dan Turner
- Shaare Zedek Medical Center, The Hebrew University of Jerusalem, Israel
| | - Christoph Klein
- Department of Pediatrics, Dr. von Hauner Children’s Hospital, University Hospital, LMU Munich, Munich, Germany
| | - Aleixo M. Muise
- The Hospital for Sick Children, University of Toronto
- SickKids Inflammatory Bowel Disease Centre and Cell Biology Program, Research Institute, The Hospital for Sick Children
- Division of Gastroenterology, Hepatology, and Nutrition, Department of Pediatrics, Toronto, Ontario, Canada
| | - Richard K. Russell
- Child Life and Health, University of Edinburgh, Department of Paediatric Gastroenterology, The Royal Hospital for Sick Children, Edinburgh
| |
Collapse
|
18
|
Kling D, Phillips C, Kennett D, Tillmar A. Investigative genetic genealogy: Current methods, knowledge and practice. Forensic Sci Int Genet 2021; 52:102474. [PMID: 33592389 DOI: 10.1016/j.fsigen.2021.102474] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 01/12/2021] [Accepted: 01/27/2021] [Indexed: 12/15/2022]
Abstract
Investigative genetic genealogy (IGG) has emerged as a new, rapidly growing field of forensic science. We describe the process whereby dense SNP data, commonly comprising more than half a million markers, are employed to infer distant relationships. By distant we refer to degrees of relatedness exceeding that of first cousins. We review how methods of relationship matching and SNP analysis on an enlarged scale are used in a forensic setting to identify a suspect in a criminal investigation or a missing person. There is currently a strong need in forensic genetics not only to understand the underlying models to infer relatedness but also to fully explore the DNA technologies and data used in IGG. This review brings together many of the topics and examines their effectiveness and operational limits, while suggesting future directions for their forensic validation. We further investigated the methods used by the major direct-to-consumer (DTC) genetic ancestry testing companies as well as submitting a questionnaire where providers of forensic genetic genealogy summarized their operation/services. Although most of the DTC market, and genetic genealogy in general, has undisclosed, proprietary algorithms we review the current knowledge where information has been discussed and published more openly.
Collapse
Affiliation(s)
- Daniel Kling
- Department of Forensic Genetics and Forensic Toxicology, National Board of Forensic Medicine, Linköping, Sweden; Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway.
| | - Christopher Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Santiago de Compostela, Spain.
| | - Debbie Kennett
- Research Department of Genetics, Evolution and Environment, University College London, Gower Street, London WC1E 6BT, United Kingdom
| | - Andreas Tillmar
- Department of Forensic Genetics and Forensic Toxicology, National Board of Forensic Medicine, Linköping, Sweden; Department of Biomedical and Clinical Sciences, Faculty of Medicine and Health Sciences, Linköping University, Linköping, Sweden
| |
Collapse
|
19
|
Katsanis SH. Pedigrees and Perpetrators: Uses of DNA and Genealogy in Forensic Investigations. Annu Rev Genomics Hum Genet 2020; 21:535-564. [DOI: 10.1146/annurev-genom-111819-084213] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In the past few years, cases with DNA evidence that could not be solved with direct matches in DNA databases have benefited from comparing single-nucleotide polymorphism data with private and public genomic databases. Using a combination of genome comparisons and traditional genealogical research, investigators can triangulate distant relatives to the contributor of DNA data from a crime scene, ultimately identifying perpetrators of violent crimes. This approach has also been successful in identifying unknown deceased persons and perpetrators of lesser crimes. Such advances are bringing into focus ethical questions on how much access to DNA databases should be granted to law enforcement and how best to empower public genome contributors with control over their data. The necessary policies will take time to develop but can be informed by reflection on the familial searching policies developed for searches of the federal DNA database and considerations of the anonymity and privacy interests of civilians.
Collapse
Affiliation(s)
- Sara H. Katsanis
- Mary Ann & J. Milburn Smith Child Health Research, Outreach, and Advocacy Center, Ann & Robert H. Lurie Children's Hospital of Chicago, Chicago, Illinois 60611, USA
- Department of Pediatrics, Northwestern University, Chicago, Illinois 60611, USA
| |
Collapse
|
20
|
Conboy C. Consent and Privacy in the Era of Precision Medicine and Biobanking Genomic Data. AMERICAN JOURNAL OF LAW & MEDICINE 2020; 46:167-187. [PMID: 32659188 DOI: 10.1177/0098858820933493] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
"Big Data represents a challenge that points to the need for collective and political approaches to self-protection rather than solely individual, atomistic approaches."- Anita Allen, "Protecting One's Own Privacy in a Big Data Economy".
Collapse
|
21
|
Edge MD, Coop G. Attacks on genetic privacy via uploads to genealogical databases. eLife 2020; 9:e51810. [PMID: 31908268 PMCID: PMC6992384 DOI: 10.7554/elife.51810] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Accepted: 12/23/2019] [Indexed: 02/06/2023] Open
Abstract
Direct-to-consumer (DTC) genetics services are increasingly popular, with tens of millions of customers. Several DTC genealogy services allow users to upload genetic data to search for relatives, identified as people with genomes that share identical by state (IBS) regions. Here, we describe methods by which an adversary can learn database genotypes by uploading multiple datasets. For example, an adversary who uploads approximately 900 genomes could recover at least one allele at SNP sites across up to 82% of the genome of a median person of European ancestries. In databases that detect IBS segments using unphased genotypes, approximately 100 falsified uploads can reveal enough genetic information to allow genome-wide genetic imputation. We provide a proof-of-concept demonstration in the GEDmatch database, and we suggest countermeasures that will prevent the exploits we describe.
Collapse
Affiliation(s)
- Michael D Edge
- Center for Population BiologyUniversity of California, DavisDavisUnited States
- Department of Evolution and EcologyUniversity of California, DavisDavisUnited States
- Quantitative and Computational Biology, Department of Biological SciencesUniversity of Southern CaliforniaLos AngelesUnited States
| | - Graham Coop
- Center for Population BiologyUniversity of California, DavisDavisUnited States
- Department of Evolution and EcologyUniversity of California, DavisDavisUnited States
| |
Collapse
|
22
|
Wickenheiser RA. Forensic genealogy, bioethics and the Golden State Killer case. Forensic Sci Int Synerg 2019; 1:114-125. [PMID: 32411963 PMCID: PMC7219171 DOI: 10.1016/j.fsisyn.2019.07.003] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Revised: 07/05/2019] [Accepted: 07/05/2019] [Indexed: 11/25/2022]
Abstract
A case study will be used to examine specific issues of bioethics and forensic science that occur in forensic investigative genealogical searching, which include genetic privacy, discrimination and public safety concerns. The forensic investigative process and various investigative DNA tools will also be described. The Golden State Killer Case (1) will be examined to highlight and discuss forensic ethical issues to develop an ethical framework, as well as provide recommended solutions to pressing public safety and privacy issues facing crime laboratories and criminal investigators. Use of the ethical concept of proportionality (2) will be utilized to contrast and balance competing ethical concerns.
Collapse
Affiliation(s)
- Ray A. Wickenheiser
- Laboratory System Director, New York State Police Crime Laboratory System, 1220 Washington Ave, Building 30, Albany, NY, 12226-3000, USA
| |
Collapse
|
23
|
Hendricks-Sturrup RM, Prince AER, Lu CY. Direct-to-Consumer Genetic Testing and Potential Loopholes in Protecting Consumer Privacy and Nondiscrimination. JAMA 2019; 321:1869-1870. [PMID: 30998825 DOI: 10.1001/jama.2019.3384] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
- Rachele M Hendricks-Sturrup
- Precision Medicine Translational Research (PROMoTeR) Center, Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, Massachusetts
| | | | - Christine Y Lu
- Precision Medicine Translational Research (PROMoTeR) Center, Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, Massachusetts
| |
Collapse
|
24
|
Gürsoy G, Harmanci A, Tang H, Ayday E, Brenner SE. When Biology Gets Personal: Hidden Challenges of Privacy and Ethics in Biological Big Data. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2019; 24:386-390. [PMID: 30864339 PMCID: PMC7577606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
High-throughput technologies for biological data acquisition are advancing at an increasing pace. Most prominently, the decreasing cost of DNA sequencing has led to an exponential growth of sequence information, including individual human genomes. This session of the 2019 Pacific Symposium on Biocomputing presents the distinctive privacy and ethical challenges related to the generation, storage, processing, study, and sharing of individuals' biological data generated by multitude of technologies including but not limited to genomics, proteomics, metagenomics, bioimaging, biosensors, and personal health trackers. The mission is to bring together computational biologists, experimental biologists, computer scientists, ethicists, and policy and lawmakers to share ideas, discuss the challenges related to biological data and privacy.
Collapse
Affiliation(s)
| | - Arif Harmanci
- Center for Precision Health, School of Biomedical Informatics, University of Texas Health Science Center, Houston, TX, 77030, USA
| | | | - Erman Ayday
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, OH, 44106, USA
| | | |
Collapse
|
25
|
Hazel JW, Clayton EW, Malin BA, Slobogin C. Is it time for a universal genetic forensic database? Science 2018; 362:898-900. [DOI: 10.1126/science.aav5475] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Affiliation(s)
- J. W. Hazel
- Center for Biomedical Ethics and Society, Vanderbilt University, Nashville, TN 37203, USA
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - E. W. Clayton
- Center for Biomedical Ethics and Society, Vanderbilt University, Nashville, TN 37203, USA
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Vanderbilt University Law School, Nashville, TN 37203, USA
| | - B. A. Malin
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37203, USA
| | - C. Slobogin
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN 37203, USA
- Vanderbilt University Law School, Nashville, TN 37203, USA
| |
Collapse
|
26
|
|