1
|
Estravis Barcala M, van der Valk T, Chen Z, Funda T, Chaudhary R, Klingberg A, Fundova I, Suontama M, Hallingbäck H, Bernhardsson C, Nystedt B, Ingvarsson PK, Sherwood E, Street N, Gyllensten U, Nilsson O, Wu HX. Whole-genome resequencing facilitates the development of a 50K single nucleotide polymorphism genotyping array for Scots pine (Pinus sylvestris L.) and its transferability to other pine species. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 117:944-955. [PMID: 37947292 DOI: 10.1111/tpj.16535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 10/25/2023] [Accepted: 10/26/2023] [Indexed: 11/12/2023]
Abstract
Scots pine (Pinus sylvestris L.) is one of the most widespread and economically important conifer species in the world. Applications like genomic selection and association studies, which could help accelerate breeding cycles, are challenging in Scots pine because of its large and repetitive genome. For this reason, genotyping tools for conifer species, and in particular for Scots pine, are commonly based on transcribed regions of the genome. In this article, we present the Axiom Psyl50K array, the first single nucleotide polymorphism (SNP) genotyping array for Scots pine based on whole-genome resequencing, that represents both genic and intergenic regions. This array was designed following a two-step procedure: first, 192 trees were sequenced, and a 430K SNP screening array was constructed. Then, 480 samples, including haploid megagametophytes, full-sib family trios, breeding population, and range-wide individuals from across Eurasia were genotyped with the screening array. The best 50K SNPs were selected based on quality, replicability, distribution across the draft genome assembly, balance between genic and intergenic regions, and genotype-environment and genotype-phenotype associations. Of the final 49 877 probes tiled in the array, 20 372 (40.84%) occur inside gene models, while the rest lie in intergenic regions. We also show that the Psyl50K array can yield enough high-confidence SNPs for genetic studies in pine species from North America and Eurasia. This new genotyping tool will be a valuable resource for high-throughput fundamental and applied research of Scots pine and other pine species.
Collapse
Affiliation(s)
- Maximiliano Estravis Barcala
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre (UPSC), Swedish University of Agricultural Sciences, Umeå, Sweden
| | - Tom van der Valk
- Department of Cell and Molecular Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Zhiqiang Chen
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre (UPSC), Swedish University of Agricultural Sciences, Umeå, Sweden
| | - Tomas Funda
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre (UPSC), Swedish University of Agricultural Sciences, Umeå, Sweden
| | - Rajiv Chaudhary
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre (UPSC), Swedish University of Agricultural Sciences, Umeå, Sweden
| | - Adam Klingberg
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre (UPSC), Swedish University of Agricultural Sciences, Umeå, Sweden
- Skogforsk, Sävar, Uppsala, Sweden
| | - Irena Fundova
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre (UPSC), Swedish University of Agricultural Sciences, Umeå, Sweden
| | | | | | - Carolina Bernhardsson
- Department of Organismal Biology, Human Evolution, Uppsala University, Uppsala, Sweden
- Department of Plant Biology, Linnean Centre for Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Björn Nystedt
- Department of Cell and Molecular Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Pär K Ingvarsson
- Department of Plant Biology, Linnean Centre for Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Ellen Sherwood
- Department of Biochemistry and Biophysics, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
- Department of Gene Technology, Science for Life Laboratory, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Nathaniel Street
- Department of Plant Physiology, Umeå Plant Science Centre (UPSC), Umeå University, Umeå, Sweden
| | - Ulf Gyllensten
- Department of Immunology, Genetics, and Pathology, Biomedical Center, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Ove Nilsson
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre (UPSC), Swedish University of Agricultural Sciences, Umeå, Sweden
| | - Harry X Wu
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre (UPSC), Swedish University of Agricultural Sciences, Umeå, Sweden
| |
Collapse
|
2
|
Bani-Wais DFN, Ad'hiah AH. A novel intergenic variant, rs2004339 A/G, of the gene encoding interleukin-40, C17orf99, is associated with risk of rheumatoid arthritis in Iraqi women. Mol Immunol 2023; 164:39-46. [PMID: 37951185 DOI: 10.1016/j.molimm.2023.11.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 10/25/2023] [Accepted: 11/03/2023] [Indexed: 11/13/2023]
Abstract
Interleukin-40 (IL-40) is a novel cytokine encoded by the chromosome 17 open reading frame 99 (C17orf99) gene. Recent studies have shown that IL-40 levels are significantly up-regulated in patients with rheumatoid arthritis (RA). However, the association of genetic variants of the C17orf99 gene with the risk of RA has not been investigated. In this case-control study, two intergenic variants, rs2004339 A/G and rs2310998 G/A, were genotyped for the first time in 120 Iraqi women with RA (30 newly diagnosed [ND] and 90 medicated [MD; treated with the tumor necrosis inhibitor etanercept plus methotrexate]) and 110 control women using TaqMan 5'-allele discrimination method. Serum IL-40 levels were also determined using an enzyme-linked immunosorbent assay kit. Multinomial logistic regression analysis was used to analyze rs2004339 and rs2310998 under five genetic models (allele, recessive, dominant, over-dominant, and co-dominant). Results revealed that the mutant A allele (allele model) and the homozygous AA genotype (co-dominant model) of rs2004339 were significantly associated with an increased risk of RA (odds ratio [OR] = 3.37 and 7.44, respectively; corrected probability [pc] < 0.001), while rs2310998 showed no association with RA risk. When comparing the allele and genotype frequencies of rs2004339 and rs2310998 between ND and MD patients, there were no statistically significant differences. Haplotype analysis of the two variants (in the order rs2004339-rs2310998) revealed that haplotypes A-A (OR = 1.72; pc = 0.024) and A-G (OR = 2.85; pc < 0.001) were associated with an increased risk of RA. IL-40 levels (median and interquartile range) were significantly elevated in RA patients compared to controls (29.3 [15.5-41.5] vs. 12.6 [7.4-18.8] pg/mL; p < 0.001). IL-40 levels were not influence by disease duration or disease activity, but the rs2310998 genotypes had an effect; IL-40 levels were significantly higher in women with the AA genotype than in women with the GG genotype (20.1 [12.9-37.1] vs. 15.8 [8.3-22.6] pg/mL; p = 0.006). Regarding medication, IL-40 tended to show elevated levels in ND cases compared to MD cases but without a significant difference. In conclusion, the mutant A allele and the mutant-type AA genotype of the intergenic variant rs2004339 were associated with an increased risk of RA among Iraqi women. Serum IL-40 levels were also elevated in patients, particularly ND patients, and were positively affected by the mutant-type AA genotype. Accordingly, the role of IL-40 in the pathogenesis of RA has been indicated.
Collapse
Affiliation(s)
- Dhuha F N Bani-Wais
- Department of Biotechnology, College of Science, University of Baghdad, Baghdad, Iraq
| | - Ali H Ad'hiah
- Tropical-Biological Research Unit, College of Science, University of Baghdad, Baghdad, Iraq.
| |
Collapse
|
3
|
Xu H, Lin S, Zhou Z, Li D, Zhang X, Yu M, Zhao R, Wang Y, Qian J, Li X, Li B, Wei C, Chen K, Yoshimura T, Wang JM, Huang J. New genetic and epigenetic insights into the chemokine system: the latest discoveries aiding progression toward precision medicine. Cell Mol Immunol 2023; 20:739-776. [PMID: 37198402 PMCID: PMC10189238 DOI: 10.1038/s41423-023-01032-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 04/14/2023] [Indexed: 05/19/2023] Open
Abstract
Over the past thirty years, the importance of chemokines and their seven-transmembrane G protein-coupled receptors (GPCRs) has been increasingly recognized. Chemokine interactions with receptors trigger signaling pathway activity to form a network fundamental to diverse immune processes, including host homeostasis and responses to disease. Genetic and nongenetic regulation of both the expression and structure of chemokines and receptors conveys chemokine functional heterogeneity. Imbalances and defects in the system contribute to the pathogenesis of a variety of diseases, including cancer, immune and inflammatory diseases, and metabolic and neurological disorders, which render the system a focus of studies aiming to discover therapies and important biomarkers. The integrated view of chemokine biology underpinning divergence and plasticity has provided insights into immune dysfunction in disease states, including, among others, coronavirus disease 2019 (COVID-19). In this review, by reporting the latest advances in chemokine biology and results from analyses of a plethora of sequencing-based datasets, we outline recent advances in the understanding of the genetic variations and nongenetic heterogeneity of chemokines and receptors and provide an updated view of their contribution to the pathophysiological network, focusing on chemokine-mediated inflammation and cancer. Clarification of the molecular basis of dynamic chemokine-receptor interactions will help advance the understanding of chemokine biology to achieve precision medicine application in the clinic.
Collapse
Affiliation(s)
- Hanli Xu
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Shuye Lin
- Cancer Research Center, Beijing Chest Hospital, Capital Medical University, Beijing Tuberculosis and Thoracic Tumor Institute, 101149, Beijing, China
| | - Ziyun Zhou
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Duoduo Li
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Xiting Zhang
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Muhan Yu
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Ruoyi Zhao
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Yiheng Wang
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Junru Qian
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Xinyi Li
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Bohan Li
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Chuhan Wei
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China
| | - Keqiang Chen
- Laboratory of Cancer Innovation, Center for Cancer Research, National Cancer Institute at Frederick, Frederick, MD, 21702, USA
| | - Teizo Yoshimura
- Laboratory of Cancer Innovation, Center for Cancer Research, National Cancer Institute at Frederick, Frederick, MD, 21702, USA
| | - Ji Ming Wang
- Laboratory of Cancer Innovation, Center for Cancer Research, National Cancer Institute at Frederick, Frederick, MD, 21702, USA
| | - Jiaqiang Huang
- College of Life Sciences and Bioengineering, School of Physical Science and Engineering, Beijing Jiaotong University, 3 ShangyuanCun, Haidian District, 100044, Beijing, P.R. China.
- Cancer Research Center, Beijing Chest Hospital, Capital Medical University, Beijing Tuberculosis and Thoracic Tumor Institute, 101149, Beijing, China.
- Laboratory of Cancer Innovation, Center for Cancer Research, National Cancer Institute at Frederick, Frederick, MD, 21702, USA.
| |
Collapse
|
4
|
Chang EH, Pouladi N, Guerra S, Jandova J, Kim A, Li H, Li J, Morgan W, Stern DA, Willis AL, Lussier YA, Martinez FD. Epithelial cell responses to rhinovirus identify an early-life-onset asthma phenotype in adults. J Allergy Clin Immunol 2022; 150:604-611. [PMID: 35367470 PMCID: PMC9463086 DOI: 10.1016/j.jaci.2022.03.020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 02/15/2022] [Accepted: 03/24/2022] [Indexed: 11/15/2022]
Abstract
BACKGROUND The study of pathogenic mechanisms in adult asthma is often marred by a lack of precise information about the natural history of the disease. Children who have persistent wheezing (PW) during the first 6 years of life and whose symptoms start before age 3 years (PW+) are much more likely to have wheezing illnesses due to rhinovirus (RV) in infancy and to have asthma into adult life than are those who do not have PW (PW-). OBJECTIVE Our aim was to determine whether nasal epithelial cells from PW+ asthmatic adults as compared with cells from PW- asthmatic adults show distinct biomechanistic processes activated by RV exposure. METHODS Air-liquid interface cultures derived from nasal epithelial cells of 36-year old participants with active asthma with and without a history of PW in childhood (10 PW+ participants and 20 PW- participants) from the Tucson Children's Respiratory Study were challenged with a human RV-A strain (RV-A16) or control, and their RNA was sequenced. RESULTS A total of 35 differentially expressed genes involved in extracellular remodeling and angiogenesis distinguished the PW+ group from the PW- group at baseline and after RV-A stimulation. Notably, 22 transcriptomic pathways showed PW-by-RV interactions; the pathways were invariably overactivated in PW+ patients, and were involved in Toll-like receptor- and cytokine-mediated responses, remodeling, and angiogenic processes. CONCLUSIONS Asthmatic adults with a history of persistent wheeze in the first 6 years of life have specific biomolecular alterations in response to RV-A that are not present in patients without such a history. Targeting these mechanisms may slow the progression of asthma in these patients.
Collapse
Affiliation(s)
- Eugene H Chang
- Department of Otolaryngology, University of Arizona, Tucson, Arizona
- College of Medicine, University of Arizona, Tucson, Arizona
- Asthma / Airway Disease Research Center, University of Arizona, Tucson, Arizona
- The University of Arizona BIO5 Institute, University of Arizona, Tucson, Arizona
| | - Nima Pouladi
- Department of Biomedical Informatics, The University of Utah School of Medicine, Salt Lake City, UT
| | - Stefano Guerra
- College of Medicine, University of Arizona, Tucson, Arizona
- Asthma / Airway Disease Research Center, University of Arizona, Tucson, Arizona
- The University of Arizona BIO5 Institute, University of Arizona, Tucson, Arizona
| | - Jana Jandova
- Department of Otolaryngology, University of Arizona, Tucson, Arizona
| | - Alexander Kim
- Department of Otolaryngology, University of Arizona, Tucson, Arizona
| | - Haiquan Li
- The University of Arizona BIO5 Institute, University of Arizona, Tucson, Arizona
| | - Jianrong Li
- Department of Biomedical Informatics, The University of Utah School of Medicine, Salt Lake City, UT
| | - Wayne Morgan
- College of Medicine, University of Arizona, Tucson, Arizona
- Asthma / Airway Disease Research Center, University of Arizona, Tucson, Arizona
- The University of Arizona BIO5 Institute, University of Arizona, Tucson, Arizona
| | - Debra A Stern
- College of Medicine, University of Arizona, Tucson, Arizona
- Asthma / Airway Disease Research Center, University of Arizona, Tucson, Arizona
- The University of Arizona BIO5 Institute, University of Arizona, Tucson, Arizona
| | - Amanda L Willis
- Department of Otolaryngology, University of Arizona, Tucson, Arizona
| | - Yves A. Lussier
- Department of Biomedical Informatics, The University of Utah School of Medicine, Salt Lake City, UT
| | - Fernando D Martinez
- College of Medicine, University of Arizona, Tucson, Arizona
- Asthma / Airway Disease Research Center, University of Arizona, Tucson, Arizona
- The University of Arizona BIO5 Institute, University of Arizona, Tucson, Arizona
| |
Collapse
|
5
|
Wang M, Song WM, Ming C, Wang Q, Zhou X, Xu P, Krek A, Yoon Y, Ho L, Orr ME, Yuan GC, Zhang B. Guidelines for bioinformatics of single-cell sequencing data analysis in Alzheimer's disease: review, recommendation, implementation and application. Mol Neurodegener 2022; 17:17. [PMID: 35236372 PMCID: PMC8889402 DOI: 10.1186/s13024-022-00517-z] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2021] [Accepted: 01/18/2022] [Indexed: 12/13/2022] Open
Abstract
Alzheimer's disease (AD) is the most common form of dementia, characterized by progressive cognitive impairment and neurodegeneration. Extensive clinical and genomic studies have revealed biomarkers, risk factors, pathways, and targets of AD in the past decade. However, the exact molecular basis of AD development and progression remains elusive. The emerging single-cell sequencing technology can potentially provide cell-level insights into the disease. Here we systematically review the state-of-the-art bioinformatics approaches to analyze single-cell sequencing data and their applications to AD in 14 major directions, including 1) quality control and normalization, 2) dimension reduction and feature extraction, 3) cell clustering analysis, 4) cell type inference and annotation, 5) differential expression, 6) trajectory inference, 7) copy number variation analysis, 8) integration of single-cell multi-omics, 9) epigenomic analysis, 10) gene network inference, 11) prioritization of cell subpopulations, 12) integrative analysis of human and mouse sc-RNA-seq data, 13) spatial transcriptomics, and 14) comparison of single cell AD mouse model studies and single cell human AD studies. We also address challenges in using human postmortem and mouse tissues and outline future developments in single cell sequencing data analysis. Importantly, we have implemented our recommended workflow for each major analytic direction and applied them to a large single nucleus RNA-sequencing (snRNA-seq) dataset in AD. Key analytic results are reported while the scripts and the data are shared with the research community through GitHub. In summary, this comprehensive review provides insights into various approaches to analyze single cell sequencing data and offers specific guidelines for study design and a variety of analytic directions. The review and the accompanied software tools will serve as a valuable resource for studying cellular and molecular mechanisms of AD, other diseases, or biological systems at the single cell level.
Collapse
Affiliation(s)
- Minghui Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Won-min Song
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Chen Ming
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Qian Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Xianxiao Zhou
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Peng Xu
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Azra Krek
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029 USA
| | - Yonejung Yoon
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Lap Ho
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| | - Miranda E. Orr
- Department of Internal Medicine, Section of Gerontology and Geriatric Medicine, Wake Forest School of Medicine, Winston-Salem, North Carolina USA
- Sticht Center for Healthy Aging and Alzheimer’s Prevention, Wake Forest School of Medicine, Winston-Salem, North Carolina USA
| | - Guo-Cheng Yuan
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029 USA
| | - Bin Zhang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, 1470 Madison Avenue, Room S8-111, New York, NY 10029 USA
| |
Collapse
|
6
|
Nevola KT, Nagarajan A, Hinton AC, Trajanoska K, Formosa MM, Xuereb-Anastasi A, van der Velde N, Stricker BH, Rivadeneira F, Fuggle NR, Westbury LD, Dennison EM, Cooper C, Kiel DP, Motyl KJ, Lary CW. Pharmacogenomic Effects of β-Blocker Use on Femoral Neck Bone Mineral Density. J Endocr Soc 2021; 5:bvab092. [PMID: 34195528 PMCID: PMC8237849 DOI: 10.1210/jendso/bvab092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Indexed: 11/19/2022] Open
Abstract
CONTEXT Recent studies have shown that β-blocker (BB) users have a decreased risk of fracture and higher bone mineral density (BMD) compared to nonusers, likely due to the suppression of adrenergic signaling in osteoblasts, leading to increased BMD. There is also variability in the effect size of BB use on BMD in humans, which may be due to pharmacogenomic effects. OBJECTIVE To investigate potential single-nucleotide variations (SNVs) associated with the effect of BB use on femoral neck BMD, we performed a cross-sectional analysis using clinical data, dual-energy x-ray absorptiometry, and genetic data from the Framingham Heart Study's (FHS) Offspring Cohort. We then sought to validate our top 4 genetic findings using data from the Rotterdam Study, the BPROOF Study, the Malta Osteoporosis Fracture Study (MOFS), and the Hertfordshire Cohort Study. METHODS We used sex-stratified linear mixed models to determine SNVs that had a significant interaction effect with BB use on femoral neck (FN) BMD across 11 gene regions. We also evaluated the association of our top SNVs from the FHS with microRNA (miRNA) expression in blood and identified potential miRNA-mediated mechanisms by which these SNVs may affect FN BMD. RESULTS One variation (rs11124190 in HDAC4) was validated in females using data from the Rotterdam Study, while another (rs12414657 in ADRB1) was validated in females using data from the MOFS. We performed an exploratory meta-analysis of all 5 studies for these variations, which further validated our findings. CONCLUSION This analysis provides a starting point for investigating the pharmacogenomic effects of BB use on BMD measures.
Collapse
Affiliation(s)
- Kathleen T Nevola
- Graduate School of Biomedical Sciences, Tufts University, Boston, MA, 02111, USA
| | - Archana Nagarajan
- Graduate School of Biomedical Sciences, Tufts University, Boston, MA, 02111, USA
- Center for Outcomes Research and Evaluation, Maine Medical Center Research Institute, Portland, ME 04101, USA
| | - Alexandra C Hinton
- Center for Outcomes Research and Evaluation, Maine Medical Center Research Institute, Portland, ME 04101, USA
| | - Katerina Trajanoska
- Department of Internal Medicine, Erasmus MC, University Medical Center, Rotterdam 3015 GD, the Netherlands
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam 3015 GD, the Netherlands
| | - Melissa M Formosa
- Department of Applied Biomedical Science, Faculty of Health Sciences, University of Malta, Msida MSD 2080, Malta
- Centre for Molecular Medicine and Biobanking, MSD 2080, Malta
| | - Angela Xuereb-Anastasi
- Department of Applied Biomedical Science, Faculty of Health Sciences, University of Malta, Msida MSD 2080, Malta
- Centre for Molecular Medicine and Biobanking, MSD 2080, Malta
| | - Nathalie van der Velde
- Department of Internal Medicine, Geriatrics, Amsterdam Public Health Research Institute, Amsterdam University Medical Center, Amsterdam, 1105 AZ, the Netherlands
| | - Bruno H Stricker
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam 3015 GD, the Netherlands
| | - Fernando Rivadeneira
- Department of Internal Medicine, Erasmus MC, University Medical Center, Rotterdam 3015 GD, the Netherlands
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam 3015 GD, the Netherlands
| | - Nicholas R Fuggle
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, SO16 6YD, UK
| | - Leo D Westbury
- NIHR Southampton Biomedical Research Centre, University of Southampton and University Hospital Southampton NHS Foundation Trust, Southampton, UK
| | - Elaine M Dennison
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, SO16 6YD, UK
- Victoria University of Wellington, Wellington, New Zealand
| | - Cyrus Cooper
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, SO16 6YD, UK
- NIHR Southampton Biomedical Research Centre, University of Southampton and University Hospital Southampton NHS Foundation Trust, Southampton, UK
- NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Douglas P Kiel
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA 02215, USA
- Hinda and Arthur Marcus Institute for Aging Research Hebrew SeniorLife, Boston, MA 02131, USA
| | - Katherine J Motyl
- Center for Molecular Medicine, Maine Medical Center Research Institute, Maine Medical Center, Scarborough, ME 04074, USA
| | - Christine W Lary
- Center for Outcomes Research and Evaluation, Maine Medical Center Research Institute, Portland, ME 04101, USA
| |
Collapse
|
7
|
The Identification of the SARS-CoV-2 Whole Genome: Nine Cases Among Patients in Banten Province, Indonesia. JOURNAL OF PURE AND APPLIED MICROBIOLOGY 2021. [DOI: 10.22207/jpam.15.2.52] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the strain of virus that causes coronavirus disease 2019 (COVID-19), the respiratory illness responsible for the current pandemic. Viral genome sequencing has been widely applied during outbreaks to study the relatedness of this virus to other viruses, its transmission mode, pace, evolution and geographical spread, and also its adaptation to human hosts. To date, more than 90,000 SARS-CoV-2 genome sequences have been uploaded to the GISAID database. The availability of sequencing data along with clinical and geographical data may be useful for epidemiological investigations. In this study, we aimed to analyse the genetic background of SARS-CoV-2 from patients in Indonesia by whole genome sequencing. We examined nine samples from COVID-19 patients with RT-PCR cycle threshold (Ct) of less than 25 using ARTIC Network protocols for Oxford Nanopore’s Gridi On sequencer. The analytical methods were based on the ARTIC multiplex PCR sequencing protocol for COVID-19. In this study, we found that several genetic variants within the nine COVID-19 patient samples. We identified a mutation at position 614 P323L mutation in the ORF1ab gene often found in our severe patient samples. The number of SNPs and their location within the SARS-CoV-2 genome seems to vary. This diversity might be responsible for the virulence of the virus and its clinical manifestation.
Collapse
|
8
|
Bernhardsson C, Zan Y, Chen Z, Ingvarsson PK, Wu HX. Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species. Mol Ecol Resour 2020; 21:880-896. [PMID: 33179386 PMCID: PMC7984398 DOI: 10.1111/1755-0998.13292] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 10/23/2020] [Accepted: 11/04/2020] [Indexed: 12/30/2022]
Abstract
Norway spruce (Picea abies L. Karst) is one of the most important forest tree species with significant economic and ecological impact in Europe. For decades, genomic and genetic studies on Norway spruce have been challenging due to the large and repetitive genome (19.6 Gb with more than 70% being repetitive). To accelerate genomic studies, including population genetics, genome‐wide association studies (GWAS) and genomic selection (GS), in Norway spruce and related species, we here report on the design and performance of a 50K single nucleotide polymorphism (SNP) genotyping array for Norway spruce. The array is developed based on whole genome resequencing (WGS), making it the first WGS‐based SNP array in any conifer species so far. After identifying SNPs using genome resequencing data from 29 trees collected in northern Europe, we adopted a two‐step approach to design the array. First, we built a 450K screening array and used this to genotype a population of 480 trees sampled from both natural and breeding populations across the Norway spruce distribution range. These samples were then used to select high‐confidence probes that were put on the final 50K array. The SNPs selected are distributed over 45,552 scaffolds from the P. abies version 1.0 genome assembly and target 19,954 unique gene models with an even coverage of the 12 linkage groups in Norway spruce. We show that the array has a 99.5% probe specificity, >98% Mendelian allelic inheritance concordance, an average sample call rate of 96.30% and an SNP call rate of 98.90% in family trios and haploid tissues. We also observed that 23,797 probes (50%) could be identified with high confidence in three other spruce species (white spruce [Picea glauca], black spruce [P. mariana] and Sitka spruce [P. sitchensis]). The high‐quality genotyping array will be a valuable resource for genetic and genomic studies in Norway spruce as well as in other conifer species of the same genus.
Collapse
Affiliation(s)
- Carolina Bernhardsson
- Department of Ecology and Environmental Science, Umeå University, Umeå, Sweden.,Department of Organismal Biology, Uppsala University, Uppsala, Sweden
| | - Yanjun Zan
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Science, Umeå, Sweden
| | - Zhiqiang Chen
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Science, Umeå, Sweden
| | - Pär K Ingvarsson
- Linnean Centre for Plant Biology, Department of Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Science, Uppsala, Sweden
| | - Harry X Wu
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Science, Umeå, Sweden.,Beijing Advanced Innovation Centre for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, China.,Black Mountain Laboratory, CSIRO National Research Collection Australia, Canberra, ACT, Australia
| |
Collapse
|
9
|
Baldwin E, Han J, Luo W, Zhou J, An L, Liu J, Zhang HH, Li H. On fusion methods for knowledge discovery from multi-omics datasets. Comput Struct Biotechnol J 2020; 18:509-517. [PMID: 32206210 PMCID: PMC7078495 DOI: 10.1016/j.csbj.2020.02.011] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Revised: 01/25/2020] [Accepted: 02/19/2020] [Indexed: 12/22/2022] Open
Abstract
Recent years have witnessed the tendency of measuring a biological sample on multiple omics scales for a comprehensive understanding of how biological activities on varying levels are perturbed by genetic variants, environments, and their interactions. This new trend raises substantial challenges to data integration and fusion, of which the latter is a specific type of integration that applies a uniform method in a scalable manner, to solve biological problems which the multi-omics measurements target. Fusion-based analysis has advanced rapidly in the past decade, thanks to application drivers and theoretical breakthroughs in mathematics, statistics, and computer science. We will briefly address these methods from methodological and mathematical perspectives and categorize them into three types of approaches: data fusion (a narrowed definition as compared to the general data fusion concept), model fusion, and mixed fusion. We will demonstrate at least one typical example in each specific category to exemplify the characteristics, principles, and applications of the methods in general, as well as discuss the gaps and potential issues for future studies.
Collapse
Affiliation(s)
- Edwin Baldwin
- Department of Biosystems Engineering, University of Arizona, United States
| | - Jiali Han
- Department of Systems and Industrial Engineering, University of Arizona, United States
| | - Wenting Luo
- Department of Biosystems Engineering, University of Arizona, United States
| | - Jin Zhou
- Department of Epidemiology and Biostatics, University of Arizona, United States
| | - Lingling An
- Department of Biosystems Engineering, University of Arizona, United States.,Department of Epidemiology and Biostatics, University of Arizona, United States
| | - Jian Liu
- Department of Systems and Industrial Engineering, University of Arizona, United States
| | - Hao Helen Zhang
- Department of Mathematics, University of Arizona, United States
| | - Haiquan Li
- Department of Biosystems Engineering, University of Arizona, United States
| |
Collapse
|
10
|
Placek K, Baer GM, Elman L, McCluskey L, Hennessy L, Ferraro PM, Lee EB, Lee VMY, Trojanowski JQ, Van Deerlin VM, Grossman M, Irwin DJ, McMillan CT. UNC13A polymorphism contributes to frontotemporal disease in sporadic amyotrophic lateral sclerosis. Neurobiol Aging 2019; 73:190-199. [PMID: 30368160 PMCID: PMC6251755 DOI: 10.1016/j.neurobiolaging.2018.09.031] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2018] [Revised: 09/20/2018] [Accepted: 09/23/2018] [Indexed: 12/26/2022]
Abstract
The majority (90%-95%) of amyotrophic lateral sclerosis (ALS) is sporadic, and ∼50% of patients develop symptoms of frontotemporal degeneration (FTD) associated with shorter survival. The genetic polymorphism rs12608932 in UNC13A confers increased risk of sporadic ALS and sporadic FTD and modifies survival in ALS. Here, we evaluate whether rs12608932 is also associated with frontotemporal disease in sporadic ALS. We identified reduced cortical thickness in sporadic ALS with T1-weighted magnetic resonance imaging (N = 109) relative to controls (N = 113), and observed that minor allele (C) carriers exhibited greater reduction of cortical thickness in the dorsal prefrontal, ventromedial prefrontal, anterior temporal, and middle temporal cortices and worse performance on a frontal lobe-mediated cognitive test (reverse digit span). In sporadic ALS with autopsy data (N = 102), minor allele homozygotes exhibited greater burden of phosphorylated tar DNA-binding protein-43 kda (TDP-43) pathology in the middle frontal, middle temporal, and motor cortices. Our findings demonstrate converging evidence that rs12608932 may modify frontotemporal disease in sporadic ALS and suggest that rs12608932 may function as a prognostic indicator and could be used to define patient endophenotypes in clinical trials.
Collapse
Affiliation(s)
- Katerina Placek
- Department of Neurology, University of Pennsylvania, Penn Frontotemporal Degeneration Center, Philadelphia, PA, USA
| | - G Michael Baer
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Center for Neurodegenerative Disease Research, Philadelphia, PA, USA
| | - Lauren Elman
- University of Pennsylvania, Penn Comprehensive ALS Center, Philadelphia, PA, USA
| | - Leo McCluskey
- University of Pennsylvania, Penn Comprehensive ALS Center, Philadelphia, PA, USA
| | - Laura Hennessy
- Department of Neurology, University of Pennsylvania, Penn Frontotemporal Degeneration Center, Philadelphia, PA, USA
| | - Pilar M Ferraro
- Department of Neurology, University of Pennsylvania, Penn Frontotemporal Degeneration Center, Philadelphia, PA, USA
| | - Edward B Lee
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Center for Neurodegenerative Disease Research, Philadelphia, PA, USA
| | - Virginia M Y Lee
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Center for Neurodegenerative Disease Research, Philadelphia, PA, USA
| | - John Q Trojanowski
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Center for Neurodegenerative Disease Research, Philadelphia, PA, USA
| | - Vivianna M Van Deerlin
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Center for Neurodegenerative Disease Research, Philadelphia, PA, USA
| | - Murray Grossman
- Department of Neurology, University of Pennsylvania, Penn Frontotemporal Degeneration Center, Philadelphia, PA, USA
| | - David J Irwin
- Department of Neurology, University of Pennsylvania, Penn Frontotemporal Degeneration Center, Philadelphia, PA, USA; Department of Pathology and Laboratory Medicine, University of Pennsylvania, Center for Neurodegenerative Disease Research, Philadelphia, PA, USA
| | - Corey T McMillan
- Department of Neurology, University of Pennsylvania, Penn Frontotemporal Degeneration Center, Philadelphia, PA, USA.
| |
Collapse
|
11
|
Li H, Fan J, Vitali F, Berghout J, Aberasturi D, Li J, Wilson L, Chiu W, Pumarejo M, Han J, Kenost C, Koripella PC, Pouladi N, Billheimer D, Bedrick EJ, Lussier YA. Novel disease syndromes unveiled by integrative multiscale network analysis of diseases sharing molecular effectors and comorbidities. BMC Med Genomics 2018; 11:112. [PMID: 30598089 PMCID: PMC6311938 DOI: 10.1186/s12920-018-0428-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Background Forty-two percent of patients experience disease comorbidity, contributing substantially to mortality rates and increased healthcare costs. Yet, the possibility of underlying shared mechanisms for diseases remains not well established, and few studies have confirmed their molecular predictions with clinical datasets. Methods In this work, we integrated genome-wide association study (GWAS) associating diseases and single nucleotide polymorphisms (SNPs) with transcript regulatory activity from expression quantitative trait loci (eQTL). This allowed novel mechanistic insights for noncoding and intergenic regions. We then analyzed pairs of SNPs across diseases to identify shared molecular effectors robust to multiple test correction (False Discovery Rate FDReRNA < 0.05). We hypothesized that disease pairs found to be molecularly convergent would also be significantly overrepresented among comorbidities in clinical datasets. To assess our hypothesis, we used clinical claims datasets from the Healthcare Cost and Utilization Project (HCUP) and calculated significant disease comorbidities (FDRcomorbidity < 0.05). We finally verified if disease pairs resulting molecularly convergent were also statistically comorbid more than by chance using the Fisher’s Exact Test. Results Our approach integrates: (i) 6175 SNPs associated with 238 diseases from ~ 1000 GWAS, (ii) eQTL associations from 19 tissues, and (iii) claims data for 35 million patients from HCUP. Logistic regression (controlled for age, gender, and race) identified comorbidities in HCUP, while enrichment analyses identified cis- and trans-eQTL downstream effectors of GWAS-identified variants. Among ~ 16,000 combinations of diseases, 398 disease-pairs were prioritized by both convergent eQTL-genetics (RNA overlap enrichment, FDReRNA < 0.05) and clinical comorbidities (OR > 1.5, FDRcomorbidity < 0.05). Case studies of comorbidities illustrate specific convergent noncoding regulatory elements. An intergenic architecture of disease comorbidity was unveiled due to GWAS and eQTL-derived convergent mechanisms between distinct diseases being overrepresented among observed comorbidities in clinical datasets (OR = 8.6, p-value = 6.4 × 10− 5 FET). Conclusions These comorbid diseases with convergent eQTL genetic mechanisms suggest clinical syndromes. While it took over a decade to confirm the genetic underpinning of the metabolic syndrome, this study is likely highlighting hundreds of new ones. Further, this knowledge may improve the clinical management of comorbidities with precision and shed light on novel approaches of drug repositioning or SNP-guided precision molecular therapy inclusive of intergenic risks. Electronic supplementary material The online version of this article (10.1186/s12920-018-0428-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Haiquan Li
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA. .,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA. .,Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ, 85721, USA. .,Department of Biosystems Engineering, The University of Arizona, Tucson, AZ, 85721, USA.
| | - Jungwei Fan
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA
| | - Francesca Vitali
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA.,University of Arizona Health Sciences, The University of Arizona, Tucson, AZ, 85721, USA
| | - Joanne Berghout
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA.,The Center for Applied Genetics and Genomics Medicine, The University of Arizona, Tucson, AZ, 85721, USA.,The Center for Innovation in Brain Science, The University of Arizona, Tucson, AZ, 85721, USA
| | - Dillon Aberasturi
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA.,Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ, 85721, USA
| | - Jianrong Li
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA.,University of Arizona Health Sciences, The University of Arizona, Tucson, AZ, 85721, USA
| | - Liam Wilson
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA
| | - Wesley Chiu
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA
| | - Minsu Pumarejo
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA
| | - Jiali Han
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Systems & Industrial Engineering, The University of Arizona, Tucson, AZ, 85721, USA
| | - Colleen Kenost
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA
| | - Pradeep C Koripella
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA
| | - Nima Pouladi
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA
| | - Dean Billheimer
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ, 85721, USA.,University of Arizona Health Sciences, The University of Arizona, Tucson, AZ, 85721, USA.,Epidemiology and Biostatistics Department, College of Public Health, The University of Arizona, Tucson, AZ, 85721, USA
| | - Edward J Bedrick
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA.,Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ, 85721, USA.,University of Arizona Health Sciences, The University of Arizona, Tucson, AZ, 85721, USA.,Epidemiology and Biostatistics Department, College of Public Health, The University of Arizona, Tucson, AZ, 85721, USA
| | - Yves A Lussier
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ, 85721, USA. .,Department of Medicine at the College of Medicine-Tucson, The University of Arizona, Tucson, AZ, 85721, USA. .,Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ, 85721, USA. .,The Center for Applied Genetics and Genomics Medicine, The University of Arizona, Tucson, AZ, 85721, USA. .,The Center for Innovation in Brain Science, The University of Arizona, Tucson, AZ, 85721, USA. .,UA Cancer Center, The University of Arizona, Tucson, AZ, 85721, USA. .,University of Arizona Health Sciences, The University of Arizona, Tucson, AZ, 85721, USA.
| |
Collapse
|
12
|
Han J, Li J, Achour I, Pesce L, Foster I, Li H, Lussier YA. Convergent downstream candidate mechanisms of independent intergenic polymorphisms between co-classified diseases implicate epistasis among noncoding elements. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018; 23:524-535. [PMID: 29218911 PMCID: PMC5730078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Eighty percent of DNA outside protein coding regions was shown biochemically functional by the ENCODE project, enabling studies of their interactions. Studies have since explored how convergent downstream mechanisms arise from independent genetic risks of one complex disease. However, the cross-talk and epistasis between intergenic risks associated with distinct complex diseases have not been comprehensively characterized. Our recent integrative genomic analysis unveiled downstream biological effectors of disease-specific polymorphisms buried in intergenic regions, and we then validated their genetic synergy and antagonism in distinct GWAS. We extend this approach to characterize convergent downstream candidate mechanisms of distinct intergenic SNPs across distinct diseases within the same clinical classification. We construct a multipartite network consisting of 467 diseases organized in 15 classes, 2,358 disease-associated SNPs, 6,301 SNPassociated mRNAs by eQTL, and mRNA annotations to 4,538 Gene Ontology mechanisms. Functional similarity between two SNPs (similar SNP pairs) is imputed using a nested information theoretic distance model for which p-values are assigned by conservative scale-free permutation of network edges without replacement (node degrees constant). At FDR≤5%, we prioritized 3,870 intergenic SNP pairs associated, among which 755 are associated with distinct diseases sharing the same disease class, implicating 167 intergenic SNPs, 14 classes, 230 mRNAs, and 134 GO terms. Co-classified SNP pairs were more likely to be prioritized as compared to those of distinct classes confirming a noncoding genetic underpinning to clinical classification (odds ratio ∼3.8; p≤10-25). The prioritized pairs were also enriched in regions bound to the same/interacting transcription factors and/or interacting in long-range chromatin interactions suggestive of epistasis (odds ratio ∼ 2,500; p≤10-25). This prioritized network implicates complex epistasis between intergenic polymorphisms of co-classified diseases and offers a roadmap for a novel therapeutic paradigm: repositioning medications that target proteins within downstream mechanisms of intergenic disease-associated SNPs. Supplementary information and software: http://lussiergroup.org/publications/disease_class.
Collapse
Affiliation(s)
- Jiali Han
- Center for Biomedical Informatics and Biostatistics (CB2) and Departments of Medicine and of Systems and Industrial Engineering, The University of Arizona, Tucson, AZ 85721, USA,
| | | | | | | | | | | | | |
Collapse
|
13
|
Lynch SM, Mitra N, Ross M, Newcomb C, Dailey K, Jackson T, Zeigler-Johnson CM, Riethman H, Branas CC, Rebbeck TR. A Neighborhood-Wide Association Study (NWAS): Example of prostate cancer aggressiveness. PLoS One 2017; 12:e0174548. [PMID: 28346484 PMCID: PMC5367705 DOI: 10.1371/journal.pone.0174548] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Accepted: 03/11/2017] [Indexed: 12/31/2022] Open
Abstract
PURPOSE Cancer results from complex interactions of multiple variables at the biologic, individual, and social levels. Compared to other levels, social effects that occur geospatially in neighborhoods are not as well-studied, and empiric methods to assess these effects are limited. We propose a novel Neighborhood-Wide Association Study(NWAS), analogous to genome-wide association studies(GWAS), that utilizes high-dimensional computing approaches from biology to comprehensively and empirically identify neighborhood factors associated with disease. METHODS Pennsylvania Cancer Registry data were linked to U.S. Census data. In a successively more stringent multiphase approach, we evaluated the association between neighborhood (n = 14,663 census variables) and prostate cancer aggressiveness(PCA) with n = 6,416 aggressive (Stage≥3/Gleason grade≥7 cases) vs. n = 70,670 non-aggressive (Stage<3/Gleason grade<7) cases in White men. Analyses accounted for age, year of diagnosis, spatial correlation, and multiple-testing. We used generalized estimating equations in Phase 1 and Bayesian mixed effects models in Phase 2 to calculate odds ratios(OR) and confidence/credible intervals(CI). In Phase 3, principal components analysis grouped correlated variables. RESULTS We identified 17 new neighborhood variables associated with PCA. These variables represented income, housing, employment, immigration, access to care, and social support. The top hits or most significant variables related to transportation (OR = 1.05;CI = 1.001-1.09) and poverty (OR = 1.07;CI = 1.01-1.12). CONCLUSIONS This study introduces the application of high-dimensional, computational methods to large-scale, publically-available geospatial data. Although NWAS requires further testing, it is hypothesis-generating and addresses gaps in geospatial analysis related to empiric assessment. Further, NWAS could have broad implications for many diseases and future precision medicine studies focused on multilevel risk factors of disease.
Collapse
Affiliation(s)
- Shannon M. Lynch
- Fox Chase Cancer Center, Cancer Prevention and Control, Philadelphia, Pennsylvania, United States of America
- * E-mail:
| | - Nandita Mitra
- University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Michelle Ross
- University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Craig Newcomb
- University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Karl Dailey
- University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Tara Jackson
- University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
| | | | - Harold Riethman
- Old Dominion University, Norfolk, Virginia, United States of America
| | - Charles C. Branas
- University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, United States of America
- Columbia University, Mailman School of Public Health, New York, New York, United States of America
| | - Timothy R. Rebbeck
- Dana Farber Cancer Institute and Harvard TH Chan School of Public Health, Boston, Massachusetts, United States of America
| |
Collapse
|
14
|
Li Q, Schissler AG, Gardeux V, Berghout J, Achour I, Kenost C, Li H, Zhang HH, Lussier YA. kMEn: Analyzing noisy and bidirectional transcriptional pathway responses in single subjects. J Biomed Inform 2017; 66:32-41. [PMID: 28007582 PMCID: PMC5316373 DOI: 10.1016/j.jbi.2016.12.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2016] [Revised: 10/28/2016] [Accepted: 12/16/2016] [Indexed: 10/20/2022]
Abstract
MOTIVATION Understanding dynamic, patient-level transcriptomic response to therapy is an important step forward for precision medicine. However, conventional transcriptome analysis aims to discover cohort-level change, lacking the capacity to unveil patient-specific response to therapy. To address this gap, we previously developed two N-of-1-pathways methods, Wilcoxon and Mahalanobis distance, to detect unidirectionally responsive transcripts within a pathway using a pair of samples from a single subject. Yet, these methods cannot recognize bidirectionally (up and down) responsive pathways. Further, our previous approaches have not been assessed in presence of background noise and are not designed to identify differentially expressed mRNAs between two samples of a patient taken in different contexts (e.g. cancer vs non cancer), which we termed responsive transcripts (RTs). METHODS We propose a new N-of-1-pathways method, k-Means Enrichment (kMEn), that detects bidirectionally responsive pathways, despite background noise, using a pair of transcriptomes from a single patient. kMEn identifies transcripts responsive to the stimulus through k-means clustering and then tests for an over-representation of the responsive genes within each pathway. The pathways identified by kMEn are mechanistically interpretable pathways significantly responding to a stimulus. RESULTS In ∼9000 simulations varying six parameters, superior performance of kMEn over previous single-subject methods is evident by: (i) improved precision-recall at various levels of bidirectional response and (ii) lower rates of false positives (1-specificity) when more than 10% of genes in the genome are differentially expressed (background noise). In a clinical proof-of-concept, personal treatment-specific pathways identified by kMEn correlate with therapeutic response (p-value<0.01). CONCLUSION Through improved single-subject transcriptome dynamics of bidirectionally-regulated signals, kMEn provides a novel approach to identify mechanism-level biomarkers.
Collapse
Affiliation(s)
- Qike Li
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA; Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ 85721, USA
| | - A Grant Schissler
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA; Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ 85721, USA
| | - Vincent Gardeux
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA
| | - Joanne Berghout
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA
| | - Ikbel Achour
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA
| | - Colleen Kenost
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA
| | - Haiquan Li
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA.
| | - Hao Helen Zhang
- Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ 85721, USA; Department of Mathematics, The University of Arizona, Tucson, AZ 85721, USA.
| | - Yves A Lussier
- Center for Biomedical Informatics and Biostatistics, The University of Arizona, Tucson, AZ 85721, USA; Bio5 Institute, The University of Arizona, Tucson, AZ 85721, USA; Department of Medicine, The University of Arizona, Tucson, AZ 85721, USA; Graduate Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ 85721, USA; University of Arizona Cancer Center, The University of Arizona, Tucson, AZ 85721, USA; Institute for Genomics and Systems Biology, The University of Chicago, IL 60637, USA.
| |
Collapse
|
15
|
Pouladi N, Achour I, Li H, Berghout J, Kenost C, Gonzalez-Garay ML, Lussier YA. Biomechanisms of Comorbidity: Reviewing Integrative Analyses of Multi-omics Datasets and Electronic Health Records. Yearb Med Inform 2016; 25:194-206. [PMID: 27830251 PMCID: PMC5171562 DOI: 10.15265/iy-2016-040] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
OBJECTIVES Disease comorbidity is a pervasive phenomenon impacting patients' health outcomes, disease management, and clinical decisions. This review presents past, current and future research directions leveraging both phenotypic and molecular information to uncover disease similarity underpinning the biology and etiology of disease comorbidity. METHODS We retrieved ~130 publications and retained 59, ranging from 2006 to 2015, that comprise a minimum number of five diseases and at least one type of biomolecule. We surveyed their methods, disease similarity metrics, and calculation of comorbidities in the electronic health records, if present. RESULTS Among the surveyed studies, 44% generated or validated disease similarity metrics in context of comorbidity, with 60% being published in the last two years. As inputs, 87% of studies utilized intragenic loci and proteins while 13% employed RNA (mRNA, LncRNA or miRNA). Network modeling was predominantly used (35%) followed by statistics (28%) to impute similarity between these biomolecules and diseases. Studies with large numbers of biomolecules and diseases used network models or naïve overlap of disease-molecule associations, while machine learning, statistics, and information retrieval were utilized in smaller and moderate sized studies. Multiscale computations comprising shared function, network topology, and phenotypes were performed exclusively on proteins. CONCLUSION This review highlighted the growing methods for identifying the molecular mechanisms underpinning comorbidities that leverage multiscale molecular information and patterns from electronic health records. The survey unveiled that intergenic polymorphisms have been overlooked for similarity imputation compared to their intragenic counterparts, offering new opportunities to bridge the mechanistic and similarity gaps of comorbidity.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Y A Lussier
- Dr. Yves A. Lussier, The University of Arizona, Bio5 Building, 1657 East Helen Street, Tucson, AZ 85721, USA, Fax: +1 520 626 4824, E-Mail:
| |
Collapse
|