Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Margulies EH, Blanchette M, Haussler D, Green ED. Identification and characterization of multi-species conserved sequences. Genome Res 2004;13:2507-18. [PMID: 14656959 PMCID: PMC403793 DOI: 10.1101/gr.1602203] [Citation(s) in RCA: 242] [Impact Index Per Article: 11.5] [Indexed: 12/21/2022]

Cited by Other Article(s)

Argov CM, Shneyour A, Jubran J, Sabag E, Mansbach A, Sepunaru Y, Filtzer E, Gruber G, Volozhinsky M, Yogev Y, Birk O, Chalifa-Caspi V, Rokach L, Yeger-Lotem E. Tissue-aware interpretation of genetic variants advances the etiology of rare diseases. Mol Syst Biol 2024;20:1187-1206. [PMID: 39285047 PMCID: PMC11535248 DOI: 10.1038/s44320-024-00061-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2023] [Revised: 08/08/2024] [Accepted: 08/09/2024] [Indexed: 09/19/2024]

Affiliation(s)

Chanan M Argov: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Ariel Shneyour: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Juman Jubran: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Eric Sabag: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Avigdor Mansbach: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Yair Sepunaru: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Emmi Filtzer: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Gil Gruber: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Miri Volozhinsky: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Yuval Yogev: Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Ohad Birk: Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel; The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Vered Chalifa-Caspi: Ilse Katz Institute for Nanoscale Science & Technology, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Lior Rokach: Department of Software & Information Systems Engineering, Faculty of Engineering Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel
Esti Yeger-Lotem: Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel; The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, 84105, Israel

Roberts M, Josephs EB. Previously unmeasured genetic diversity explains part of Lewontin's paradox in a k-mer-based meta-analysis of 112 plant species. bioRxiv 2024:2024.05.17.594778. [PMID: 38798362 PMCID: PMC11118579 DOI: 10.1101/2024.05.17.594778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/29/2024]

Buffalo V, Kern AD. A quantitative genetic model of background selection in humans. PLoS Genet 2024;20:e1011144. [PMID: 38507461 PMCID: PMC10984650 DOI: 10.1371/journal.pgen.1011144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/17/2023] [Revised: 04/01/2024] [Accepted: 01/19/2024] [Indexed: 03/22/2024]

Omori Y, Burgess SM. The Goldfish Genome and Its Utility for Understanding Gene Regulation and Vertebrate Body Morphology. Methods Mol Biol 2024;2707:335-355. [PMID: 37668923 DOI: 10.1007/978-1-0716-3401-1_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/06/2023]

Guo Q, Wu S, Geschwind DH. Characterization of Gene Regulatory Elements in Human Fetal Cortical Development: Enhancing Our Understanding of Neurodevelopmental Disorders and Evolution. Dev Neurosci 2023;46:69-83. [PMID: 37231806 DOI: 10.1159/000530929] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 02/23/2023] [Accepted: 04/24/2023] [Indexed: 05/27/2023]

Song H, Wang Q, Zhang Z, Lin K, Pang E. Identification of clade-wide putative cis-regulatory elements from conserved non-coding sequences in Cucurbitaceae genomes. Hortic Res 2023;10:uhad038. [PMID: 37799630 PMCID: PMC10548412 DOI: 10.1093/hr/uhad038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/22/2022] [Accepted: 02/20/2023] [Indexed: 10/07/2023]

Chey YCJ, Arudkumar J, Aartsma-Rus A, Adikusuma F, Thomas PQ. CRISPR applications for Duchenne muscular dystrophy: From animal models to potential therapies. WIREs Mech Dis 2023;15:e1580. [PMID: 35909075 PMCID: PMC10078488 DOI: 10.1002/wsbm.1580] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 02/02/2022] [Revised: 04/28/2022] [Accepted: 06/30/2022] [Indexed: 01/31/2023]

Smeds L, Ellegren H. From high masked to high realized genetic load in inbred Scandinavian wolves. Mol Ecol 2022;32:1567-1580. [PMID: 36458895 DOI: 10.1111/mec.16802] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 07/28/2022] [Revised: 11/17/2022] [Accepted: 11/28/2022] [Indexed: 12/03/2022]

Zheng M, Li RG, Song J, Zhao X, Tang L, Erhardt S, Chen W, Nguyen BH, Li X, Li M, Wang J, Evans SM, Christoffels VM, Li N, Wang J. Hippo-Yap Signaling Maintains Sinoatrial Node Homeostasis. Circulation 2022;146:1694-1711. [PMID: 36317529 PMCID: PMC9897204 DOI: 10.1161/circulationaha.121.058777] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 12/15/2021] [Accepted: 09/20/2022] [Indexed: 11/06/2022]

Abstract

BACKGROUND

The sinoatrial node (SAN) functions as the pacemaker of the heart, initiating rhythmic heartbeats. Despite its importance, the SAN is one of the most poorly understood cardiac entities because of its small size and complex composition and function. The Hippo signaling pathway is a molecular signaling pathway fundamental to heart development and regeneration. Although abnormalities of the Hippo pathway are associated with cardiac arrhythmias in human patients, the role of this pathway in the SAN is unknown.

METHODS

We investigated key regulators of the Hippo pathway in SAN pacemaker cells by conditionally inactivating the Hippo signaling kinases Lats1 and Lats2 using the tamoxifen-inducible, cardiac conduction system-specific Cre driver Hcn4CreERT2 with Lats1 and Lats2 conditional knockout alleles. In addition, the Hippo-signaling effectors Yap and Taz were conditionally inactivated in the SAN. To determine the function of Hippo signaling in the SAN and other cardiac conduction system components, we conducted a series of physiological and molecular experiments, including telemetry ECG recording, echocardiography, Masson Trichrome staining, calcium imaging, immunostaining, RNAscope, cleavage under targets and tagmentation sequencing using antibodies against Yap1 or H3K4me3, quantitative real-time polymerase chain reaction, and Western blotting. We also performed comprehensive bioinformatics analyses of various datasets.

RESULTS

We found that Lats1/2 inactivation caused severe sinus node dysfunction. Compared with the controls, Lats1/2 conditional knockout mutants exhibited dysregulated calcium handling and increased fibrosis in the SAN, indicating that Lats1/2 function through both cell-autonomous and non-cell-autonomous mechanisms. It is notable that the Lats1/2 conditional knockout phenotype was rescued by genetic deletion of Yap and Taz in the cardiac conduction system. These rescued mice had normal sinus rhythm and reduced fibrosis of the SAN, indicating that Lats1/2 function through Yap and Taz. Cleavage Under Targets and Tagmentation sequencing data showed that Yap potentially regulates genes critical for calcium homeostasis such as Ryr2 and genes encoding paracrine factors important in intercellular communication and fibrosis induction such as Tgfb1 and Tgfb3. Consistent with this, Lats1/2 conditional knockout mutants had decreased Ryr2 expression and increased Tgfb1 and Tgfb3 expression compared with control mice.

CONCLUSIONS

We reveal, for the first time to our knowledge, that the canonical Hippo-Yap pathway plays a pivotal role in maintaining SAN homeostasis.

Affiliation(s)

Mingjie Zheng: Department of Pediatrics, McGovern Medical School, The University of Texas Health Science Center at Houston (M.Z., X.Z., S.E., W.C., Jun Wang)
Rich G Li: Texas Heart Institute, Houston (R.G.L., X.L.)
Jia Song: Department of Medicine (Section of Cardiovascular Research), Cardiovascular Research Institute, Baylor College of Medicine, Houston, TX (J.S., N.L.)
Xiaolei Zhao: Department of Pediatrics, McGovern Medical School, The University of Texas Health Science Center at Houston (M.Z., X.Z., S.E., W.C., Jun Wang)
Li Tang: Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, Hunan, China (L.T., M.L., Jianxin Wang)
Shannon Erhardt: Department of Pediatrics, McGovern Medical School, The University of Texas Health Science Center at Houston (M.Z., X.Z., S.E., W.C., Jun Wang); MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, The University of Texas, Houston (S.E., Jun Wang)
Wen Chen: Department of Pediatrics, McGovern Medical School, The University of Texas Health Science Center at Houston (M.Z., X.Z., S.E., W.C., Jun Wang)
Bao H Nguyen: Department of Molecular Physiology and Biophysics (B.H.N.)
Xiao Li: Texas Heart Institute, Houston (R.G.L., X.L.)
Min Li: Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, Hunan, China (L.T., M.L., Jianxin Wang)
Jianxin Wang: Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, Hunan, China (L.T., M.L., Jianxin Wang)
Sylvia M Evans: Skaggs School of Pharmacy and Pharmaceutical Sciences, Departments of Pharmacology and Medicine, University of California at San Diego, La Jolla (S.M.E.)
Vincent M Christoffels: Medical Biology, Amsterdam Cardiovascular Sciences, Amsterdam UMC, University of Amsterdam, The Netherlands (V.M.C.)
Na Li: Department of Medicine (Section of Cardiovascular Research), Cardiovascular Research Institute, Baylor College of Medicine, Houston, TX (J.S., N.L.)
Jun Wang: Department of Pediatrics, McGovern Medical School, The University of Texas Health Science Center at Houston (M.Z., X.Z., S.E., W.C., Jun Wang); MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, The University of Texas, Houston (S.E., Jun Wang)

Bae J, Choi YS, Cho G, Jang SJ. The Patient-Derived Cancer Organoids: Promises and Challenges as Platforms for Cancer Discovery. Cancers (Basel) 2022;14:2144. [PMID: 35565273 PMCID: PMC9105149 DOI: 10.3390/cancers14092144] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 04/04/2022] [Revised: 04/21/2022] [Accepted: 04/22/2022] [Indexed: 02/01/2023]

Perera DDBD, Perera KML, Peiris DC. A Novel In Silico Benchmarked Pipeline Capable of Complete Protein Analysis: A Possible Tool for Potential Drug Discovery. Biology (Basel) 2021;10:1113. [PMID: 34827106 PMCID: PMC8615085 DOI: 10.3390/biology10111113] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 09/17/2021] [Revised: 10/16/2021] [Accepted: 10/25/2021] [Indexed: 01/11/2023]

Abstract

Simple Summary

Protein interactions govern the majority of an organism’s biological processes. Therefore, to fully understand the functionality of an organism, we must know how proteins work at a molecular level. This study assembled a protocol that enables scientists to construct a protein’s tertiary structure easily and subsequently to investigate its mechanism and function. Each step involved in prediction, validation, and functional analysis of a protein is crucial to obtain an accurate result. We have dubbed this the trifecta analysis. It was clear early in our research that no single study in the literature had previously encompassed the complete trifecta analysis. In particular, studies that recommend free, open-source tools that have been benchmarked for each step are lacking. The present study ensures that predictions are accurate and validated and will greatly benefit new and experienced scientists alike in obtaining a strong understanding of the trifecta analysis, resulting in a domino effect that could lead to drug development.

Abstract

Current in silico proteomics require the trifecta analysis, namely, prediction, validation, and functional assessment of a modeled protein. The main drawback of this endeavor is the lack of a single protocol that utilizes a proper set of benchmarked open-source tools to predict a protein’s structure and function accurately. The present study rectifies this drawback through the design and development of such a protocol. The protocol begins with the characterization of a novel coding sequence to identify the expressed protein. It then recognizes and isolates evolutionarily conserved sequence motifs through phylogenetics. The next step is to predict the protein’s secondary structure, followed by the prediction, refinement, and validation of its three-dimensional tertiary structure. These steps enable the functional analysis of the macromolecule through protein docking, which facilitates the identification of the protein’s active site. Each of these steps is crucial for the complete characterization of the protein under study. We have dubbed this process the trifecta analysis. In this study, we have proven the effectiveness of our protocol using the cystatin C and AChE proteins. Beginning with just their sequences, we have characterized both proteins’ structures and functions, including identifying the cystatin C protein’s seven-residue active site and the AChE protein’s active-site gorge via protein–protein and protein–ligand docking, respectively. This process will greatly benefit new and experienced scientists alike in obtaining a strong understanding of the trifecta analysis, resulting in a domino effect that could expand drug development.

Yang TH, Wang CY, Tsai HC, Liu CT. Human IRES Atlas: an integrative platform for studying IRES-driven translational regulation in humans. Database (Oxford) 2021;2021:6263636. [PMID: 33942874 PMCID: PMC8094437 DOI: 10.1093/database/baab025] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Received: 11/18/2020] [Revised: 04/16/2021] [Accepted: 04/23/2021] [Indexed: 11/13/2022]

Abstract

It is now known that cap-independent translation initiation facilitated by internal ribosome entry sites (IRESs) is vital in selective cellular protein synthesis under stress and different physiological conditions. However, three problems make it hard to understand transcriptome-wide cellular IRES-mediated translation initiation mechanisms: (i) the complex interplay between IRESs and other translation initiation–related information, (ii) the limited reliability of in silico cellular IRES investigation and (iii) the labor-intensive nature of in vivo IRES identification. In this research, we constructed the Human IRES Atlas database for a comprehensive understanding of cellular IRESs in humans. First, currently available and suitable IRES prediction tools (IRESfinder, PatSearch and IRESpy) were used to obtain transcriptome-wide human IRESs. Then, we collected eight genres of translation initiation–related features to help study the potential molecular mechanisms of each of the putative IRESs. Three functional tests (conservation, structural RNA–protein scores and conditional translation efficiency) were devised to evaluate the functionality of the identified putative IRESs. Moreover, an easy-to-use interface and an IRES–translation initiation interaction map for each gene transcript were implemented to help understand the interactions between IRESs and translation initiation–related features. Researchers can easily search/browse an IRES of interest using the web interface and deduce testable mechanism hypotheses of human IRES-driven translation initiation based on the integrated results. In summary, Human IRES Atlas integrates putative IRES elements and translation initiation–related experiments for better usage of these data and deduction of mechanism hypotheses.

Database URL: http://cobishss0.im.nuk.edu.tw/Human_IRES_Atlas/

Conserved long-range base pairings are associated with pre-mRNA processing of human genes. Nat Commun 2021;12:2300. [PMID: 33863890 PMCID: PMC8052449 DOI: 10.1038/s41467-021-22549-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Received: 05/13/2020] [Accepted: 03/20/2021] [Indexed: 02/07/2023]

Crtc modulates fasting programs associated with 1-C metabolism and inhibition of insulin signaling. Proc Natl Acad Sci U S A 2021;118:2024865118. [PMID: 33723074 DOI: 10.1073/pnas.2024865118] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Indexed: 11/18/2022]

Denes CE, Newsome TP, Miranda-Saksena M, Cunningham AL, Diefenbach RJ. A putative WAVE regulatory complex (WRC) interacting receptor sequence (WIRS) in the cytoplasmic tail of HSV-1 gE does not function in WRC recruitment or neuronal transport. Access Microbiol 2021;3:000206. [PMID: 34151161 PMCID: PMC8209697 DOI: 10.1099/acmi.0.000206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/17/2020] [Accepted: 02/04/2021] [Indexed: 11/18/2022]

Millet-Boureima C, Selber-Hnatiw S, Gamberi C. Drug discovery and chemical probing in Drosophila. Genome 2020;64:147-159. [PMID: 32551911 DOI: 10.1139/gen-2020-0037] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Indexed: 12/12/2022]

Huber CD, Kim BY, Lohmueller KE. Population genetic models of GERP scores suggest pervasive turnover of constrained sites across mammalian evolution. PLoS Genet 2020;16:e1008827. [PMID: 32469868 PMCID: PMC7286533 DOI: 10.1371/journal.pgen.1008827] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Received: 01/27/2020] [Revised: 06/10/2020] [Accepted: 05/05/2020] [Indexed: 01/20/2023]

Abstract

Comparative genomic approaches have been used to identify sites where mutations are under purifying selection and of functional consequence by searching for sequences that are conserved across distantly related species. However, the performance of these approaches has not been rigorously evaluated under population genetic models. Further, short-lived functional elements may not leave a footprint of sequence conservation across many species. We use simulations to study how one measure of conservation, the Genomic Evolutionary Rate Profiling (GERP) score, relates to the strength of selection (N_e s). We show that the GERP score is related to the strength of purifying selection. However, changes in selection coefficients or functional elements over time (i.e. functional turnover) can strongly affect the GERP distribution, leading to unexpected relationships between GERP and N_e s. Further, we show that for functional elements that have a high turnover rate, adding more species to the analysis does not necessarily increase statistical power. Finally, we use the distribution of GERP scores across the human genome to compare models with and without turnover of sites where mutations are under purifying selection. We show that mutations in 4.51% of the noncoding human genome are under purifying selection and that most of this sequence has likely experienced changes in selection coefficients throughout mammalian evolution. Our work reveals limitations to using comparative genomic approaches to identify deleterious mutations. Commonly used GERP score thresholds miss over half of the noncoding sites in the human genome where mutations are under purifying selection.

One of the most significant and challenging tasks in modern genomics is to assess the functional consequences of a particular nucleotide change in a genome. A common approach to address this challenge prioritizes sequences that share similar nucleotides across distantly related species, with the rationale that mutations at such positions were deleterious and removed from the population by purifying natural selection. Our manuscript shows that one popular measure of sequence conservation, the GERP score, performs well at identifying selected mutations if mutations at a site were under selection across all of mammalian evolution. Changes in selection at a given site dramatically reduce the power of GERP to detect selected mutations in humans. We also combine population genetic models with the distribution of GERP scores at noncoding sites across the human genome to show that the degree of selection at individual sites has changed throughout mammalian evolution. Importantly, we demonstrate that at least 80 Mb of noncoding sequence under purifying selection in humans will not have extreme GERP scores and will likely be missed by modern comparative genomic approaches. Our work argues that new approaches, potentially based on genetic variation within species, will be required to identify deleterious mutations.

Ramakrishnan A, Janga SC. Human protein-RNA interaction network is highly stable across mammals. BMC Genomics 2019;20:1004. [PMID: 31888461 PMCID: PMC6936122 DOI: 10.1186/s12864-019-6330-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Indexed: 12/20/2022]

Abstract

BACKGROUND

RNA-binding proteins (RBPs) are crucial in modulating RNA metabolism in eukaryotes thereby controlling an extensive network of RBP-RNA interactions. Although previous studies on the conservation of RBP targets have been carried out in lower eukaryotes such as yeast, relatively little is known about the extent of conservation of the binding sites of RBPs across mammalian species.

RESULTS

In this study, we employ CLIP-seq datasets for 60 human RBPs and demonstrate that most binding sites for a third of these RBPs are conserved in at least 50% of the studied vertebrate species. Across the studied RBPs, binding sites were found to exhibit a median conservation of 58%, ~20% higher than random genomic locations, suggesting a significantly higher preservation of RBP-RNA interaction networks across vertebrates. RBP binding sites were highly conserved across primates, with weak conservation profiles in birds and fishes. We also note that the phylogenetic relationship between members of an RBP family does not explain the extent of conservation of their binding sites across species. Multivariate analysis to uncover features contributing to differences in the extent of conservation of binding sites across RBPs revealed RBP expression level and number of post-transcriptional targets to be the most prominent factors. Examination of the location of binding sites at the gene level confirmed that binding sites occurring in the 3' region of a gene are highly conserved across species, with 90% of the RBPs exhibiting a significantly higher conservation of binding sites in the 3' region of a gene than of those occurring in the 5' region. Gene set enrichment analysis on the extent of conservation of binding sites to identify significantly associated human phenotypes revealed an enrichment for multiple developmental abnormalities.

CONCLUSIONS

Our results suggest that binding sites of human RBPs are highly conserved across primates, with weak conservation profiles in lower vertebrates, and that the evolutionary relationship between members of an RBP family does not explain the extent of conservation of their binding sites. Expression level and number of targets of an RBP are important factors contributing to the differences in the extent of conservation of binding sites. RBP binding sites on the 3' ends of genes are the most conserved across species. Phenotypic analysis on the extent of conservation of binding sites revealed the importance of lineage-specific developmental events in post-transcriptional regulatory network evolution.

Jiang Y, Wu C, Zhang Y, Zhang S, Yu S, Lei P, Lu Q, Xi Y, Wang H, Song Z. GTX.Digest.VCF: an online NGS data interpretation system based on intelligent gene ranking and large-scale text mining. BMC Med Genomics 2019;12:193. [PMID: 31856831 PMCID: PMC6923899 DOI: 10.1186/s12920-019-0637-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 11/19/2019] [Accepted: 11/26/2019] [Indexed: 02/07/2023]

Choi H, Joe S, Nam H. Development of Tissue-Specific Age Predictors Using DNA Methylation Data. Genes (Basel) 2019;10:888. [PMID: 31690030 PMCID: PMC6896025 DOI: 10.3390/genes10110888] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Received: 09/23/2019] [Revised: 11/01/2019] [Accepted: 11/01/2019] [Indexed: 12/17/2022]

Lenzini L, Di Patti F, Livi R, Fondi M, Fani R, Mengoni A. A Method for the Structure-Based, Genome-Wide Analysis of Bacterial Intergenic Sequences Identifies Shared Compositional and Functional Features. Genes (Basel) 2019;10:834. [PMID: 31652625 PMCID: PMC6826451 DOI: 10.3390/genes10100834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/26/2019] [Revised: 10/07/2019] [Accepted: 10/16/2019] [Indexed: 11/16/2022]

Hoff K, Lemme M, Kahlert AK, Runde K, Audain E, Schuster D, Scheewe J, Attmann T, Pickardt T, Caliebe A, Siebert R, Kramer HH, Milting H, Hansen A, Ammerpohl O, Hitz MP. DNA methylation profiling allows for characterization of atrial and ventricular cardiac tissues and hiPSC-CMs. Clin Epigenetics 2019;11:89. [PMID: 31186048 PMCID: PMC6560887 DOI: 10.1186/s13148-019-0679-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 10/18/2018] [Accepted: 05/03/2019] [Indexed: 02/07/2023]

Abstract

Background

Cardiac disease modelling using human-induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CM) requires thorough insight into cardiac cell type differentiation processes. However, current methods to discriminate different cardiac cell types are mostly time-consuming, are costly and often provide imprecise phenotypic evaluation. DNA methylation plays a critical role during early heart development and cardiac cellular specification. We therefore investigated the DNA methylation pattern in different cardiac tissues to identify CpG loci for further cardiac cell type characterization.

Results

An array-based genome-wide DNA methylation analysis using Illumina Infinium HumanMethylation450 BeadChips led to the identification of 168 differentially methylated CpG loci in atrial and ventricular human heart tissue samples (n = 49) from different patients with congenital heart defects (CHD). Systematic evaluation of atrial-ventricular DNA methylation pattern in cardiac tissues in an independent sample cohort of non-failing donor hearts and cardiac patients using bisulfite pyrosequencing helped us to define a subset of 16 differentially methylated CpG loci enabling precise characterization of human atrial and ventricular cardiac tissue samples. This defined set of reproducible cardiac tissue-specific DNA methylation sites allowed us to consistently detect the cellular identity of hiPSC-CM subtypes.

Conclusion

Testing DNA methylation of only a small set of defined CpG sites thus makes it possible to distinguish atrial and ventricular cardiac tissues and cardiac atrial and ventricular subtypes of hiPSC-CMs. This method represents a rapid and reliable system for phenotypic characterization of in vitro-generated cardiomyocytes and opens new opportunities for cardiovascular research and patient-specific therapy.

Electronic supplementary material

The online version of this article (10.1186/s13148-019-0679-0) contains supplementary material, which is available to authorized users.

Affiliation(s)

Kirstin Hoff Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany
Marta Lemme DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany.,Department of Experimental Pharmacology and Toxicology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
Anne-Karin Kahlert Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany.,Institute for Clinical Genetics, Carl Gustav Carus Faculty of Medicine, Dresden, Germany
Kerstin Runde Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany
Enrique Audain Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany
Dorit Schuster Institute of Human Genetics, Christian-Albrechts-University Kiel & University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany
Jens Scheewe Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany
Tim Attmann Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany
Thomas Pickardt National Register for Congenital Heart Defects, DZHK (German Centre for Cardiovascular Research), Berlin, Germany.,Competence Network for Congenital Heart Defects, DZHK (German Centre for Cardiovascular Research), Berlin, Germany
Almuth Caliebe Institute of Human Genetics, Christian-Albrechts-University Kiel & University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany
Reiner Siebert Institute of Human Genetics, University Hospital Ulm, Ulm, Germany
Hans-Heiner Kramer Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany
Hendrik Milting Erich and Hanna Klessmann Institute for Cardiovascular Research & Development (EHKI), Heart and Diabetes Center NRW, Ruhr University Bochum, Bad Oeynhausen, Germany
Arne Hansen DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany.,Department of Experimental Pharmacology and Toxicology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
Ole Ammerpohl Institute of Human Genetics, University Hospital Ulm, Ulm, Germany
Marc-Phillip Hitz Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany.,Institute of Human Genetics, Christian-Albrechts-University Kiel & University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany.,Wellcome Trust Sanger Institute, Cambridge, UK.

Chen Z, Omori Y, Koren S, Shirokiya T, Kuroda T, Miyamoto A, Wada H, Fujiyama A, Toyoda A, Zhang S, Wolfsberg TG, Kawakami K, Phillippy AM, NISC Comparative Sequencing Program, Mullikin JC, Burgess SM. De novo assembly of the goldfish (Carassius auratus) genome and the evolution of genes after whole-genome duplication. SCIENCE ADVANCES 2019;5:eaav0547. [PMID: 31249862 PMCID: PMC6594761 DOI: 10.1126/sciadv.aav0547] [Citation(s) in RCA: 123] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Accepted: 05/21/2019] [Indexed: 05/20/2023]

Affiliation(s)

Zelin Chen Translational and Functional Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
Yoshihiro Omori Laboratory for Molecular and Developmental Biology, Institute for Protein Research, Osaka University, Suita, Osaka, Japan
Sergey Koren Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
Takuya Shirokiya Yatomi Station, Aichi Fisheries Research Institute, Yatomi, Aichi, Japan
Takuo Kuroda Yatomi Station, Aichi Fisheries Research Institute, Yatomi, Aichi, Japan
Atsushi Miyamoto Yatomi Station, Aichi Fisheries Research Institute, Yatomi, Aichi, Japan
Hironori Wada Laboratory of Molecular and Developmental Biology, National Institute of Genetics, and Department of Genetics, SOKENDAI (The Graduate University for Advanced Studies), Mishima, Shizuoka, Japan
Asao Fujiyama Advanced Genomics Center, National Institute of Genetics, Mishima, Shizuoka, Japan
Atsushi Toyoda Advanced Genomics Center, National Institute of Genetics, Mishima, Shizuoka, Japan Center for Information Biology, National Institute of Genetics, Mishima, Shizuoka, Japan
Suiyuan Zhang Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
Tyra G. Wolfsberg Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
Koichi Kawakami Laboratory of Molecular and Developmental Biology, National Institute of Genetics, and Department of Genetics, SOKENDAI (The Graduate University for Advanced Studies), Mishima, Shizuoka, Japan
Adam M. Phillippy Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
NISC Comparative Sequencing Program NIH Intramural Sequencing Center, National Human Genome Research Institute, Bethesda, MD, USA
James C. Mullikin NIH Intramural Sequencing Center, National Human Genome Research Institute, Bethesda, MD, USA Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
Shawn M. Burgess Translational and Functional Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA Corresponding author.

Savel D, Koyutürk M. Characterizing human genomic coevolution in locus-gene regulatory interactions. BioData Min 2019;12:8. [PMID: 30923571 PMCID: PMC6419833 DOI: 10.1186/s13040-019-0195-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Accepted: 02/19/2019] [Indexed: 11/10/2022] Open

You Z, Zhang Q, Liu C, Song J, Yang N, Lian L. Integrated analysis of lncRNA and mRNA repertoires in Marek's disease infected spleens identifies genes relevant to resistance. BMC Genomics 2019;20:245. [PMID: 30922224 PMCID: PMC6438004 DOI: 10.1186/s12864-019-5625-1] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2018] [Accepted: 03/20/2019] [Indexed: 11/23/2022] Open

Abstract

BACKGROUND

Marek's disease virus (MDV) is an oncogenic herpesvirus that can cause T-cell lymphomas in chicken. Long noncoding RNA (lncRNA) is strongly associated with various cancers and many other diseases. In chickens, lncRNAs have not been comprehensively identified. Here, we profiled mRNA and lncRNA repertoires in three groups of spleens from MDV-infected and non-infected chickens, including seven tumorous spleens (TS) from MDV-infected chickens, five spleens from the survivors (SS) without lesions after MDV infection, and five spleens from noninfected chickens (NS), to explore the underlying mechanism of host resistance in Marek's disease (MD).

RESULTS

By using a precise lncRNA identification pipeline, we identified 1315 putative lncRNAs and 1166 known lncRNAs in spleen tissue. Genomic features of putative lncRNAs were characterized. Differentially expressed (DE) mRNAs, putative lncRNAs, and known lncRNAs were profiled among three groups. We found that several specific intergroup differentially expressed genes were involved in important biological processes and pathways, including B cell activation and the Wnt signaling pathway; some of these genes were also found to be the hub genes in the co-expression network analyzed by WGCNA. Network analysis depicted both intergenic correlation and correlation between genes and MD traits. Five DE lncRNAs including MSTRG.360.1, MSTRG.6725.1, MSTRG.6754.1, MSTRG.15539.1, and MSTRG.7747.5 strongly correlated with MD-resistant candidate genes, such as IGF-I, CTLA4, HDAC9, SWAP70, CD72, JCHAIN, CXCL12, and CD8B, suggesting that lncRNAs may affect MD resistance and tumorigenesis in chicken spleens through their target genes.

CONCLUSIONS

Our results provide both transcriptomic and epigenetic insights on MD resistance and its pathological mechanism. The comprehensive lncRNA and mRNA transcriptomes in MDV-infected chicken spleens were profiled. Co-expression analysis identified integrated lncRNA-mRNA and gene-gene interaction networks, implying that hub genes or lncRNAs exert critical influence on MD resistance and tumorigenesis.

Dewey CN. Whole-Genome Alignment. Methods Mol Biol 2019;1910:121-147. [PMID: 31278663 DOI: 10.1007/978-1-4939-9074-0_4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Hofstetter S, Seefried F, Häfliger IM, Jagannathan V, Leeb T, Drögemüller C. A non-coding regulatory variant in the 5'-region of the MITF gene is associated with white-spotted coat in Brown Swiss cattle. Anim Genet 2018;50:27-32. [PMID: 30506810 DOI: 10.1111/age.12751] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/19/2018] [Indexed: 01/29/2023]

Song H, Lin K, Hu J, Pang E. An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome. FRONTIERS IN PLANT SCIENCE 2018;9:325. [PMID: 29599790 PMCID: PMC5863696 DOI: 10.3389/fpls.2018.00325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Accepted: 02/27/2018] [Indexed: 06/08/2023]

Abstract

Background: Although the cucumber reference genome and its annotation were published several years ago, the functional annotation of predicted genes, particularly protein-coding genes, still requires further improvement. In general, accurately determining orthologous relationships between genes allows for better and more robust functional assignments of predicted genes. As one of the most reliable strategies, the determination of collinearity information may facilitate reliable orthology inferences among genes from multiple related genomes. Currently, the identification of collinear segments has mainly been based on conservation of gene order and orientation. Over the course of plant genome evolution, various evolutionary events have disrupted or distorted the order of genes along chromosomes, making it difficult to use those genes as genome-wide markers for plant genome comparisons. Results: Using the localized LASTZ/MULTIZ analysis pipeline, we aligned 15 genomes, including cucumber and other related angiosperm plants, and identified a set of genomic segments that are short in length, stable in structure, uniform in distribution and highly conserved across all 15 plants. Compared with protein-coding genes, these conserved segments were more suitable for use as genomic markers for detecting collinear segments among distantly divergent plants. Guided by this set of identified collinear genomic segments, we inferred 94,486 orthologous protein-coding gene pairs (OPPs) between cucumber and 14 other angiosperm species, which were used as proxies for transferring functional terms to cucumber genes from the annotations of the other 14 genomes. In total, 10,885 protein-coding genes were assigned Gene Ontology (GO) terms, nearly 1,300 more than in the results collected in the Uniprot-proteomic database. Our results showed that annotation accuracy would be improved compared with other existing approaches.
Conclusions: In this study, we provided an alternative resource for the functional annotation of predicted cucumber protein-coding genes, which we expect will be beneficial for biological studies of cucumber, accessible from http://cmb.bnu.edu.cn/functional_annotation. Meanwhile, using the cucumber reference genome as a case study, we presented an efficient strategy for transferring gene functional information from previously well-characterized protein-coding genes in model species to newly sequenced or "non-model" plant species.

Chen CK, Yu CP, Li SC, Wu SM, Lu MYJ, Chen YH, Chen DR, Ng CS, Ting CT, Li WH. Identification and evolutionary analysis of long non-coding RNAs in zebra finch. BMC Genomics 2017;18:117. [PMID: 28143393 PMCID: PMC5282891 DOI: 10.1186/s12864-017-3506-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2016] [Accepted: 01/14/2017] [Indexed: 02/06/2023] Open

Abstract

Background

Long non-coding RNAs (lncRNAs) are important in various biological processes, but very few studies on lncRNA have been conducted in birds. To identify lncRNAs expressed during feather development, we analyzed single-stranded RNA-seq (ssRNA-seq) data from the anterior and posterior dorsal regions during zebra finch (Taeniopygia guttata) embryonic development. Using published transcriptomic data, we further analyzed the evolutionary conservation of lncRNAs in birds and amniotes.

Results

A total of 1,081 lncRNAs, including 965 intergenic lncRNAs (lincRNAs), 59 intronic lncRNAs, and 57 antisense lncRNAs (lncNATs), were identified using our newly developed pipeline. These avian lncRNAs share similar characteristics with lncRNAs in mammals, such as shorter transcript length, lower exon number, lower average expression level and less sequence conservation than mRNAs. However, the proportion of lncRNAs overlapping with transposable elements in birds is much lower than that in mammals. We predicted the functions of lncRNAs based on the enriched functions of co-expressed protein-coding genes. Clusters of lncRNAs associated with natal down development were identified. The sequences and expression levels of candidate lncRNAs that shared conserved sequences among birds were validated by qPCR in both zebra finch and chicken. Finally, we identified three highly conserved lncRNAs that may be associated with natal down development.

Conclusions

Our study provides the first systematic identification of avian lncRNAs using ssRNA-seq analysis and offers a resource of embryonically expressed lncRNAs in zebra finch. We also predicted the biological function of identified lncRNAs.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-017-3506-z) contains supplementary material, which is available to authorized users.

Clustering and evolutionary analysis of small RNAs identify regulatory siRNA clusters induced under drought stress in rice. BMC SYSTEMS BIOLOGY 2016;10:115. [PMID: 28155667 PMCID: PMC5260113 DOI: 10.1186/s12918-016-0355-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Abstract

Motivation

Drought tolerance is an important trait related to growth and yield in crops. Until now, drought-related research has focused on coding genes. However, non-coding RNAs also respond significantly to environmental stimuli such as drought stress. Unfortunately, characterizing the role of siRNAs under drought stress is difficult since a large number of heterogeneous siRNA species are expressed under drought stress and non-coding RNAs have very weak evolutionary conservation. Thus, to characterize the role of siRNAs, we need a well-designed biological and bioinformatics strategy. In this paper, to characterize the function of siRNAs, we developed and used a bioinformatics pipeline that includes a genomic-location based clustering technique and an evolutionary conservation tool.

Results

By comparing the wild type Nipponbare and two drought resistant rice varieties, we found that 21 nt and 24 nt siRNAs are significantly expressed in the three rice plants but at different time points under a short-term (0, 1, and 6 hrs) drought treatment. siRNAs were up-regulated in the wild type at an early stage while the up-regulation was delayed in the two drought tolerant plants. Genes targeted by up-regulated siRNAs were related to oxidation reduction and proteolysis, which are well known to be associated with water deficit phenotypes. More interestingly, we found that siRNAs were located in intronic regions as clusters and were of high evolutionary conservation among monocot grass plants. In summary, we show that siRNAs are important respondents to drought stress and regulate genes related to drought tolerance under water deficit conditions.

Electronic supplementary material

The online version of this article (doi:10.1186/s12918-016-0355-3) contains supplementary material, which is available to authorized users.

Hoffmann TJ, Keats BJ, Yoshikawa N, Schaefer C, Risch N, Lustig LR. A Large Genome-Wide Association Study of Age-Related Hearing Impairment Using Electronic Health Records. PLoS Genet 2016;12:e1006371. [PMID: 27764096 PMCID: PMC5072625 DOI: 10.1371/journal.pgen.1006371] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2016] [Accepted: 09/16/2016] [Indexed: 01/22/2023] Open

Abstract

Age-related hearing impairment (ARHI), one of the most common sensory disorders, can be mitigated, but not cured or eliminated. To identify genetic influences underlying ARHI, we conducted a genome-wide association study of ARHI in 6,527 cases and 45,882 controls among the non-Hispanic whites from the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. We identified two novel genome-wide significant SNPs: rs4932196 (odds ratio = 1.185, p = 4.0x10^-11), 52Kb 3’ of ISG20, which replicated in a meta-analysis of the other GERA race/ethnicity groups (1,025 cases, 12,388 controls, p = 0.00094) and in a UK Biobank case-control analysis (30,802 self-reported cases, 78,586 controls, p = 0.015); and rs58389158 (odds ratio = 1.132, p = 1.8x10^-9), which replicated in the UK Biobank (p = 0.00021). The latter SNP lies just outside exon 8 and is highly correlated (r² = 0.96) with the missense SNP rs5756795 in exon 7 of TRIOBP, a gene previously associated with prelingual nonsyndromic hearing loss. We further tested these SNPs in phenotypes from audiologist notes available on a subset of GERA (4,903 individuals), stratified by case/control status, to construct an independent replication test, and found a significant effect of rs58389158 on speech reception threshold (SRT; overall GERA meta-analysis p = 1.9x10^-6). We also tested variants within exons of 132 other previously-identified hearing loss genes, and identified two common additional significant SNPs: rs2877561 (synonymous change in ILDR1, p = 6.2x10^-5), which replicated in the UK Biobank (p = 0.00057), and had a significant GERA SRT (p = 0.00019) and speech discrimination score (SDS; p = 0.0019); and rs9493627 (missense change in EYA4, p = 0.00011) which replicated in the UK Biobank (p = 0.0095), other GERA groups (p = 0.0080), and had a consistent significant result for SRT (p = 0.041) and suggestive result for SDS (p = 0.081). Large cohorts with GWAS data and electronic health records may be a useful method to characterize the genetic architecture of ARHI.

Age-related hearing impairment (ARHI) is one of the most common sensory disorders. While ARHI effects can be mitigated with current technologies, it cannot be cured or eliminated. It is thus hoped that identification of genetic influences on ARHI may one day lead to curative therapies. Towards this goal, the current study utilized electronic health record data from non-Hispanic whites in the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort to conduct a genome-wide association study of ARHI, and tested the significant variants for replication in other GERA race/ethnicity groups, independent GERA phenotypes, and self-reported ARHI from the UK Biobank. We discovered two genome-wide significant SNPs. The first was novel and near ISG20. The second was in TRIOBP, a gene previously associated with prelingual nonsyndromic hearing loss. Motivated by our TRIOBP results, we also looked at exons in known hearing loss genes, and identified two additional SNPs, rs2877561 in ILDR1 and rs9493672 in EYA4 (at a significance threshold adjusted for number of SNPs in those regions). These results suggest that large cohorts with GWAS data and electronic health records may be a useful method to characterize the genetic architecture of ARHI.

Binet M, Gascuel O, Scornavacca C, Douzery EJP, Pardi F. Fast and accurate branch lengths estimation for phylogenomic trees. BMC Bioinformatics 2016;17:23. [PMID: 26744021 PMCID: PMC4705742 DOI: 10.1186/s12859-015-0821-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Accepted: 11/02/2015] [Indexed: 01/26/2023] Open

Abstract

Background

Branch lengths are an important attribute of phylogenetic trees, providing essential information for many studies in evolutionary biology. Yet, part of the current methodology to reconstruct a phylogeny from genomic information — namely supertree methods — focuses on the topology or structure of the phylogenetic tree, rather than the evolutionary divergences associated to it. Moreover, accurate methods to estimate branch lengths — typically based on probabilistic analysis of a concatenated alignment — are limited by large demands in memory and computing time, and may become impractical when the data sets are too large.

Results

Here, we present a novel phylogenomic distance-based method, named ERaBLE (Evolutionary Rates and Branch Length Estimation), to estimate the branch lengths of a given reference topology, and the relative evolutionary rates of the genes employed in the analysis. ERaBLE uses as input data a potentially very large collection of distance matrices, where each matrix is obtained from a different genomic region — either directly from its sequence alignment, or indirectly from a gene tree inferred from the alignment. Our experiments show that ERaBLE is very fast and fairly accurate when compared to other possible approaches for the same tasks. Specifically, it efficiently and accurately deals with large data sets, such as the OrthoMaM v8 database, composed of 6,953 exons from up to 40 mammals.

Conclusions

ERaBLE may be used as a complement to supertree methods — or it may provide an efficient alternative to maximum likelihood analysis of concatenated alignments — to estimate branch lengths from phylogenomic data sets.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0821-8) contains supplementary material, which is available to authorized users.

Bahamonde MI, Serra SA, Drechsel O, Rahman R, Marcé-Grau A, Prieto M, Ossowski S, Macaya A, Fernández-Fernández JM. A Single Amino Acid Deletion (ΔF1502) in the S6 Segment of CaV2.1 Domain III Associated with Congenital Ataxia Increases Channel Activity and Promotes Ca2+ Influx. PLoS One 2015;10:e0146035. [PMID: 26716990 PMCID: PMC4696675 DOI: 10.1371/journal.pone.0146035] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2015] [Accepted: 12/11/2015] [Indexed: 02/07/2023] Open

Abstract

Mutations in the CACNA1A gene, encoding the pore-forming CaV2.1 (P/Q-type) channel α1A subunit, result in heterogeneous human neurological disorders, including familial and sporadic hemiplegic migraine along with episodic and progressive forms of ataxia. Hemiplegic Migraine (HM) mutations induce gain-of-channel function, mainly by shifting channel activation to lower voltages, whereas ataxia mutations mostly produce loss-of-channel function. However, some HM-linked gain-of-function mutations are also associated with congenital ataxia and/or cerebellar atrophy, including the deletion of a highly conserved phenylalanine located at the S6 pore region of α1A domain III (ΔF1502). Functional studies of ΔF1502 CaV2.1 channels, expressed in Xenopus oocytes, using the non-physiological Ba²⁺ as the charge carrier have only revealed discrete alterations in channel function of unclear pathophysiological relevance. Here, we report a second case of congenital ataxia linked to the ΔF1502 α1A mutation, detected by whole-exome sequencing, and analyze its functional consequences on CaV2.1 human channels heterologously expressed in mammalian tsA-201 HEK cells, using the physiological permeant ion Ca²⁺. ΔF1502 strongly decreases the voltage threshold for channel activation (by ~ 21 mV), allowing significantly higher Ca²⁺ current densities in a range of depolarized voltages with physiological relevance in neurons, even though maximal Ca²⁺ current density through ΔF1502 CaV2.1 channels is 60% lower than through wild-type channels. ΔF1502 accelerates activation kinetics and slows deactivation kinetics of CaV2.1 within a wide range of voltage depolarization. ΔF1502 also slowed CaV2.1 inactivation kinetics and shifted the inactivation curve to hyperpolarized potentials (by ~ 28 mV). ΔF1502 effects on CaV2.1 activation and deactivation properties seem to be of high physiological relevance. Thus, ΔF1502 strongly promotes Ca²⁺ influx in response to either single or trains of action potential-like waveforms of different durations. Our observations support a causative role of gain-of-function CaV2.1 mutations in congenital ataxia, a neurodevelopmental disorder at the most severe end of the CACNA1A-associated phenotypic spectrum.

Dillman AR, Macchietto M, Porter CF, Rogers A, Williams B, Antoshechkin I, Lee MM, Goodwin Z, Lu X, Lewis EE, Goodrich-Blair H, Stock SP, Adams BJ, Sternberg PW, Mortazavi A. Comparative genomics of Steinernema reveals deeply conserved gene regulatory networks. Genome Biol 2015;16:200. [PMID: 26392177 PMCID: PMC4578762 DOI: 10.1186/s13059-015-0746-6] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Accepted: 08/10/2015] [Indexed: 12/21/2022] Open

Thompson D, Regev A, Roy S. Comparative analysis of gene regulatory networks: from network reconstruction to evolution. Annu Rev Cell Dev Biol 2015;31:399-428. [PMID: 26355593 DOI: 10.1146/annurev-cellbio-100913-012908] [Citation(s) in RCA: 95] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Trends in genome dynamics among major orders of insects revealed through variations in protein families. BMC Genomics 2015;16:583. [PMID: 26251035 PMCID: PMC4528696 DOI: 10.1186/s12864-015-1771-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2014] [Accepted: 07/13/2015] [Indexed: 01/22/2023] Open

Abstract

Background

Insects belong to a class that accounts for the majority of animals on earth. With over one million identified species, insects display a huge diversity and occupy extreme environments. At present, there are dozens of fully sequenced insect genomes that cover a range of habitats, social behaviors and morphologies. In view of such a diverse collection of genomes, revealing evolutionary trends and charting functional relationships of proteins remain challenging.

Results

We analyzed the relatedness of 17 complete proteomes representative of proteomes from insects including louse, bee, beetle, ants, flies and mosquitoes, as well as an out-group from the crustaceans. The analyzed proteomes mostly represented the orders of Hymenoptera and Diptera. The 287,405 protein sequences from the 18 proteomes were automatically clustered into 20,933 families, including 799 singletons. A comprehensive analysis based on statistical considerations identified the families that were significantly expanded or reduced in any of the studied organisms. Among all the tested species, ants are characterized by an exceptionally high rate of family gain and loss. By assigning annotations to hundreds of species-specific families, the functional diversity among species and between the major clades (Diptera and Hymenoptera) is revealed. We found that many species-specific families are associated with receptor signaling, stress-related functions and proteases. The highest variability among insects associates with the function of transposition and nucleic acids processes (collectively coined TNAP). Specifically, the wasp and ants have an order of magnitude more TNAP families and proteins relative to species that belong to Diptera (mosquitoes and flies).

Conclusions

An unsupervised clustering methodology combined with a comparative functional analysis unveiled proteomic signatures in the major clades of winged insects. We propose that the expansion of TNAP families in Hymenoptera potentially contributes to the accelerated genome dynamics that characterize the wasp and ants.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1771-2) contains supplementary material, which is available to authorized users.

Chatagnon A, Veber P, Morin V, Bedo J, Triqueneaux G, Sémon M, Laudet V, d'Alché-Buc F, Benoit G. RAR/RXR binding dynamics distinguish pluripotency from differentiation associated cis-regulatory elements. Nucleic Acids Res 2015;43:4833-54. [PMID: 25897113 PMCID: PMC4446430 DOI: 10.1093/nar/gkv370] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Revised: 03/09/2015] [Accepted: 04/08/2015] [Indexed: 12/15/2022] Open

Tetreault M, Bareke E, Nadaf J, Alirezaie N, Majewski J. Whole-exome sequencing as a diagnostic tool: current challenges and future opportunities. Expert Rev Mol Diagn 2015;15:749-60. [PMID: 25959410 DOI: 10.1586/14737159.2015.1039516] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Naval-Sánchez M, Potier D, Hulselmans G, Christiaens V, Aerts S. Identification of Lineage-Specific Cis-Regulatory Modules Associated with Variation in Transcription Factor Binding and Chromatin Activity Using Ornstein-Uhlenbeck Models. Mol Biol Evol 2015;32:2441-55. [PMID: 25944915 PMCID: PMC4540964 DOI: 10.1093/molbev/msv107] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Indexed: 02/06/2023] Open

Gulko B, Hubisz MJ, Gronau I, Siepel A. A method for calculating probabilities of fitness consequences for point mutations across the human genome. Nat Genet 2015;47:276-83. [PMID: 25599402 PMCID: PMC4342276 DOI: 10.1038/ng.3196] [Citation(s) in RCA: 182] [Impact Index Per Article: 18.2] [Received: 08/13/2014] [Accepted: 12/19/2014] [Indexed: 12/17/2022]

Taher L, Narlikar L, Ovcharenko I. Identification and computational analysis of gene regulatory elements. Cold Spring Harb Protoc 2015;2015:pdb.top083642. [PMID: 25561628 PMCID: PMC5885252 DOI: 10.1101/pdb.top083642] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Indexed: 06/04/2023]

Modolo L, Picard F, Lerat E. A new genome-wide method to track horizontally transferred sequences: application to Drosophila. Genome Biol Evol 2015;6:416-32. [PMID: 24497602 PMCID: PMC3942030 DOI: 10.1093/gbe/evu026] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Indexed: 12/19/2022] Open

Lim E, Liu Y, Chan Y, Tiinamaija T, Käräjämäki A, Madsen E, Altshuler D, Raychaudhuri S, Groop L, Flannick J, Hirschhorn J, Katsanis N, Daly M, Daly MJ. A novel test for recessive contributions to complex diseases implicates Bardet-Biedl syndrome gene BBS10 in idiopathic type 2 diabetes and obesity. Am J Hum Genet 2014;95:509-20. [PMID: 25439097 DOI: 10.1016/j.ajhg.2014.09.015] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Received: 07/10/2014] [Accepted: 09/22/2014] [Indexed: 12/22/2022] Open

Yokoyama KD, Zhang Y, Ma J. Tracing the evolution of lineage-specific transcription factor binding sites in a birth-death framework. PLoS Comput Biol 2014;10:e1003771. [PMID: 25144359 PMCID: PMC4140645 DOI: 10.1371/journal.pcbi.1003771] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Received: 04/23/2014] [Accepted: 06/27/2014] [Indexed: 11/24/2022] Open

Abstract

Changes in cis-regulatory element composition that result in novel patterns of gene expression are thought to be a major contributor to the evolution of lineage-specific traits. Although transcription factor binding events show substantial variation across species, most computational approaches to studying regulatory elements focus primarily on highly conserved sites and rely heavily on multiple sequence alignments. However, sequence-conservation-based approaches have limited ability to detect lineage-specific elements that could contribute to species-specific traits. In this paper, we describe a novel framework that uses a birth-death model to trace the evolution of lineage-specific binding sites without relying on detailed base-by-base cross-species alignments. Our model was applied to analyze the evolution of binding sites, based on ChIP-seq data for six transcription factors (GATA1, SOX2, CTCF, MYC, MAX, ETS1), along the lineage leading to human from the human-mouse common ancestor. We estimate that a substantial fraction of binding sites in human (∼58–79% for each factor) originated after the divergence from mouse. Over 15% of all binding sites are unique to hominids. Such elements are often enriched near genes associated with specific pathways, and harbor more common SNPs than older binding sites in the human genome. These results support the ability of our method to identify lineage-specific regulatory elements and help understand their roles in shaping variation in gene regulation across species.

Recent experimental studies showed that the evolution of transcription factor binding sites (TFBS) is highly dynamic, with sites differing a great deal even between closely related mammalian species. Despite the substantial experimental evidence for rapid divergence of regulatory protein-binding events across species, computational methods designed to analyze the evolution of regulatory elements have focused primarily on phylogenetic footprinting approaches, in which putative functional regulatory elements are identified according to strong sequence conservation. Cross-species comparisons of non-coding sequences are limited in their ability to capture the evolution of regulatory sequences, particularly in cases where the elements are selected for novelty or are species-specific. We have developed a novel framework to reconstruct the history of lineage-specific TFBS and showed that a large number of TFBS in human arose after the human-mouse divergence. These elements also have distinct biological implications compared with more ancient ones. This method can help understand the roles of lineage-specific TFBS in shaping gene regulation across different species.

Macossay-Castillo M, Kosol S, Tompa P, Pancsa R. Synonymous constraint elements show a tendency to encode intrinsically disordered protein segments. PLoS Comput Biol 2014;10:e1003607. [PMID: 24809503 PMCID: PMC4014394 DOI: 10.1371/journal.pcbi.1003607] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Received: 01/14/2014] [Accepted: 03/17/2014] [Indexed: 01/22/2023] Open

Genome-wide analysis of promoters: clustering by alignment and analysis of regular patterns. PLoS One 2014;9:e85260. [PMID: 24465517 PMCID: PMC3898993 DOI: 10.1371/journal.pone.0085260] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Received: 07/19/2013] [Accepted: 11/26/2013] [Indexed: 01/08/2023] Open

Eren AM, Maignien L, Sul WJ, Murphy LG, Grim SL, Morrison HG, Sogin ML. Oligotyping: Differentiating between closely related microbial taxa using 16S rRNA gene data. Methods Ecol Evol 2013;4. [PMID: 24358444 PMCID: PMC3864673 DOI: 10.1111/2041-210x.12114] [Citation(s) in RCA: 444] [Impact Index Per Article: 37.0] [Indexed: 11/27/2022]

Haudry A, Platts AE, Vello E, Hoen DR, Leclercq M, Williamson RJ, Forczek E, Joly-Lopez Z, Steffen JG, Hazzouri KM, Dewar K, Stinchcombe JR, Schoen DJ, Wang X, Schmutz J, Town CD, Edger PP, Pires JC, Schumaker KS, Jarvis DE, Mandáková T, Lysak MA, van den Bergh E, Schranz ME, Harrison PM, Moses AM, Bureau TE, Wright SI, Blanchette M. An atlas of over 90,000 conserved noncoding sequences provides insight into crucifer regulatory regions. Nat Genet 2013;45:891-8. [PMID: 23817568 DOI: 10.1038/ng.2684] [Citation(s) in RCA: 227] [Impact Index Per Article: 18.9] [Received: 10/13/2012] [Accepted: 06/04/2013] [Indexed: 12/17/2022]

Effect of genetic regions on the correlation between single point mutation variability and morbidity. Comput Biol Med 2013;43:594-9. [DOI: 10.1016/j.compbiomed.2013.01.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/09/2011] [Revised: 07/27/2012] [Accepted: 01/19/2013] [Indexed: 11/19/2022]

Oct4 switches partnering from Sox2 to Sox17 to reinterpret the enhancer code and specify endoderm. EMBO J 2013;32:938-53. [PMID: 23474895 DOI: 10.1038/emboj.2013.31] [Citation(s) in RCA: 146] [Impact Index Per Article: 12.2] [Received: 11/16/2012] [Accepted: 01/24/2013] [Indexed: 01/04/2023] Open