1
Jin J, Zhan Z, Wei X, Pan Z, Zhao Y, Yu D, Zhang F. Genomic insights into the chromosomal elongation in a family of Collembola. Proc Biol Sci 2024; 291:20232937. [PMID: 38471545 DOI: 10.1098/rspb.2023.2937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/02/2024] [Accepted: 02/12/2024] [Indexed: 03/14/2024] Open
Abstract
Collembola is a highly diverse and abundant group of soil arthropods with chromosome numbers ranging from 5 to 11. Previous karyotype studies indicated that the Tomoceridae family possesses an exceptionally long chromosome. To better understand chromosome size evolution in Collembola, we obtained a chromosome-level genome of Yoshiicerus persimilis with a size of 334.44 Mb and BUSCO completeness of 97.0% (n = 1013). Both genomes of Y. persimilis and Tomocerus qinae (recently published) have an exceptionally large chromosome (ElChr greater than 100 Mb), accounting for nearly one-third of the genome. Comparative genomic analyses suggest that chromosomal elongation occurred independently in the two species approximately 10 million years ago, rather than in the ancestor of the Tomoceridae family. The ElChr elongation was caused by large tandem and segmental duplications, as well as transposon proliferation, with genes in these regions experiencing weaker purifying selection (higher dN/dS) than conserved regions. Moreover, inter-genomic synteny analyses indicated that chromosomal fission/fusion events played a crucial role in the evolution of chromosome numbers (ranging from 5 to 7) within Entomobryomorpha. This study provides a valuable resource for investigating the chromosome evolution of Collembola.
Affiliation(s)
- Jianfeng Jin
  - Department of Entomology, College of Plant Protection, Nanjing Agricultural University, Nanjing 210095, People's Republic of China
- Zhihong Zhan
  - Department of Entomology, College of Plant Protection, Nanjing Agricultural University, Nanjing 210095, People's Republic of China
- Xiping Wei
  - Department of Entomology, College of Plant Protection, Nanjing Agricultural University, Nanjing 210095, People's Republic of China
- Zhixiang Pan
  - School of Life Sciences, Taizhou University, Taizhou 318000, People's Republic of China
- Yuxin Zhao
  - Department of Entomology, College of Plant Protection, Nanjing Agricultural University, Nanjing 210095, People's Republic of China
- Daoyuan Yu
  - Soil Ecology Lab, College of Resources and Environmental Sciences, Nanjing Agricultural University, Nanjing 210095, People's Republic of China
- Feng Zhang
  - Department of Entomology, College of Plant Protection, Nanjing Agricultural University, Nanjing 210095, People's Republic of China
2
Xu Y, Zhang Q, Wang Y, Zhou R, Ji X, Meng L, Luo C, Liu A, Jiao J, Chen H, Zeng H, Hu P, Xu Z. Optical Genome Mapping for Chromosomal Aberrations Detection-False-Negative Results and Contributing Factors. Diagnostics (Basel) 2024; 14:165. [PMID: 38248042 PMCID: PMC10814618 DOI: 10.3390/diagnostics14020165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2023] [Revised: 01/09/2024] [Accepted: 01/09/2024] [Indexed: 01/23/2024] Open
Abstract
Optical genome mapping (OGM) is known as an all-in-one technology for chromosomal aberration detection. However, some aberrations lie beyond the detection range of OGM. This study aimed to report the aberrations missed by OGM and analyze the contributing factors. OGM was performed using both GRCh37 and GRCh38 as reference genomes. The OGM results were analyzed in a blinded fashion and compared to standard assays. Quality control (QC) metrics, sample types, reference genome, effective coverage, and the classes and locations of aberrations were then analyzed. In total, 154 clinically reported variations from 123 samples were investigated. OGM failed to detect 10 (6.5%, 10/154) aberrations with the GRCh37 assembly, including five copy number variations (CNVs), two submicroscopic balanced translocations, two pericentric inversions and one isochromosome (mosaicism). All the samples passed pre-analytical and analytical QC. With the GRCh38 assembly, the false-negative rate of OGM fell to 4.5% (7/154). The breakpoints of the CNVs, balanced translocations and inversions undetected by OGM were located in segmental duplication (SD) regions or regions with no DLE-1 label. In conclusion, besides variations with centromeric breakpoints, structural variations (SVs) with breakpoints located in large repetitive sequences may also be missed by OGM. GRCh38 is recommended as the reference genome when OGM is performed. Our results highlight the necessity of fully understanding the detection range and limitations of OGM in clinical practice.
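The false-negative rates quoted in this abstract can be re-derived from the counts it reports. The following is a minimal sketch of that arithmetic only; the variable and function names are illustrative assumptions and do not come from the article.

```python
from fractions import Fraction

# Counts taken from the abstract above.
TOTAL_VARIANTS = 154                      # clinically reported variations
MISSED = {"GRCh37": 10, "GRCh38": 7}      # aberrations OGM failed to detect

# Breakdown of the 10 GRCh37 misses by aberration class, as listed in the abstract.
GRCH37_BREAKDOWN = {
    "copy number variation": 5,
    "submicroscopic balanced translocation": 2,
    "pericentric inversion": 2,
    "isochromosome (mosaicism)": 1,
}


def false_negative_rate(missed: int, total: int) -> Fraction:
    """Return the share of reported variations that were not detected."""
    return Fraction(missed, total)


# Hypothetical helper dictionary: reference assembly -> rate as an exact fraction.
rates = {ref: false_negative_rate(n, TOTAL_VARIANTS) for ref, n in MISSED.items()}

for ref, rate in rates.items():
    print(f"{ref}: {float(rate) * 100:.1f}% of {TOTAL_VARIANTS} variations missed")
```

Using exact fractions keeps the comparison between assemblies free of rounding until the final percentage is printed.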
Affiliation(s)
- Ping Hu
  - Department of Prenatal Diagnosis, Women’s Hospital of Nanjing Medical University, Nanjing Women and Children’s Health Care Hospital, Nanjing 210004, China; (Y.X.); (Q.Z.); (Y.W.); (R.Z.); (X.J.); (L.M.); (C.L.); (A.L.); (J.J.); (H.C.); (H.Z.)
- Zhengfeng Xu
  - Department of Prenatal Diagnosis, Women’s Hospital of Nanjing Medical University, Nanjing Women and Children’s Health Care Hospital, Nanjing 210004, China; (Y.X.); (Q.Z.); (Y.W.); (R.Z.); (X.J.); (L.M.); (C.L.); (A.L.); (J.J.); (H.C.); (H.Z.)
3
Abubakar AS, Wu Y, Chen F, Zhu A, Chen P, Chen K, Qiu X, Huang X, Zhao H, Chen J, Gao G. Comprehensive Analysis of WUSCEL-Related Homeobox Gene Family in Ramie (Boehmeria nivea) Indicates Its Potential Role in Adventitious Root Development. Biology (Basel) 2023; 12:1475. [PMID: 38132301 PMCID: PMC10740585 DOI: 10.3390/biology12121475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/18/2023] [Revised: 11/10/2023] [Accepted: 11/15/2023] [Indexed: 12/23/2023]
Abstract
The WUSCHEL-related homeobox (WOX) gene family has been implicated in promoting the transition of vegetative organs to an embryonic state and in maintaining plant embryonic stem cell identity. Using genome-wide analysis, we identified 17 candidate WOX genes in ramie (Boehmeria nivea). The genes (BnWOX) showed highly conserved homeodomain regions typical of WOX. Based on phylogenetic analysis, they were classified into three distinct groups: modern, intermediate, and ancient clades. The genes displayed 65% and 35% collinearities with their Arabidopsis thaliana and Oryza sativa orthologs, respectively, and exhibited similar motifs, suggesting similar functions. Furthermore, four segmental duplications (BnWOX10/14, BnWOX13A/13B, BnWOX9A/9B, and BnWOX6A/Maker00021031) and a tandem-duplicated pair (BnWOX5/7) among the putative ramie WOX genes were obtained, suggesting that whole-genome duplication (WGD) played a role in WOX gene expansion. Expression profiling analysis of the genes in the bud, leaf, stem, and root of the stem cuttings revealed higher expression levels of BnWOX10 and BnWOX14 in the stem and root and lower levels in the leaf, consistent with the qRT-PCR analysis, suggesting their direct roles in ramie root formation. Analysis of the rooting characteristics and expression in the stem cuttings of sixty-seven different ramie genetic resources showed a possible involvement of BnWOX14 in the adventitious rooting of ramie. Thus, this study provides valuable information on ramie WOX genes and lays the foundation for further research.
Affiliation(s)
- Aminu Shehu Abubakar
  - Hunan Provincial Key Laboratory of the TCM Agricultural Biogenomics, Changsha Medical University, Changsha 410219, China; (A.S.A.); (F.C.)
  - Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, Changsha 410221, China
  - Department of Agronomy, Bayero University Kano, PMB 3011, Kano 700241, Nigeria
- Yongmei Wu
  - Hunan Provincial Key Laboratory of the TCM Agricultural Biogenomics, Changsha Medical University, Changsha 410219, China; (A.S.A.); (F.C.)
- Fengming Chen
  - Hunan Provincial Key Laboratory of the TCM Agricultural Biogenomics, Changsha Medical University, Changsha 410219, China; (A.S.A.); (F.C.)
- Aiguo Zhu
  - Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, Changsha 410221, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya 572024, China
- Ping Chen
  - Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, Changsha 410221, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya 572024, China
- Kunmei Chen
  - Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, Changsha 410221, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya 572024, China
- Xiaojun Qiu
  - Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, Changsha 410221, China
- Xiaoyu Huang
  - Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, Changsha 410221, China
- Haohan Zhao
  - Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, Changsha 410221, China
- Jikang Chen
  - Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, Changsha 410221, China
  - Key Laboratory of Biological and Processing for Bast Fiber Crops, Changsha 410221, China
- Gang Gao
  - Hunan Provincial Key Laboratory of the TCM Agricultural Biogenomics, Changsha Medical University, Changsha 410219, China; (A.S.A.); (F.C.)
  - Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, Changsha 410221, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya 572024, China
4
Feng LY, Lin PF, Xu RJ, Kang HQ, Gao LZ. Comparative Genomic Analysis of Asian Cultivated Rice and Its Wild Progenitor (Oryza rufipogon) Has Revealed Evolutionary Innovation of the Pentatricopeptide Repeat Gene Family through Gene Duplication. Int J Mol Sci 2023; 24:16313. [PMID: 38003501 PMCID: PMC10671101 DOI: 10.3390/ijms242216313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/20/2023] [Revised: 11/10/2023] [Accepted: 11/12/2023] [Indexed: 11/26/2023] Open
Abstract
The pentatricopeptide repeat (PPR) gene family is one of the largest gene families in land plants. However, current knowledge about the evolution of the PPR gene family remains largely limited. In this study, we performed a comparative genomic analysis of the PPR gene family in O. sativa and its wild progenitor, O. rufipogon, and outlined a comprehensive landscape of gene duplications. Our findings suggest that the majority of PPR genes originated from dispersed duplications. Although segmental duplications have only expanded approximately 11.30% and 13.57% of the PPR gene families in the O. sativa and O. rufipogon genomes, we interestingly obtained evidence that segmental duplication promotes the structural diversity of PPR genes through incomplete gene duplications. In the O. sativa and O. rufipogon genomes, 10 (~33.33%) and 22 pairs of gene duplications (~45.83%) had non-PPR paralogous genes through incomplete gene duplication. Segmental duplications leading to incomplete gene duplications might result in the acquisition of domains, thus promoting functional innovation and structural diversification of PPR genes. This study offers a unique perspective on the evolution of PPR gene structures and underscores the potential role of segmental duplications in PPR gene structural diversity.
Affiliation(s)
- Li-Ying Feng
  - Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou 510642, China; (L.-Y.F.); (P.-F.L.)
- Pei-Fan Lin
  - Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou 510642, China; (L.-Y.F.); (P.-F.L.)
- Rong-Jing Xu
  - Tropical Biodiversity and Genomics Research Center, Hainan University, Haikou 570228, China; (R.-J.X.); (H.-Q.K.)
- Hai-Qi Kang
  - Tropical Biodiversity and Genomics Research Center, Hainan University, Haikou 570228, China; (R.-J.X.); (H.-Q.K.)
- Li-Zhi Gao
  - Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou 510642, China; (L.-Y.F.); (P.-F.L.)
  - Tropical Biodiversity and Genomics Research Center, Hainan University, Haikou 570228, China; (R.-J.X.); (H.-Q.K.)
5
Loehlin DW, McClain GL, Xu M, Kedia R, Root E. Demonstration of in vivo engineered tandem duplications of varying sizes using CRISPR and recombinases in Drosophila melanogaster. G3 (Bethesda) 2023; 13:jkad155. [PMID: 37462278 PMCID: PMC10542505 DOI: 10.1093/g3journal/jkad155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/08/2023] [Revised: 01/08/2023] [Accepted: 06/09/2023] [Indexed: 07/28/2023]
Abstract
Tandem gene duplicates are important parts of eukaryotic genome structure, yet the phenotypic effects of new tandem duplications are not well-understood, in part owing to a lack of techniques to build and modify them. We introduce a method, Recombinase-Mediated Tandem Duplication, to engineer specific tandem duplications in vivo using CRISPR and recombinases. We describe construction of four different tandem duplications of the Alcohol Dehydrogenase (Adh) gene in Drosophila melanogaster, with duplicated block sizes ranging from 4.2 to 20.7 kb. Flies with the Adh duplications show elevated ADH enzyme activity over unduplicated single copies. This approach to engineering duplications is combinatoric, opening the door to systematic study of the relationship between the structure of tandem duplications and their effects on expression.
Affiliation(s)
- David W Loehlin
  - Biology Department, Williams College, Williamstown, MA 01267, USA
- Manting Xu
  - Biology Department, Williams College, Williamstown, MA 01267, USA
- Ria Kedia
  - Biology Department, Williams College, Williamstown, MA 01267, USA
- Elise Root
  - Biology Department, Williams College, Williamstown, MA 01267, USA
6
Li B, Gschwend AR. Vitis labrusca genome assembly reveals diversification between wild and cultivated grapevine genomes. Front Plant Sci 2023; 14:1234130. [PMID: 37719220 PMCID: PMC10501149 DOI: 10.3389/fpls.2023.1234130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/03/2023] [Accepted: 08/03/2023] [Indexed: 09/19/2023]
Abstract
Wild grapevines are important genetic resources in breeding programs to confer adaptive fitness traits and unique fruit characteristics, but the genetics underlying these traits, and their evolutionary origins, are largely unknown. To determine the factors that contributed to grapevine genome diversification, we performed comprehensive intragenomic and intergenomic analyses with three cultivated European (including the PN40024 reference genome) and two wild North American grapevine genomes, including our newly released Vitis labrusca genome. We found the heterozygosity of the cultivated grapevine genomes was twice as high as that of the wild grapevine genomes studied. Approximately 30% of V. labrusca and 48% of V. vinifera Chardonnay genes were heterozygous or hemizygous, and a considerable number of collinear genes between Chardonnay and V. labrusca had different gene zygosity. Our study revealed evidence that gene gain-loss events in parental genomes resulted in the inheritance of hemizygous genes in the Chardonnay genome. Thousands of segmental duplications supplied source material for genome-specific genes, further driving diversification of the genomes studied. We found an enrichment of recently duplicated, adaptive genes in similar functional pathways, but differential retention of environment-specific adaptive genes within each genome. For example, large expansions of NLR genes were discovered in the two wild grapevine genomes studied. Our findings support the view that variation in transposable elements contributed to unique traits in grapevines. Our work revealed that gene zygosity, segmental duplications, gene gain-and-loss variations, and transposable element polymorphisms can be key driving forces for grapevine genome diversification.
Affiliation(s)
- Andrea R. Gschwend
  - Department of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United States
7
Wang H, Makowski C, Zhang Y, Qi A, Kaufmann T, Smeland OB, Fiecas M, Yang J, Visscher PM, Chen CH. Chromosomal inversion polymorphisms shape human brain morphology. Cell Rep 2023; 42:112896. [PMID: 37505983 PMCID: PMC10508191 DOI: 10.1016/j.celrep.2023.112896] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 12/23/2022] [Revised: 06/27/2023] [Accepted: 07/13/2023] [Indexed: 07/30/2023] Open
Abstract
The impact of chromosomal inversions on human brain morphology remains underexplored. We studied 35 common inversions classified from genotypes of 33,018 adults with European ancestry. The inversions at 2p22.3, 16p11.2, and 17q21.31 reach genome-wide significance, followed by 8p23.1 and 6p21.33, in their association with cortical and subcortical morphology. The 17q21.31, 8p23.1, and 16p11.2 regions comprise the LRRC37, OR7E, and NPIP duplicated gene families. We find the 17q21.31 MAPT inversion region, known for harboring neurological risk, to be the most salient locus among common variants for shaping and patterning the cortex. Overall, we observe the inverted orientations decreasing brain size, with the exception that the 2p22.3 inversion is associated with increased subcortical volume and the 8p23.1 inversion is associated with increased motor cortex. These significant inversions are in the genomic hotspots of neuropsychiatric loci. Our findings are generalizable to 3,472 children and demonstrate inversions as essential genetic variation to understand human brain phenotypes.
Affiliation(s)
- Hao Wang
  - Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, CA 92093, USA
- Carolina Makowski
  - Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, CA 92093, USA
- Yanxiao Zhang
  - Ludwig Institute for Cancer Research, La Jolla, CA 92093, USA; School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China; Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
- Anna Qi
  - Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, CA 92093, USA
- Tobias Kaufmann
  - Department of Psychiatry and Psychotherapy, Tübingen Center for Mental Health, University of Tübingen, 72076 Tübingen, Germany; Norwegian Centre for Mental Disorders Research, Oslo University Hospital and University of Oslo, 0450 Oslo, Norway
- Olav B Smeland
  - Norwegian Centre for Mental Disorders Research, Oslo University Hospital and University of Oslo, 0450 Oslo, Norway
- Mark Fiecas
  - Division of Biostatistics, University of Minnesota School of Public Health, Minneapolis, MN 55455, USA
- Jian Yang
  - School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China; Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
- Peter M Visscher
  - Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
- Chi-Hua Chen
  - Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, CA 92093, USA
8
Dachs N, Upadhyay M, Hannemann E, Hauser A, Krebs S, Seichter D, Russ I, Gehrke LJ, Thaller G, Medugorac I. Quantitative trait locus for calving traits on Bos taurus autosome 18 in Holstein cattle is embedded in a complex genomic region. J Dairy Sci 2023; 106:1925-1941. [PMID: 36710189 DOI: 10.3168/jds.2021-21625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/25/2021] [Accepted: 10/10/2022] [Indexed: 01/31/2023]
Abstract
Although the quantitative trait locus (QTL) on chromosome 18 (BTA18) associated with paternal calving ease and stillbirth in Holstein Friesian cattle and its crosses has been known for over 20 years, to our knowledge, the exact causal genetic sequence has so far escaped identification. The aim of this study was to re-examine the region of the published QTL on BTA18 and to investigate the possible reasons behind this elusiveness. For this purpose, we carried out a combined linkage disequilibrium and linkage analysis using genotyping data of 2,697 German Holstein Friesian (HF) animals, followed by whole-genome sequencing (WGS) data analyses and genome assembly of HF samples. We confirmed the known QTL in the 95% confidence interval of 1.089 Mbp between 58.34 and 59.43 Mbp on BTA18. Additionally, four SNPs in near-perfect linkage disequilibrium with the QTL haplotype were identified: rs381577268 (at 57,816,137 bp, C/T), rs381878735 (at 59,574,329 bp, A/T), rs464221818 (at 59,329,176 bp, C/T), and rs472502785 (at 59,345,689 bp, T/C). However, a search for the causal mutation using short- and long-read sequences and methylation data of the BTA18 QTL region did not reveal any candidates. The assembly showed problems in the region, as well as an abundance of segmental duplications within and around the region. Taking the QTL of BTA18 in Holstein cattle as an example, the data presented in this study comprehensively characterize the genomic features that could also be relevant for other such elusive QTL in other cattle breeds and livestock species.
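As a reading aid, the coordinates quoted in this abstract can be compared directly. The sketch below is an illustration only: it assumes the rounded interval bounds given in the text (58.34 and 59.43 Mbp, converted at 1 Mbp = 1,000,000 bp), and the names used are hypothetical. A SNP in linkage disequilibrium with the QTL haplotype does not have to lie inside the confidence interval, so a position outside it is not an inconsistency.

```python
# Rounded 95% confidence interval bounds from the abstract, in bp.
# Assumption: the exact bounds are not given, so these are approximations.
CI_START_BP = 58_340_000
CI_END_BP = 59_430_000

# SNP positions on BTA18 as quoted in the abstract.
SNP_POSITIONS_BP = {
    "rs381577268": 57_816_137,
    "rs381878735": 59_574_329,
    "rs464221818": 59_329_176,
    "rs472502785": 59_345_689,
}


def in_interval(position_bp: int, start_bp: int = CI_START_BP, end_bp: int = CI_END_BP) -> bool:
    """Return True if a position lies within the closed interval [start, end]."""
    return start_bp <= position_bp <= end_bp


# Width from the rounded bounds; the abstract reports 1.089 Mbp from unrounded values.
width_mbp = (CI_END_BP - CI_START_BP) / 1_000_000

inside = sorted(name for name, pos in SNP_POSITIONS_BP.items() if in_interval(pos))
outside = sorted(name for name, pos in SNP_POSITIONS_BP.items() if not in_interval(pos))

print(f"Interval width from rounded bounds: {width_mbp:.2f} Mbp")
print("Inside rounded interval:", ", ".join(inside))
print("Outside rounded interval:", ", ".join(outside))
```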
Affiliation(s)
- Nina Dachs
  - Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152 Martinsried, Germany; Tierzuchtforschung e.V. München, Senator-Gerauer-Str, 23, 85586 Poing, Germany
- Maulik Upadhyay
  - Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152 Martinsried, Germany
- Elisabeth Hannemann
  - Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152 Martinsried, Germany
- Andreas Hauser
  - Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Feodor-Lynen-Straße 25, 81377 Munich, Germany
- Stefan Krebs
  - Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Feodor-Lynen-Straße 25, 81377 Munich, Germany
- Doris Seichter
  - Tierzuchtforschung e.V. München, Senator-Gerauer-Str, 23, 85586 Poing, Germany
- Ingolf Russ
  - Tierzuchtforschung e.V. München, Senator-Gerauer-Str, 23, 85586 Poing, Germany
- Lilian Johanna Gehrke
  - Institute of Animal Breeding and Husbandry, Christian-Albrechts-University Kiel, Olshausenstraße 40, 24098 Kiel, Germany; Vereinigte Informationssysteme Tierhaltung w.V. (vit) Verden, Heinrich-Schröder-Weg 1, 27283 Verden (Aller), Germany
- Georg Thaller
  - Institute of Animal Breeding and Husbandry, Christian-Albrechts-University Kiel, Olshausenstraße 40, 24098 Kiel, Germany
- Ivica Medugorac
  - Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152 Martinsried, Germany
9
Loehlin DW, McClain GL, Xu M, Kedia R, Root E. Demonstration of in vivo engineered tandem duplications of varying sizes using CRISPR and recombinases in Drosophila melanogaster. bioRxiv 2023:2023.01.08.523181. [PMID: 36711585 PMCID: PMC9881931 DOI: 10.1101/2023.01.08.523181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/11/2023]
Abstract
Tandem gene duplicates are important parts of eukaryotic genome structure, yet the phenotypic effects of new tandem duplications are not well-understood, in part owing to a lack of techniques to build and modify them. We introduce a method, Recombinase-Mediated Tandem Duplication (RMTD), to engineer specific tandem duplications in vivo using CRISPR and recombinases. We describe construction of four different tandem duplications of the Alcohol Dehydrogenase (Adh) gene in Drosophila melanogaster, with duplicated block sizes ranging from 4.2 kb to 20.7 kb. Flies with the Adh duplications show elevated ADH enzyme activity over unduplicated single copies. This approach to engineering duplications is combinatoric, opening the door to systematic study of the relationship between the structure of tandem duplications and their effects on expression.
Affiliation(s)
- Manting Xu
  - Biology Department, Williams College, Williamstown, MA 01267
- Ria Kedia
  - Biology Department, Williams College, Williamstown, MA 01267
- Elise Root
  - Biology Department, Williams College, Williamstown, MA 01267
10
Zhang B, Feng C, Chen L, Li B, Zhang X, Yang X. Identification and Functional Analysis of bZIP Genes in Cotton Response to Drought Stress. Int J Mol Sci 2022; 23:14894. [PMID: 36499218 PMCID: PMC9736030 DOI: 10.3390/ijms232314894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/23/2022] [Revised: 11/11/2022] [Accepted: 11/17/2022] [Indexed: 11/29/2022] Open
Abstract
The basic leucine zipper (bZIP) transcription factors, which harbor a conserved bZIP domain composed of two regions, a DNA-binding basic region and a Leu Zipper region, operate as important switches of transcription networks in eukaryotes. However, this gene family has not been systematically characterized in cotton (Gossypium hirsutum). Here, we identified 197 bZIP family members in cotton. The chromosome distribution pattern indicates that the GhbZIP genes have undergone 53 genome-wide segmental and 7 tandem duplication events which contribute to the expansion of the cotton bZIP family. Phylogenetic analysis showed that cotton GhbZIP proteins cluster into 13 subfamilies, and homologous protein pairs showed similar characteristics. Inspection of the DNA-binding basic region and leucine repeat heptads within the bZIP domains indicated different DNA-binding site specificities as well as dimerization properties among different groups. Comprehensive expression analysis indicated the most highly and differentially expressed genes in root and leaf that might play significant roles in cotton response to drought stress. GhABF3D was identified as a highly and differentially expressed bZIP family gene in cotton leaf and root under drought stress treatments that likely controls drought stress responses in cotton. These data provide useful information for further functional analysis of the GhbZIP gene family and its potential application in crop improvement.
11
Xu K, Zhao Y, Zhao Y, Feng C, Zhang Y, Wang F, Li X, Gao H, Liu W, Jing Y, Saxena RK, Feng X, Zhou Y, Li H. Soybean F-Box-Like Protein GmFBL144 Interacts With Small Heat Shock Protein and Negatively Regulates Plant Drought Stress Tolerance. Front Plant Sci 2022; 13:823529. [PMID: 35720533 PMCID: PMC9201338 DOI: 10.3389/fpls.2022.823529] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 11/27/2021] [Accepted: 04/28/2022] [Indexed: 06/15/2023]
Abstract
The F-box gene family is one of the largest gene families in plants. These genes regulate plant growth and development, as well as biotic and abiotic stress responses, and they have been extensively researched. Drought stress is one of the major factors limiting the yield and quality of soybean. In this study, bioinformatics analysis of the soybean F-box gene family was performed, and the role of soybean F-box-like gene GmFBL144 in drought stress adaptation was characterized. We identified 507 F-box genes in the soybean genome database, which were classified into 11 subfamilies. The expression profiles showed that GmFBL144 was highly expressed in plant roots. Overexpression of GmFBL144 increased the sensitivity of transgenic Arabidopsis to drought stress. Under drought stress, the hydrogen peroxide (H2O2) and malonaldehyde (MDA) contents of transgenic Arabidopsis were higher than those of the wild type (WT) and empty vector control, and the chlorophyll content was lower than that of the control. Y2H and bimolecular fluorescence complementation (BiFC) assays showed that GmFBL144 can interact with GmsHSP. Furthermore, our results showed that GmFBL144 can form SCF FBL144 (E3 ubiquitin ligase) with GmSkp1 and GmCullin1. Altogether, these results indicate that the soybean F-box-like protein GmFBL144 may negatively regulate plant drought stress tolerance by interacting with sHSP. These findings provide a basis for molecular genetics and breeding of soybean.
Affiliation(s)
- Keheng Xu
  - College of Life Sciences, Jilin Agricultural University, Changchun, China
- Yu Zhao
  - College of Life Sciences, Jilin Agricultural University, Changchun, China
- Yan Zhao
  - College of Life Sciences, Jilin Agricultural University, Changchun, China
- Chen Feng
  - College of Life Sciences, Jilin Agricultural University, Changchun, China
- Yinhe Zhang
  - College of Life Sciences, Jilin Agricultural University, Changchun, China
- Fawei Wang
  - College of Life Sciences, Jilin Agricultural University, Changchun, China
- Xiaowei Li
  - College of Life Sciences, Jilin Agricultural University, Changchun, China
- Hongtao Gao
  - College of Tropical Crops, Sanya Nanfan Research Institute, Hainan University, Haikou, China
  - Hainan Yazhou Bay Seed Laboratory, Sanya, China
- Weican Liu
  - College of Life Sciences, Jilin Agricultural University, Changchun, China
- Yan Jing
  - College of Tropical Crops, Sanya Nanfan Research Institute, Hainan University, Haikou, China
  - Hainan Yazhou Bay Seed Laboratory, Sanya, China
- Rachit K. Saxena
  - International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
- Xianzhong Feng
  - Key Laboratory of Soybean Molecular Design Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, China
- Yonggang Zhou
  - College of Tropical Crops, Sanya Nanfan Research Institute, Hainan University, Haikou, China
  - Hainan Yazhou Bay Seed Laboratory, Sanya, China
- Haiyan Li
  - College of Life Sciences, Jilin Agricultural University, Changchun, China
  - College of Tropical Crops, Sanya Nanfan Research Institute, Hainan University, Haikou, China
  - Hainan Yazhou Bay Seed Laboratory, Sanya, China
12
|
Wen Y, Raza A, Chu W, Zou X, Cheng H, Hu Q, Liu J, Wei W. Comprehensive In Silico Characterization and Expression Profiling of TCP Gene Family in Rapeseed. Front Genet 2021; 12:794297. [PMID: 34868279 PMCID: PMC8635964 DOI: 10.3389/fgene.2021.794297] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 10/13/2021] [Accepted: 11/01/2021] [Indexed: 11/13/2022] Open
Abstract
TCP proteins are plant-specific transcription factors with diverse roles in plant development and stress responses. Therefore, a genome-wide analysis was performed to categorize the TCP genes in the rapeseed genome. In this study, a total of 80 BnTCP genes were identified in the rapeseed genome and grouped into two main classes (PCF and CYC/TB1) according to phylogenetic analysis. Evolutionary analysis showed that BnTCP genes had experienced segmental duplications and positive selection pressure. Gene structure and conserved motif analysis showed that Class I and Class II have diverse intron-exon patterns and motif numbers. Overall, nine conserved motifs were identified, and their number varied from 2 to 7 across the TCP genes; some of them were gene-specific. Mainly, Class II (PCF and CYC/TB1) possessed diverse structures compared to Class I. We identified four hormone- and four stress-related responsive cis-elements in the promoter regions. Moreover, 32 bna-miRNAs from 14 families were found to target 21 BnTCP genes. Gene ontology enrichment analysis showed that the BnTCP genes were primarily related to RNA/DNA binding, metabolic processes, transcriptional regulatory activities, etc. Transcriptome-based tissue-specific expression analysis showed that only a few genes (mainly BnTCP9, BnTCP22, BnTCP25, BnTCP48, BnTCP52, BnTCP60, BnTCP66, and BnTCP74) had higher expression in root, stem, leaf, flower, seeds, and silique among all tested tissues. Likewise, qRT-PCR-based expression analysis showed that BnTCP36, BnTCP39, BnTCP53, BnTCP59, and BnTCP60 had higher expression at certain time points under various hormone and abiotic stress conditions, but not under drought and MeJA. Our results lay the groundwork for future understanding of the intricate mechanisms of BnTCP in various developmental processes and abiotic stress signaling pathways in rapeseed.
Affiliation(s)
- Yunfei Wen
  - College of Agriculture, Yangtze University, Jingzhou, China
  - Key Laboratory for Biological Sciences and Genetic Improvement of Oil Crops, Oil Crops Research Institute of Chinese Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Wuhan, China
- Ali Raza
  - Key Laboratory for Biological Sciences and Genetic Improvement of Oil Crops, Oil Crops Research Institute of Chinese Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Wuhan, China
  - Fujian Provincial Key Laboratory of Crop Molecular and Cell Biology, Center of Legume Crop Genetics and Systems Biology/College of Agriculture, Oil Crops Research Institute, Fujian Agriculture and Forestry University (FAFU), Fuzhou, China
- Wen Chu
  - Key Laboratory for Biological Sciences and Genetic Improvement of Oil Crops, Oil Crops Research Institute of Chinese Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Wuhan, China
- Xiling Zou
  - Key Laboratory for Biological Sciences and Genetic Improvement of Oil Crops, Oil Crops Research Institute of Chinese Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Wuhan, China
- Hongtao Cheng
  - Key Laboratory for Biological Sciences and Genetic Improvement of Oil Crops, Oil Crops Research Institute of Chinese Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Wuhan, China
- Qiong Hu
  - Key Laboratory for Biological Sciences and Genetic Improvement of Oil Crops, Oil Crops Research Institute of Chinese Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Wuhan, China
- Jia Liu
  - Key Laboratory for Biological Sciences and Genetic Improvement of Oil Crops, Oil Crops Research Institute of Chinese Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Wuhan, China
- Wenliang Wei
  - College of Agriculture, Yangtze University, Jingzhou, China
|
13
|
Ricchio J, Uno F, Carvalho AB. New Genes in the Drosophila Y Chromosome: Lessons from D. willistoni. Genes (Basel) 2021; 12:1815. [PMID: 34828421 PMCID: PMC8623413 DOI: 10.3390/genes12111815] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 10/04/2021] [Revised: 11/08/2021] [Accepted: 11/11/2021] [Indexed: 01/05/2023] Open
Abstract
Y chromosomes play important roles in sex determination and male fertility. In several groups (e.g., mammals) there is strong evidence that they evolved through gene loss from a common X-Y ancestor, but in Drosophila the acquisition of new genes plays a major role. This conclusion came mostly from studies in two species. Here we report the identification of the 22 Y-linked genes in D. willistoni. They all fit the previously observed pattern of autosomal or X-linked testis-specific genes that duplicated to the Y. The ratio of gene gains to gene losses is ~25 in D. willistoni, confirming the prominent role of gene gains in the evolution of Drosophila Y chromosomes. We also found four large segmental duplications (ranging from 62 kb to 303 kb) from autosomal regions to the Y, containing ~58 genes. All but four of these duplicated genes became pseudogenes in the Y or disappeared. In the GK20609 gene the Y-linked copy remained functional, whereas its original autosomal copy degenerated, demonstrating how autosomal genes are transferred to the Y chromosome. Since the segmental duplication that carried GK20609 contained six other testis-specific genes, it seems that chance plays a significant role in the acquisition of new genes by the Drosophila Y chromosome.
|
14
|
Riba A, Fumagalli MR, Caselle M, Osella M. A Model-Driven Quantitative Analysis of Retrotransposon Distributions in the Human Genome. Genome Biol Evol 2021; 12:2045-2059. [PMID: 32986810 PMCID: PMC7750997 DOI: 10.1093/gbe/evaa201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Accepted: 09/19/2020] [Indexed: 12/21/2022] Open
Abstract
Retrotransposons, DNA sequences capable of creating copies of themselves, compose about half of the human genome and played a central role in the evolution of mammals. Their current position in the host genome is the result of the retrotranscription process and of the subsequent host genome evolution. We apply a model from statistical physics to show that the genomic distribution of the two most populated classes of retrotransposons in human deviates from random placement, and that this deviation increases with time. The time dependence suggests a major role of the host genome dynamics in shaping the current retrotransposon distributions. Focusing on a neutral scenario, we show that a simple model based on random placement followed by genome expansion and sequence duplications can reproduce the empirical retrotransposon distributions, even though more complex and possibly selective mechanisms may have contributed. Besides the inherent interest in understanding the origin of current retrotransposon distributions, this work sets a general analytical framework to analyze quantitatively the effects of genome evolutionary dynamics on the distribution of genomic elements.
Affiliation(s)
- Maria Rita Fumagalli
  - Institute of Biophysics - CNR, National Research Council, Genova, Italy
  - Department of Environmental Science and Policy, Center for Complexity and Biosystems, University of Milan, Milano, Italy
- Michele Caselle
  - Department of Physics and INFN, University of Torino, Torino, Italy
- Matteo Osella
  - Department of Physics and INFN, University of Torino, Torino, Italy
|
15
|
Takeda I, Araki M, Ishiguro KI, Ohga T, Takada K, Yamaguchi Y, Hashimoto K, Kai T, Nakagata N, Imasaka M, Yoshinobu K, Araki K. Gene trapping reveals a new transcriptionally active genome element: The chromosome-specific clustered trap region. Genes Cells 2021; 26:874-890. [PMID: 34418226 DOI: 10.1111/gtc.12890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/19/2021] [Revised: 08/18/2021] [Accepted: 08/18/2021] [Indexed: 12/01/2022]
Abstract
Nearly half of the human genome consists of repetitive sequences such as long interspersed nuclear elements. The relationship between these repeating sequences and diseases has remained unclear. Gene trapping is a useful technique for disrupting a gene and expressing a reporter gene by using the promoter activity of the gene. The analysis of trapped genes revealed a new genome element, the chromosome-specific clustered trap (CSCT) region. For any examined sequence within this region, an equivalent was found using the BLAT tool of the University of California, Santa Cruz (UCSC) Genome Browser. CSCT13 mapped to chromosome 13 and contained only three genes. To elucidate its in vivo function, the whole CSCT13 region (1.6 Mbp) was deleted using the CRISPR/Cas9 system in mouse embryonic stem cells, and subsequently, a CSCT13 knockout mouse line was established. The rate of homozygotes was significantly lower than expected according to Mendel's laws. In addition, the number of offspring obtained by mating homozygotes was significantly smaller than that obtained by crossing controls. Furthermore, CSCT13 might have an effect on meiotic homologous recombination. This study identifies a transcriptionally active CSCT with an important role in mouse development.
Affiliation(s)
- Iyo Takeda
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Masatake Araki
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Kei-Ichiro Ishiguro
  - Institute of Molecular Embryology and Genetics, Kumamoto University, Kumamoto, Japan
- Toshinori Ohga
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Kouki Takada
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Yusuke Yamaguchi
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Koichi Hashimoto
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Takuma Kai
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Naomi Nakagata
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Mai Imasaka
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Kumiko Yoshinobu
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
- Kimi Araki
  - Institute of Resource Development and Analysis, Kumamoto University, Kumamoto, Japan
|
16
|
Vervoort L, Dierckxsens N, Pereboom Z, Capozzi O, Rocchi M, Shaikh TH, Vermeesch JR. 22q11.2 Low Copy Repeats Expanded in the Human Lineage. Front Genet 2021; 12:706641. [PMID: 34335701 PMCID: PMC8320366 DOI: 10.3389/fgene.2021.706641] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/13/2021] [Accepted: 06/23/2021] [Indexed: 11/13/2022] Open
Abstract
Segmental duplications or low copy repeats (LCRs) constitute duplicated regions interspersed in the human genome, currently neglected in standard analyses due to their extreme complexity. Recent functional studies have indicated the potential of genes within LCRs in synaptogenesis, neuronal migration, and neocortical expansion in the human lineage. One of the regions with the highest proportion of duplicated sequence is the 22q11.2 locus, carrying eight LCRs (LCR22-A through LCR22-H), and rearrangements between them cause the 22q11.2 deletion syndrome. The LCR22-A block was recently reported to be hypervariable in the human population. It remains unknown whether this variability also exists in non-human primates, since research is strongly hampered by the presence of sequence gaps in the human and non-human primate reference genomes. To chart the LCR22 haplotypes and the associated inter- and intra-species variability, we de novo assembled the region in non-human primates by a combination of optical mapping techniques. A minimal and likely ancient haplotype is present in the chimpanzee, bonobo, and rhesus monkey without intra-species variation. In addition, the optical maps identified assembly errors and closed gaps in the orthologous chromosome 22 reference sequences. These findings indicate the LCR22 expansion to be unique to the human population, which might indicate involvement of the region in human evolution and adaptation. These maps will enable LCR22-specific functional studies and the investigation of potential associations with the phenotypic variability in the 22q11.2 deletion syndrome.
Affiliation(s)
- Zjef Pereboom
  - Centre for Research and Conservation, Royal Zoological Society of Antwerp, Antwerp, Belgium
  - Evolutionary Ecology Group, Department of Biology, Antwerp University, Antwerp, Belgium
- Tamim H. Shaikh
  - Section of Genetics and Metabolism, Department of Pediatrics, University of Colorado School of Medicine, Aurora, CO, United States
|
17
|
Zhao X, Collins RL, Lee WP, Weber AM, Jun Y, Zhu Q, Weisburd B, Huang Y, Audano PA, Wang H, Walker M, Lowther C, Fu J, Gerstein MB, Devine SE, Marschall T, Korbel JO, Eichler EE, Chaisson MJP, Lee C, Mills RE, Brand H, Talkowski ME. Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies. Am J Hum Genet 2021; 108:919-928. [PMID: 33789087 PMCID: PMC8206509 DOI: 10.1016/j.ajhg.2021.03.014] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Received: 06/24/2020] [Accepted: 03/12/2021] [Indexed: 12/13/2022] Open
Abstract
Virtually all genome sequencing efforts in national biobanks, complex and Mendelian disease programs, and medical genetic initiatives are reliant upon short-read whole-genome sequencing (srWGS), which presents challenges for the detection of structural variants (SVs) relative to emerging long-read WGS (lrWGS) technologies. Given this ubiquity of srWGS in large-scale genomics initiatives, we sought to establish expectations for routine SV detection from this data type by comparison with lrWGS assembly, as well as to quantify the genomic properties and added value of SVs uniquely accessible to each technology. Analyses from the Human Genome Structural Variation Consortium (HGSVC) of three families captured ~11,000 SVs per genome from srWGS and ~25,000 SVs per genome from lrWGS assembly. Detection power and precision for SV discovery varied dramatically by genomic context and variant class: 9.7% of the current GRCh38 reference is defined by segmental duplication (SD) and simple repeat (SR), yet 91.4% of deletions that were specifically discovered by lrWGS localized to these regions. Across the remaining 90.3% of reference sequence, we observed extremely high (93.8%) concordance between technologies for deletions in these datasets. In contrast, lrWGS was superior for detection of insertions across all genomic contexts. Given that non-SD/SR sequences encompass 95.9% of currently annotated disease-associated exons, improved sensitivity from lrWGS to discover novel pathogenic deletions in these currently interpretable genomic regions is likely to be incremental. However, these analyses highlight the considerable added value of assembly-based lrWGS to create new catalogs of insertions and transposable elements, as well as disease-associated repeat expansions in genomic sequences that were previously recalcitrant to routine assessment.
Affiliation(s)
- Xuefang Zhao
  - Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Disorders, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
- Ryan L Collins
  - Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Disorders, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
- Wan-Ping Lee
  - The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA
- Alexandra M Weber
  - Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA; Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
- Yukyung Jun
  - The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA
- Qihui Zhu
  - The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA
- Ben Weisburd
  - Program in Medical and Population Genetics and Stanley Center for Psychiatric Disorders, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA
- Yongqing Huang
  - Data Sciences Platform, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA
- Peter A Audano
  - Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Harold Wang
  - Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Disorders, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA
- Mark Walker
  - Program in Medical and Population Genetics and Stanley Center for Psychiatric Disorders, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
- Chelsea Lowther
  - Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Disorders, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
- Jack Fu
  - Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Disorders, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
- Mark B Gerstein
  - Yale University Medical School, Computational Biology and Bioinformatics Program, New Haven, CT 06520, USA
- Scott E Devine
  - Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
- Tobias Marschall
  - Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, 40225 Düsseldorf, Germany
- Jan O Korbel
  - European Molecular Biology Laboratory, Genome Biology Unit, 69117 Heidelberg, Germany; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Evan E Eichler
  - Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
- Mark J P Chaisson
  - Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Charles Lee
  - The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA; Department of Graduate Studies - Life Sciences, Ewha Womans University, 52, Ewhayeodae-gil, Seodaemun-gu, Seoul 03760, South Korea; Precision Medicine Center, The First Affiliated Hospital of Xi'an Jiaotong University, 277 West Yanta Road, Xi'an 710061, Shaanxi, People's Republic of China
- Ryan E Mills
  - Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA; Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
- Harrison Brand
  - Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Disorders, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
- Michael E Talkowski
  - Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Program in Medical and Population Genetics and Stanley Center for Psychiatric Disorders, Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA; Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
|
19
|
Islam T, Ranjan D, Zubair M, Young E, Xiao M, Riethman H. Analysis of Subtelomeric REXTAL Assemblies Using QUAST. IEEE/ACM Trans Comput Biol Bioinform 2021; 18:365-372. [PMID: 31056507 PMCID: PMC6940546 DOI: 10.1109/tcbb.2019.2913845] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 06/09/2023]
Abstract
Genomic regions of high segmental duplication content and/or structural variation have led to gaps and misassemblies in the human reference sequence, and are refractory to assembly from whole-genome short-read datasets. Human subtelomere regions are highly enriched in both segmental duplication content and structural variations, and as a consequence are both impossible to assemble accurately and highly variable from individual to individual. Recently, we developed a pipeline for improved region-specific assembly called Regional Extension of Assemblies Using Linked-Reads (REXTAL). In this study, we evaluate REXTAL and genome-wide assembly (Supernova) approaches on 10X Genomics linked-reads data sets partitioned and barcoded using the Gel Bead in Emulsion (GEM) microfluidic method. Our results describe the accuracy and relative performance of these two approaches using the reference-based assessment module of QUAST. We show that REXTAL dramatically outperforms the Supernova whole genome assembler in subtelomeric segmental duplication regions, and results in highly accurate assemblies. Nearly all of the REXTAL "misassemblies" identified using default QUAST parameters simply pinpoint locations of tandem repeat arrays in the reference sequence where the repeat array length differs from that in the cognate REXTAL assembly by 1000 bp.
|
20
|
Frazier AE, Compton AG, Kishita Y, Hock DH, Welch AE, Amarasekera SSC, Rius R, Formosa LE, Imai-Okazaki A, Francis D, Wang M, Lake NJ, Tregoning S, Jabbari JS, Lucattini A, Nitta KR, Ohtake A, Murayama K, Amor DJ, McGillivray G, Wong FY, van der Knaap MS, Jeroen Vermeulen R, Wiltshire EJ, Fletcher JM, Lewis B, Baynam G, Ellaway C, Balasubramaniam S, Bhattacharya K, Freckmann ML, Arbuckle S, Rodriguez M, Taft RJ, Sadedin S, Cowley MJ, Minoche AE, Calvo SE, Mootha VK, Ryan MT, Okazaki Y, Stroud DA, Simons C, Christodoulou J, Thorburn DR. Fatal perinatal mitochondrial cardiac failure caused by recurrent de novo duplications in the ATAD3 locus. Med (N Y) 2020; 2:49-73. [PMID: 33575671 DOI: 10.1016/j.medj.2020.06.004] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Indexed: 12/12/2022]
Abstract
Background: In about half of all patients with a suspected monogenic disease, genomic investigations fail to identify the diagnosis. A contributing factor is the difficulty with repetitive regions of the genome, such as those generated by segmental duplications. The ATAD3 locus is one such region, in which recessive deletions and dominant duplications have recently been reported to cause lethal perinatal mitochondrial diseases characterized by pontocerebellar hypoplasia or cardiomyopathy, respectively. Methods: Whole exome, whole genome and long-read DNA sequencing techniques combined with studies of RNA and quantitative proteomics were used to investigate 17 subjects from 16 unrelated families with suspected mitochondrial disease. Findings: We report six different de novo duplications in the ATAD3 gene locus causing a distinctive presentation including lethal perinatal cardiomyopathy, persistent hyperlactacidemia, and frequently corneal clouding or cataracts and encephalopathy. The recurrent 68 kb ATAD3 duplications are identifiable from genome and exome sequencing but usually missed by microarrays. The ATAD3 duplications result in the formation of identical chimeric ATAD3A/ATAD3C proteins, altered ATAD3 complexes and a striking reduction in mitochondrial oxidative phosphorylation complex I and its activity in heart tissue. Conclusions: ATAD3 duplications appear to act in a dominant-negative manner and the de novo inheritance implies a low recurrence risk for families, unlike most pediatric mitochondrial diseases. More than 350 genes underlie mitochondrial diseases. In our experience the ATAD3 locus is now one of the five most common causes of nuclear-encoded pediatric mitochondrial disease, but the repetitive nature of the locus means ATAD3 diagnoses may be frequently missed by current genomic strategies. Funding: Australian NHMRC, US Department of Defense, Japanese AMED and JSPS agencies, Australian Genomics Health Alliance and Australian Mito Foundation.
Affiliation(s)
- Ann E Frazier
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Department of Paediatrics, University of Melbourne, Melbourne, VIC 3052, Australia.,These authors contributed equally: A.E. Frazier, A.G. Compton
| | - Alison G Compton
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Department of Paediatrics, University of Melbourne, Melbourne, VIC 3052, Australia.,These authors contributed equally: A.E. Frazier, A.G. Compton
| | - Yoshihito Kishita
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Juntendo University, Graduate School of Medicine, Tokyo, 113-8421, Japan
| | - Daniella H Hock
- Department of Biochemistry and Molecular Biology and Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Melbourne, VIC 3052, Australia
| | - AnneMarie E Welch
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia
| | - Sumudu S C Amarasekera
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Department of Paediatrics, University of Melbourne, Melbourne, VIC 3052, Australia
| | - Rocio Rius
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Department of Paediatrics, University of Melbourne, Melbourne, VIC 3052, Australia
| | - Luke E Formosa
- Department of Biochemistry and Molecular Biology, Monash Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Atsuko Imai-Okazaki
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Juntendo University, Graduate School of Medicine, Tokyo, 113-8421, Japan.,Division of Genomic Medicine Research, Medical Genomics Center, National Center for Global Health and Medicine, Tokyo 162-8655, Japan
| | - David Francis
- Victorian Clinical Genetics Services, Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia
| | - Min Wang
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia
| | - Nicole J Lake
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Department of Paediatrics, University of Melbourne, Melbourne, VIC 3052, Australia.,Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Simone Tregoning
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Victorian Clinical Genetics Services, Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia
| | - Jafar S Jabbari
- Australian Genome Research Facility Ltd, Victorian Comprehensive Cancer Centre, Melbourne VIC 3052, Australia
| | - Alexis Lucattini
- Australian Genome Research Facility Ltd, Victorian Comprehensive Cancer Centre, Melbourne VIC 3052, Australia
| | - Kazuhiro R Nitta
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Juntendo University, Graduate School of Medicine, Tokyo, 113-8421, Japan
| | - Akira Ohtake
- Department of Pediatrics & Clinical Genomics, Saitama Medical University Hospital, Saitama, 350-0495, Japan
| | - Kei Murayama
- Department of Metabolism, Chiba Children's Hospital, Chiba, 266-0007, Japan
| | - David J Amor
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Department of Paediatrics, University of Melbourne, Melbourne, VIC 3052, Australia
| | - George McGillivray
- Victorian Clinical Genetics Services, Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia
| | - Flora Y Wong
- Ritchie Centre, Hudson Institute of Medical Research; Department of Paediatrics, Monash University; and Monash Newborn, Monash Children's Hospital, Melbourne, VIC 3168, Australia
| | - Marjo S van der Knaap
- Child Neurology, Emma Children's Hospital, Amsterdam University Medical Centers, Vrije Universiteit and Amsterdam Neuroscience, 1081 HV Amsterdam, The Netherlands.,Functional Genomics, Center for Neurogenomics and Cognitive Research, Vrije Universiteit and Amsterdam Neuroscience, 1081 HV Amsterdam, The Netherlands
| | - R Jeroen Vermeulen
- Department of Neurology, Maastricht University Medical Center, 6229 HX, Maastricht, The Netherlands
| | - Esko J Wiltshire
- Department of Paediatrics and Child Health, University of Otago Wellington and Capital and Coast District Health Board, Wellington 6021, New Zealand
| | - Janice M Fletcher
- Department of Genetics and Molecular Pathology, SA Pathology, Adelaide, SA 5000, Australia
| | - Barry Lewis
- Department of Clinical Biochemistry, PathWest Laboratory Medicine Western Australia, Nedlands, WA 6009, Australia
| | - Gareth Baynam
- Western Australian Register of Developmental Anomalies and Genetic Services of Western Australia and King Edward Memorial Hospital for Women Perth, Subiaco, WA 6008, Australia.,Telethon Kids Institute and School of Paediatrics and Child Health, The University of Western Australia, Perth, WA 6009, Australia
| | - Carolyn Ellaway
- Genetic Metabolic Disorders Service, Sydney Children's Hospital Network, The Children's Hospital at Westmead, Sydney, NSW 2145, Australia.,Disciplines of Genomic Medicine and Child and Adolescent Health, Sydney Medical School, University of Sydney, NSW 2145, Australia
| | - Shanti Balasubramaniam
- Genetic Metabolic Disorders Service, Sydney Children's Hospital Network, The Children's Hospital at Westmead, Sydney, NSW 2145, Australia
| | - Kaustuv Bhattacharya
- Genetic Metabolic Disorders Service, Sydney Children's Hospital Network, The Children's Hospital at Westmead, Sydney, NSW 2145, Australia.,Disciplines of Genomic Medicine and Child and Adolescent Health, Sydney Medical School, University of Sydney, NSW 2145, Australia
| | | | - Susan Arbuckle
- Department of Histopathology, The Children's Hospital at Westmead, Sydney Children's Hospital Network, Sydney, NSW 2145, Australia
| | - Michael Rodriguez
- Discipline of Pathology, School of Medical Sciences, The University of Sydney, Sydney, NSW 2006, Australia
| | | | - Simon Sadedin
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Victorian Clinical Genetics Services, Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia
| | - Mark J Cowley
- Children's Cancer Institute, Kensington, NSW 2750, Australia; St Vincent's Clinical School, UNSW Sydney, Darlinghurst, NSW 2010, Australia.,Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia
| | - André E Minoche
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010, Australia
| | - Sarah E Calvo
- Broad Institute, Cambridge, MA 02142, USA; Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA; Harvard Medical School, Boston, MA 02446, USA
| | - Vamsi K Mootha
- Broad Institute, Cambridge, MA 02142, USA; Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA; Harvard Medical School, Boston, MA 02446, USA
| | - Michael T Ryan
- Department of Biochemistry and Molecular Biology, Monash Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Yasushi Okazaki
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Juntendo University, Graduate School of Medicine, Tokyo, 113-8421, Japan
| | - David A Stroud
- Department of Biochemistry and Molecular Biology and Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Melbourne, VIC 3052, Australia
| | - Cas Simons
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072 Australia
| | - John Christodoulou
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Department of Paediatrics, University of Melbourne, Melbourne, VIC 3052, Australia.,Victorian Clinical Genetics Services, Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Disciplines of Genomic Medicine and Child and Adolescent Health, Sydney Medical School, University of Sydney, NSW 2145, Australia
| | - David R Thorburn
- Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Department of Paediatrics, University of Melbourne, Melbourne, VIC 3052, Australia.,Victorian Clinical Genetics Services, Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, VIC 3052, Australia.,Lead contact
21
Moharana KC, Venancio TM. Polyploidization events shaped the transcription factor repertoires in legumes (Fabaceae). Plant J 2020; 103:726-741. [PMID: 32270526 DOI: 10.1111/tpj.14765] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 11/21/2019] [Revised: 03/13/2020] [Accepted: 03/25/2020] [Indexed: 06/11/2023]
Abstract
Transcription factors (TFs) are essential for plant growth and development. Several legumes (e.g. soybean) are rich sources of protein and oil and have great economic importance. Here we report a phylogenomic analysis of TF families in legumes and their potential association with important traits (e.g. nitrogen fixation). We used TF DNA-binding domains to systematically screen the genomes of 15 leguminous and five non-leguminous species. Transcription factor orthologous groups (OGs) were used to estimate OG sizes in ancestral nodes using a gene birth-death model, which allowed the identification of lineage-specific expansions. The OG analysis and rate of synonymous substitutions show that major TF expansions are strongly associated with whole-genome duplication (WGD) events in the legume (approximately 58 million years ago) and Glycine (approximately 13 million years ago) lineages, which account for a large fraction of the Phaseolus vulgaris and Glycine max TF repertoires. Of the 3407 G. max TFs, 1808 and 676 have homeologs within single syntenic regions in Phaseolus vulgaris and Vitis vinifera, respectively. We found a trend for TFs expanded in legumes to be preferentially transcribed in roots and nodules, supporting their recruitment early in the evolution of nodulation in the legume clade. Some families also showed count differences between G. max and the wild soybean Glycine soja, including genes located within important quantitative trait loci. Our findings strongly support the roles of two WGDs in shaping the TF repertoires in the legume and Glycine lineages, and these are probably related to important aspects of legume and soybean biology.
Affiliation(s)
- Kanhu C Moharana
- Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, Brazil
| | - Thiago M Venancio
- Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, Brazil
22
Pang H, Yu X, Kim YM, Wang X, Jinkins JK, Yin J, Li S, Gu H. Disorders Associated With Diverse, Recurrent Deletions and Duplications at 1q21.1. Front Genet 2020; 11:577. [PMID: 32655619 PMCID: PMC7325322 DOI: 10.3389/fgene.2020.00577] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 04/21/2020] [Accepted: 05/11/2020] [Indexed: 01/22/2023] Open
Abstract
The subchromosomal region 1q21.1 is one of the hotspots in the human genome for deletions and reciprocal duplications, owing to the existence of hundreds of segmental duplications. Recurrent deletions and duplications in this region are thought to be causative in patients with variable clinical manifestations. Based on the genomic locations, deletions and duplications at the 1q21.1 locus have been associated with distinguishable syndromes: chromosome 1q21.1 deletion syndrome, chromosome 1q21.1 duplication syndrome, and thrombocytopenia-absent radius (TAR) syndrome, which is partially due to deletions at the proximal 1q21.1 region. We report here diverse, recurrent deletions and duplications at the 1q21.1 locus in 36 patients from a cohort of 5,200 individuals. Among the 36 patients, 18 patients carry 1q21.1 deletions, nine individuals have reciprocal duplications at 1q21.1, two patients share an identical short deletion, and the remaining seven possess duplications of variable size at the proximal 1q21.1 region. Furthermore, we provide cytogenetic characterization and detailed clinical features for each patient. Notably, duplications at the proximal 1q21.1 region have not been associated with a defined disorder in the published literature. However, the recurrence of duplications at the proximal 1q21.1 region among the seven patients strongly suggests that the variants are likely pathogenic. The common phenotypical features of those disorders are also summarized to facilitate clinical diagnoses and genetic counseling.
Affiliation(s)
- Hui Pang
- Department of Pediatrics, The University of Oklahoma Health Sciences Center, Oklahoma, OK, United States
| | - Xiaowei Yu
- The First Affiliated Hospital of Jilin University, Changchun, China
| | - Young Mi Kim
- Department of Pediatrics, The University of Oklahoma Health Sciences Center, Oklahoma, OK, United States
| | - Xianfu Wang
- Department of Pediatrics, The University of Oklahoma Health Sciences Center, Oklahoma, OK, United States
| | - Jeremy K Jinkins
- Department of Pediatrics, The University of Oklahoma Health Sciences Center, Oklahoma, OK, United States
| | - Jianing Yin
- The First Affiliated Hospital of Jilin University, Changchun, China
| | - Shibo Li
- Department of Pediatrics, The University of Oklahoma Health Sciences Center, Oklahoma, OK, United States
| | - Hongcang Gu
- Department of Pediatrics, The University of Oklahoma Health Sciences Center, Oklahoma, OK, United States.,Broad Institute of MIT and Harvard, Cambridge, MA, United States
23
Meng L, Liu X, He C, Xu B, Li Y, Hu Y. Functional divergence and adaptive selection of KNOX gene family in plants. Open Life Sci 2020; 15:346-363. [PMID: 33817223 PMCID: PMC7874613 DOI: 10.1515/biol-2020-0036] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 03/04/2020] [Revised: 04/09/2020] [Accepted: 04/23/2020] [Indexed: 12/16/2022] Open
Abstract
KNOTTED-like homeodomain (KNOX) genes are transcriptional regulators that play an important role in morphogenesis. In the present study, a comparative analysis was performed to investigate the molecular evolutionary characteristics of the KNOX gene family in 10 different plant species. We identified 129 KNOX gene family members, which were categorized into two subfamilies based on multiple sequence alignment and phylogenetic tree reconstruction. Several segmental duplication pairs were found, indicating that different species share a common expansion model. Functional divergence analysis identified 15 and 52 amino acid sites with significant changes in evolutionary rates and amino acid physicochemical properties as functional divergence sites. Additional selection analysis showed that 14 amino acid sites underwent positive selection during evolution, and two groups of co-evolutionary amino acid sites were identified by Coevolution Analysis using Protein Sequences software. These sites could play critical roles in the molecular evolution of the KNOX gene family in these species. In addition, the expression profiles of KNOX duplicated genes demonstrated functional divergence. Taken together, these results provide novel insights into the structural and functional evolution of the KNOX gene family.
Affiliation(s)
- Lingyan Meng
- College of Life Sciences, Capital Normal University, Beijing, 100048, China
| | - Xiaomei Liu
- College of Life Sciences, Capital Normal University, Beijing, 100048, China
| | - Congfen He
- Beijing Key Lab of Plant Resource Research and Development, Beijing Technology and Business University, Beijing, 100048, China
| | - Biyao Xu
- College of Life Sciences, Capital Normal University, Beijing, 100048, China
| | - Yaxuan Li
- College of Life Sciences, Capital Normal University, Beijing, 100048, China
| | - Yingkao Hu
- College of Life Sciences, Capital Normal University, Beijing, 100048, China
24
McCartney AM, Hyland EM, Cormican P, Moran RJ, Webb AE, Lee KD, Hernandez-Rodriguez J, Prado-Martinez J, Creevey CJ, Aspden JL, McInerney JO, Marques-Bonet T, O'Connell MJ. Gene Fusions Derived by Transcriptional Readthrough are Driven by Segmental Duplication in Human. Genome Biol Evol 2020; 11:2678-2690. [PMID: 31400206 PMCID: PMC6764479 DOI: 10.1093/gbe/evz163] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Accepted: 07/17/2019] [Indexed: 12/14/2022] Open
Abstract
Gene fusion occurs when two or more individual genes with independent open reading frames become juxtaposed under the same open reading frame, creating a new fused gene. A small number of gene fusions described in detail have been associated with novel functions, for example, the hominid-specific PIPSL gene, TNFSF12, and the TWE-PRIL gene family. We use Sequence Similarity Networks and species level comparisons of great ape genomes to identify 45 new genes that have emerged by transcriptional readthrough, that is, transcription-derived gene fusion. For 35 of these putative gene fusions, we have been able to assess available RNAseq data to determine whether there are reads that map to each breakpoint. A total of 29 of the putative gene fusions had annotated transcripts (9/29 of which are human-specific). We carried out RT-qPCR in a range of human tissues (placenta, lung, liver, brain, and testes) and found that 23 of the putative gene fusion events were expressed in at least one tissue. Examining the available ribosome foot-printing data, we find evidence for translation of three of the fused genes in human. Finally, we find enrichment for transcription-derived gene fusions in regions of known segmental duplication in human. Together, our results implicate chromosomal structural variation brought about by segmental duplication in the emergence of novel transcripts and translated protein products.
Affiliation(s)
- Ann M McCartney
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland.,Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
| | - Edel M Hyland
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland.,Institute for Global Food Security, Queens University Belfast, United Kingdom
| | - Paul Cormican
- Teagasc Animal and Bioscience Research Department, Animal & Grassland Research and Innovation Centre, Teagasc, Grange, Dunsany, County Meath, Ireland
| | - Raymond J Moran
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland.,Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
| | - Andrew E Webb
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland
| | - Kate D Lee
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland.,School of Biological Sciences, University of Auckland, New Zealand.,School of Fundamental Sciences, Massey University, New Zealand
| | | | - Javier Prado-Martinez
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain.,Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, United Kingdom
| | - Christopher J Creevey
- Institute for Global Food Security, Queens University Belfast, United Kingdom.,Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, United Kingdom
| | - Julie L Aspden
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom
| | - James O McInerney
- Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, M13 9PL, United Kingdom.,School of Life Sciences, Faculty of Medicine and Health Sciences, The University of Nottingham, NG7 2RD, United Kingdom
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain.,Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010, Barcelona, Spain.,NAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain.,Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallés, Barcelona, Spain
| | - Mary J O'Connell
- Bioinformatics and Molecular Evolution Group, School of Biotechnology, Dublin City University, Ireland.,Computational and Molecular Evolutionary Biology Group, School of Biology, Faculty of Biological Sciences, The University of Leeds, United Kingdom.,School of Life Sciences, Faculty of Medicine and Health Sciences, The University of Nottingham, NG7 2RD, United Kingdom
25
Arimoto A, Nishitsuji K, Higa Y, Arakaki N, Hisata K, Shinzato C, Satoh N, Shoguchi E. A siphonous macroalgal genome suggests convergent functions of homeobox genes in algae and land plants. DNA Res 2019; 26:183-192. [PMID: 30918953 PMCID: PMC6476727 DOI: 10.1093/dnares/dsz002] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Received: 12/13/2018] [Accepted: 02/15/2019] [Indexed: 11/13/2022] Open
Abstract
Genome evolution and development of unicellular, multinucleate macroalgae (siphonous algae) are poorly known, although various multicellular organisms have been studied extensively. To understand macroalgal developmental evolution, we assembled the ∼26 Mb genome of a siphonous green alga, Caulerpa lentillifera, with high contiguity, containing 9,311 protein-coding genes. Molecular phylogeny using 107 nuclear genes indicates that the diversification of the class Ulvophyceae, including C. lentillifera, occurred before the split of the Chlorophyceae and Trebouxiophyceae. Compared with other green algae, the TALE superclass of homeobox genes, which expanded in land plants, shows a series of lineage-specific duplications in this siphonous macroalga. Plant hormone signalling components were also expanded in a lineage-specific manner. Expanded transport regulators, which show spatially different expression, suggest that the structural patterning strategy of a multinucleate cell depends on diversification of nuclear pore proteins. These results not only imply functional convergence of duplicated genes among green plants, but also provide insight into evolutionary roots of green plants. Based on the present results, we propose cellular and molecular mechanisms involved in the structural differentiation in the siphonous alga.
Affiliation(s)
- Asuka Arimoto
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan
| | - Koki Nishitsuji
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan
| | - Yoshimi Higa
- Onna Village Fisheries Cooperative, Onna, Okinawa, Japan
| | - Nana Arakaki
- DNA Sequencing Section, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan
| | - Kanako Hisata
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan
| | - Chuya Shinzato
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan
| | - Noriyuki Satoh
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan
| | - Eiichi Shoguchi
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, Japan
26
Vollger MR, Dishuck PC, Sorensen M, Welch AE, Dang V, Dougherty ML, Graves-Lindsay TA, Wilson RK, Chaisson MJP, Eichler EE. Long-read sequence and assembly of segmental duplications. Nat Methods 2019; 16:88-94. [PMID: 30559433 PMCID: PMC6382464 DOI: 10.1038/s41592-018-0236-3] [Citation(s) in RCA: 81] [Impact Index Per Article: 16.2] [Received: 06/01/2018] [Accepted: 10/30/2018] [Indexed: 01/22/2023]
Abstract
We have developed a computational method based on polyploid phasing of long sequence reads to resolve collapsed regions of segmental duplications within genome assemblies. Segmental Duplication Assembler (SDA; https://github.com/mvollger/SDA) constructs graphs in which paralogous sequence variants define the nodes and long-read sequences provide attraction and repulsion edges, enabling the partition and assembly of long reads corresponding to distinct paralogs. We apply it to single-molecule, real-time sequence data from three human genomes and recover 33-79 megabase pairs (Mb) of duplications in which approximately half of the loci are diverged (<99.8%) compared to the reference genome. We show that the corresponding sequence is highly accurate (>99.9%) and that the diverged sequence corresponds to copy-number-variable paralogs that are absent from the human reference genome. Our method can be applied to other complex genomes to resolve the last gene-rich gaps, improve duplicate gene annotation, and better understand copy-number-variant genetic diversity at the base-pair level.
Affiliation(s)
- Mitchell R Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Philip C Dishuck
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Melanie Sorensen
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - AnneMarie E Welch
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Vy Dang
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Max L Dougherty
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Tina A Graves-Lindsay
- The McDonnell Genome Institute at Washington University, Washington University School of Medicine, St. Louis, MO, USA
| | - Richard K Wilson
- Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
| | | | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
27
Zhao Y, Guo T, Fiksinski A, Breetvelt E, McDonald-McGinn DM, Crowley TB, Diacou A, Schneider M, Eliez S, Swillen A, Breckpot J, Vermeesch J, Chow EWC, Gothelf D, Duijff S, Evers R, van Amelsvoort TA, van den Bree M, Owen M, Niarchou M, Bearden CE, Ornstein C, Pontillo M, Buzzanca A, Vicari S, Armando M, Murphy KC, Murphy C, Garcia-Minaur S, Philip N, Campbell L, Morey-Cañellas J, Raventos J, Rosell J, Heine-Suner D, Shprintzen RJ, Gur RE, Zackai E, Emanuel BS, Wang T, Kates WR, Bassett AS, Vorstman JAS, Morrow BE. Variance of IQ is partially dependent on deletion type among 1,427 22q11.2 deletion syndrome subjects. Am J Med Genet A 2018; 176:2172-2181. [PMID: 30289625 PMCID: PMC6209529 DOI: 10.1002/ajmg.a.40359] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Received: 02/27/2018] [Revised: 05/02/2018] [Accepted: 05/23/2018] [Indexed: 12/28/2022]
Abstract
The 22q11.2 deletion syndrome is caused by non-allelic homologous recombination events during meiosis between low copy repeats (LCR22) termed A, B, C, and D. Most patients have a typical LCR22A-D (AD) deletion of 3 million base pairs (Mb). In this report, we evaluated IQ scores in 1,478 subjects with 22q11.2DS. The mean full scale IQ, verbal IQ, and performance IQ scores in our cohort were 72.41 (standard deviation [SD] of 13.72), 75.91 (SD of 14.46), and 73.01 (SD of 13.71), respectively. To investigate whether IQ scores are associated with deletion size, we examined individuals with the 3 Mb, AD (n = 1,353) and nested 1.5 Mb, AB (n = 74) deletions, since they comprised the largest subgroups. We found that full scale IQ was decreased by 6.25 points (p = .002), verbal IQ was decreased by 8.17 points (p = .0002) and performance IQ was decreased by 4.03 points (p = .028) in subjects with the AD versus AB deletion. Thus, individuals with the smaller, 1.5 Mb AB deletion have modestly higher IQ scores than those with the larger, 3 Mb AD deletion. Overall, the deletion of genes in the AB region largely explains the observed low IQ in the 22q11.2DS population. However, our results also indicate that haploinsufficiency of genes in the LCR22B-D region (BD) exerts an additional negative impact on IQ. Furthermore, we did not find evidence of a confounding effect of severe congenital heart disease on IQ scores in our cohort.
Affiliation(s)
- Yingjie Zhao
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Tingwei Guo
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Ania Fiksinski
- Department of Psychiatry, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, the Netherlands
- Center for Addiction and Mental Health and the University of Toronto, Toronto, Canada
| | - Elemi Breetvelt
- Center for Addiction and Mental Health and the University of Toronto, Toronto, Canada
| | - Donna M. McDonald-McGinn
- Division of Human Genetics, Children’s Hospital of Philadelphia and Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
| | - Terrence B. Crowley
- Division of Human Genetics, Children’s Hospital of Philadelphia and Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
- Alexander Diacou
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, USA
- Maude Schneider
- Developmental Imaging and Psychopathology Lab, University of Geneva School of Medicine, Geneva, Switzerland
- Stephan Eliez
- Developmental Imaging and Psychopathology Lab, University of Geneva School of Medicine, Geneva, Switzerland
- Ann Swillen
- Center for Human Genetics, Katholieke Universiteit Leuven (KU Leuven), Leuven, Belgium
- Jeroen Breckpot
- Center for Human Genetics, Katholieke Universiteit Leuven (KU Leuven), Leuven, Belgium
- Joris Vermeesch
- Center for Human Genetics, Katholieke Universiteit Leuven (KU Leuven), Leuven, Belgium
- Eva W. C. Chow
- Center for Addiction and Mental Health and the University of Toronto, Toronto, Canada
- Doron Gothelf
- Sackler Faculty of Medicine and Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel
- The Child Psychiatry Division, Edmond and Lily Safra Children’s Hospital, Sheba Medical Center, Ramat Gan, Israel
- Sasja Duijff
- Department of Psychiatry, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, the Netherlands
- Rens Evers
- Department of Psychiatry and Psychology, Maastricht University, Maastricht, The Netherlands
- Marianne van den Bree
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neuroscience, Cardiff University, Cardiff, Wales
- Michael Owen
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neuroscience, Cardiff University, Cardiff, Wales
- Maria Niarchou
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neuroscience, Cardiff University, Cardiff, Wales
- Carrie E. Bearden
- Department of Psychiatry and Biobehavioral Sciences, Semel Institute for Neuroscience and Human Behavior, University of California, Los Angeles, Los Angeles, CA, USA
- Claudia Ornstein
- Department of Psychiatry, Hospital Clinico Universidad de Chile, Santiago, Chile
- Maria Pontillo
- Child and Adolescence Neuropsychiatry Unit, Department of Neuroscience, Children Hospital Bambino Gesu, Rome, Italy
- Antonino Buzzanca
- Department of Human Neuroscience, University Sapienza of Rome, Rome, Italy
- Stefano Vicari
- Child and Adolescence Neuropsychiatry Unit, Department of Neuroscience, Children Hospital Bambino Gesu, Rome, Italy
- Marco Armando
- Developmental Imaging and Psychopathology Lab, University of Geneva School of Medicine, Geneva, Switzerland
- Child and Adolescence Neuropsychiatry Unit, Department of Neuroscience, Children Hospital Bambino Gesu, Rome, Italy
- Kieran C. Murphy
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland
- Clodagh Murphy
- Department of Psychiatry, King’s College London, London, England
- Sixto Garcia-Minaur
- Section of Clinical Genetics and Dismorphology, Instituto de Genética Médica y Molecular, INGEMM, Hospital Universitario La Paz, Madrid, Spain
- Nicole Philip
- Department of Medical Genetics, APHM, MMG, INSERM, Aix-Marseille University, Marseille, France
- Linda Campbell
- School of Psychology, University of Newcastle, Newcastle, Australia
- Jordi Rosell
- Section of Genetics, Hospital Son Espases, Palma, Spain
- Robert J. Shprintzen
- The Virtual Center for Velo-Cardio-Facial Syndrome and Related Disorders, Syracuse, NY, USA
- Raquel E. Gur
- Department of Psychiatry and the Lifespan Brain Institute, Perelman School of Medicine and Children’s Hospital of Philadelphia, University of Pennsylvania, Philadelphia, USA
- Elaine Zackai
- Division of Human Genetics, Children’s Hospital of Philadelphia and Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
- Beverly S. Emanuel
- Division of Human Genetics, Children’s Hospital of Philadelphia and Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
- Tao Wang
- Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
- Wendy R. Kates
- Department of Psychiatry and Behavioral Sciences, and Program in Neuroscience, SUNY Upstate Medical University, Syracuse, USA
- Anne S. Bassett
- Center for Addiction and Mental Health and the University of Toronto, Toronto, Canada
- The Dalglish 22q Clinic for Adults, Toronto General Hospital, University Health Network, Toronto, Canada
- Bernice E. Morrow
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, USA
28
Magwanga RO, Lu P, Kirungu JN, Cai X, Zhou Z, Wang X, Diouf L, Xu Y, Hou Y, Hu Y, Dong Q, Wang K, Liu F. Whole Genome Analysis of Cyclin Dependent Kinase (CDK) Gene Family in Cotton and Functional Evaluation of the Role of CDKF4 Gene in Drought and Salt Stress Tolerance in Plants. Int J Mol Sci 2018; 19:ijms19092625. [PMID: 30189594 PMCID: PMC6164816 DOI: 10.3390/ijms19092625] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Received: 08/18/2018] [Revised: 08/24/2018] [Accepted: 08/29/2018] [Indexed: 12/12/2022]
Abstract
Cotton (Gossypium spp.) is the number one crop cultivated for fiber production and the cornerstone of the textile industry. Drought and salt stress are the major abiotic stresses, which can have a huge economic impact on cotton production; this has been aggravated by continued climate change and compounded by pollution. Survival strategies evolved by plants include the induction of various stress-responsive genes, such as cyclin-dependent kinases (CDKs). In this study, we performed a whole-genome identification and analysis of the CDK gene family in cotton. We identified 31, 12, and 15 CDK genes in G. hirsutum, G. arboreum, and G. raimondii, respectively, and they were classified into 6 groups. CDK genes were distributed in 15, 10, and 9 linkage groups of the AD, D, and A genomes, respectively. Evolutionary analysis revealed that segmental gene duplication was the primary force underlying CDK gene expansion. RNA sequencing and RT-qPCR validation revealed that Gh_D12G2017 (CDKF4) was strongly induced by drought and salt stresses. Transient expression of the Gh_D12G2017-GFP fusion protein in protoplasts showed that Gh_D12G2017 was localized in the nucleus. Under drought and salt stress conditions, the transgenic Arabidopsis lines exhibited higher concentrations of the antioxidant enzymes measured, including peroxidase (POD), superoxide dismutase (SOD), and catalase (CAT), with very low levels of oxidants. Moreover, cell membrane stability (CMS), excised leaf water loss (ELWL), saturated leaf weight (SLW), and chlorophyll content measurements showed that the transgenic Arabidopsis lines were highly tolerant to either of the stress factors compared to their wild types. In addition, the expression of the stress-related genes was significantly up-regulated in Gh_D12G2017 (CDKF4) transgenic Arabidopsis plants under drought and salt conditions.
We infer that CDKF-4s and CDKG-2s might be the primary regulators of salt and drought responses in cotton.
Affiliation(s)
- Richard Odongo Magwanga
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- School of Biological and Physical sciences (SBPS), Main campus, Jaramogi Oginga Odinga University of Science and Technology (JOOUST), P.O Box 210-40601, Bondo, Kenya.
- Pu Lu
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Joy Nyangasi Kirungu
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Xiaoyan Cai
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Zhongli Zhou
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Xingxing Wang
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Latyr Diouf
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Yanchao Xu
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Yuqing Hou
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Yangguang Hu
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Qi Dong
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Kunbo Wang
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
- Fang Liu
- Research Base in Anyang Institute of Technology, State Key Laboratory of Cotton Biology/Institute of Cotton Research, Chinese Academy of Agricultural Science (ICR, CAAS), Anyang 455000, China.
29
Abstract
It is currently impossible to get complete de-novo assembly of segmentally duplicated genome regions using genome-wide short-read datasets. Here, we devise a new computational method called Regional Extension of Assemblies Using Linked-Reads (REXTAL) for improved region-specific assembly of segmental duplication-containing DNA, leveraging genomic short-read datasets generated from large DNA molecules partitioned and barcoded using the "Gel Bead in Emulsion" (GEM) microfluidic method (Zheng et al., 2016). We show that using REXTAL, it is possible to extend assembly of single-copy diploid DNA into adjacent, otherwise inaccessible subtelomere segmental duplication regions and other subtelomeric gap regions. Moreover, REXTAL is computationally more efficient for the directed assembly of such regions from multiple genomes (e.g., for the comparison of structural variation) than genome-wide assembly approaches.
Affiliation(s)
- Tunazzina Islam
- Department of Computer Science, Old Dominion University, Norfolk, VA
- Desh Ranjan
- Department of Computer Science, Old Dominion University, Norfolk, VA
- Eleanor Young
- School of Biomedical Engineering, Drexel University, Philadelphia, PA
- Ming Xiao
- School of Biomedical Engineering, Drexel University, Philadelphia, PA
- Institute of Molecular Medicine and Infectious Disease, School of Medicine, Drexel University, Philadelphia, PA
- Mohammad Zubair
- Department of Computer Science, Old Dominion University, Norfolk, VA
- Harold Riethman
- School of Medical Diagnostic & Translational Sciences, Old Dominion University, Norfolk, VA
30
Kouprina N, Liskovykh M, Lee NCO, Noskov VN, Waterfall JJ, Walker RL, Meltzer PS, Topol EJ, Larionov V. Analysis of the 9p21.3 sequence associated with coronary artery disease reveals a tendency for duplication in a CAD patient. Oncotarget 2018; 9:15275-15291. [PMID: 29632643 PMCID: PMC5880603 DOI: 10.18632/oncotarget.24567] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 11/29/2017] [Accepted: 02/10/2018] [Indexed: 11/25/2022]
Abstract
Tandem segmental duplications (SDs) greater than 10 kb are widespread in complex genomes. They provide material for gene divergence and evolutionary adaptation, while formation of specific de novo SDs is a hallmark of cancer and some human diseases. Most SDs map to distinct genomic regions termed ‘duplication blocks’. SD organization within these blocks is often poorly characterized, as they are mosaics of ancestral duplicons juxtaposed with younger duplicons arising from more recent duplication events. Structural and functional analysis of SDs is further hampered because long repetitive DNA structures are underrepresented in existing BAC and YAC libraries. We applied Transformation-Associated Recombination (TAR) cloning, a versatile technique for large DNA manipulation, to selectively isolate the coronary artery disease (CAD) interval sequence within the 9p21.3 chromosome locus from a patient with coronary artery disease and from normal individuals. Four tandem head-to-tail duplicons, each ∼50 kb long, were recovered in the patient but not in normal individuals. Sequence analysis revealed that the repeats differed from each other by 10-15 SNPs and from the human genome sequence (version hg19) by 82 SNPs. SNP polymorphism within the junctions between repeats allowed two junction types to be distinguished, Type 1 and Type 2, which were found at a 2:1 ratio. The junction sequences contained an Alu element, a sequence previously shown to play a role in duplication. Knowledge of structural variation in the CAD interval from more patients could help link this locus to cardiovascular disease susceptibility, and may be relevant to other cases of regional amplification, including cancer.
Affiliation(s)
- Natalay Kouprina
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
- Mikhail Liskovykh
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
- Nicholas C O Lee
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
- Vladimir N Noskov
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
- Robert L Walker
- Genetics Branch, National Cancer Institute, Bethesda, MD 20892, USA
- Paul S Meltzer
- Genetics Branch, National Cancer Institute, Bethesda, MD 20892, USA
- Eric J Topol
- The Scripps Translational Science Institute, The Scripps Research Institute and Scripps Health, La Jolla, CA 92037, USA
- Vladimir Larionov
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
31
Yang Z, Gong Q, Wang L, Jin Y, Xi J, Li Z, Qin W, Yang Z, Lu L, Chen Q, Li F. Genome-Wide Study of YABBY Genes in Upland Cotton and Their Expression Patterns under Different Stresses. Front Genet 2018; 9:33. [PMID: 29467795 PMCID: PMC5808293 DOI: 10.3389/fgene.2018.00033] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Received: 08/26/2017] [Accepted: 01/25/2018] [Indexed: 11/13/2022]
Abstract
Members of the YABBY gene family, a small plant-specific family of genes, have been proposed to function in specifying abaxial cell fate. Although to date little has been learned about cotton YABBY genes, completion of the cotton genome enables a comprehensive genome-wide analysis of YABBY genes in cotton. Here, a total of 12, 12, and 23 YABBY genes were identified in Gossypium arboreum (2n = 26, A2), G. raimondii (2n = 26, D5), and G. hirsutum (2n = 4x = 52, [AD]t), respectively. Sequence analysis showed that the N-terminal zinc-finger and C-terminal YABBY domains in YABBY proteins are highly conserved among cotton, Arabidopsis, and rice. Eighty-five genes from eight sequenced species naturally clustered into five groups, and the YAB2-like group could be divided into three sub-groups, indicating that YABBYs are highly conserved among the examined species. Orthologs from the At and Dt sub-genomes (where “t” indicates tetraploid) showed good collinearity, indicating that YABBY loci are highly conserved between these two sub-genomes. Whole-genome duplication was the primary cause of upland cotton YABBY gene expansion, segmental duplication played important roles in YABBY gene expansion within the At and Dt sub-genomes, and the YAB5-like group was mainly generated by segmental duplication. The long-terminal repeat retroelements Copia and Gypsy were identified as major transposable elements accompanying the appearance of duplicated YABBY genes, suggesting that transposable element expansion might be involved in gene duplication. Selection pressure analyses using PAML revealed that relaxed purifying selection might be the main impetus during evolution of YABBY genes in the examined species. Furthermore, exon/intron pattern and motif analyses indicated that genes within the same group were significantly conserved between Arabidopsis and cotton. 
In addition, the expression patterns in different tissues suggest that YABBY proteins may play roles in ovule development because YABBYs are highly expressed in ovules. The expression pattern of YABBY genes showed that approximately half of the YABBYs were down-regulated under different stress treatments. Collectively, our results represent a comprehensive genome-wide study of the YABBY gene family, which should be helpful in further detailed studies on the gene function and evolution of YABBY genes in cotton.
Affiliation(s)
- Zhaoen Yang
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China; Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Qian Gong
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Lingling Wang
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Yuying Jin
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Jianping Xi
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Zhi Li
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Wenqiang Qin
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China; Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Zuoren Yang
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Lili Lu
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- Quanjia Chen
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China
- Fuguang Li
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China; Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
32
Chen NWG, Thareau V, Ribeiro T, Magdelenat G, Ashfield T, Innes RW, Pedrosa-Harand A, Geffroy V. Common Bean Subtelomeres Are Hot Spots of Recombination and Favor Resistance Gene Evolution. Front Plant Sci 2018; 9:1185. [PMID: 30154814 PMCID: PMC6102362 DOI: 10.3389/fpls.2018.01185] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Received: 04/25/2018] [Accepted: 07/24/2018] [Indexed: 05/10/2023]
Abstract
Subtelomeres of most eukaryotes contain fast-evolving genes usually involved in adaptive processes. In common bean (Phaseolus vulgaris), the Co-2 anthracnose resistance (R) locus corresponds to a cluster of nucleotide-binding-site leucine-rich-repeat (NL) encoding sequences, the prevalent class of plant R genes. To study the recent evolution of this R gene cluster, we used a combination of sequence, genetic and cytogenetic comparative analyses between common bean genotypes from two distinct gene pools (Andean and Mesoamerican) that diverged 0.165 million years ago. Co-2 is a large subtelomeric cluster on chromosome 11 comprising from 32 (Mesoamerican) to 52 (Andean) NL sequences embedded within khipu satellite repeats. Since the recent split between Andean and Mesoamerican gene pools, the Co-2 cluster has experienced numerous gene-pool specific NL losses, leading to distinct NL repertoires. The high proportion of solo-LTR retrotransposons indicates that the Co-2 cluster is located in a hot spot of unequal intra-strand homologous recombination. Furthermore, we observe large segmental duplications involving both Non-Homologous End Joining and Homologous Recombination double-strand break repair pathways. Finally, the identification of a Mesoamerican-specific subtelomeric sequence reveals frequent interchromosomal recombinations between common bean subtelomeres. Altogether, our results highlight that common bean subtelomeres are hot spots of recombination and favor the rapid evolution of R genes. We propose that chromosome ends could act as R gene incubators in many plant genomes.
Affiliation(s)
- Nicolas W. G. Chen
- Institute of Plant Sciences Paris-Saclay (IPS2), UMR 9213/UMR1403, CNRS, INRA, Université Paris-Sud, Université d’Evry, Université Paris-Diderot Sorbonne Paris Cité, Orsay, France
- IRHS, INRA, AGROCAMPUS OUEST, Université d’Angers, SFR 4207 QUASAV, Beaucouzé, France
- Vincent Thareau
- Institute of Plant Sciences Paris-Saclay (IPS2), UMR 9213/UMR1403, CNRS, INRA, Université Paris-Sud, Université d’Evry, Université Paris-Diderot Sorbonne Paris Cité, Orsay, France
- Tiago Ribeiro
- Laboratory of Plant Cytogenetics, Federal University of Pernambuco, Recife, Brazil
- Ghislaine Magdelenat
- Genoscope/Commissariat à l’Energie Atomique-Centre National de Séquençage, Evry, France
- Tom Ashfield
- Department of Biology, Indiana University, Bloomington, IN, United States
- Roger W. Innes
- Department of Biology, Indiana University, Bloomington, IN, United States
- Valérie Geffroy
- Institute of Plant Sciences Paris-Saclay (IPS2), UMR 9213/UMR1403, CNRS, INRA, Université Paris-Sud, Université d’Evry, Université Paris-Diderot Sorbonne Paris Cité, Orsay, France
- *Correspondence: Valérie Geffroy
33
Dallery JF, Lapalu N, Zampounis A, Pigné S, Luyten I, Amselem J, Wittenberg AHJ, Zhou S, de Queiroz MV, Robin GP, Auger A, Hainaut M, Henrissat B, Kim KT, Lee YH, Lespinet O, Schwartz DC, Thon MR, O'Connell RJ. Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clusters. BMC Genomics 2017; 18:667. [PMID: 28851275 PMCID: PMC5576322 DOI: 10.1186/s12864-017-4083-x] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Received: 05/24/2017] [Accepted: 08/21/2017] [Indexed: 11/11/2022]
Abstract
Background: The ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. Results: Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly-conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. Conclusion: The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.
Affiliation(s)
- Jean-Félix Dallery
- UMR BIOGER, INRA, AgroParisTech, Université Paris-Saclay, Thiverval-Grignon, France
- Nicolas Lapalu
- UMR BIOGER, INRA, AgroParisTech, Université Paris-Saclay, Thiverval-Grignon, France
- Antonios Zampounis
- UMR BIOGER, INRA, AgroParisTech, Université Paris-Saclay, Thiverval-Grignon, France; Present Address: Department of Deciduous Fruit Trees, Institute of Plant Breeding and Plant Genetic Resources, Hellenic Agricultural Organization 'Demeter', Naoussa, Greece
- Sandrine Pigné
- UMR BIOGER, INRA, AgroParisTech, Université Paris-Saclay, Thiverval-Grignon, France
- Shiguo Zhou
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, USA
- Marisa V de Queiroz
- Laboratório de Genética Molecular de Fungos, Universidade Federal de Viçosa, Viçosa, Brazil
- Guillaume P Robin
- UMR BIOGER, INRA, AgroParisTech, Université Paris-Saclay, Thiverval-Grignon, France
- Annie Auger
- UMR BIOGER, INRA, AgroParisTech, Université Paris-Saclay, Thiverval-Grignon, France
- Matthieu Hainaut
- CNRS UMR 7257, Aix-Marseille University, Marseille, France; INRA, USC 1408 AFMB, Marseille, France
- Bernard Henrissat
- CNRS UMR 7257, Aix-Marseille University, Marseille, France; INRA, USC 1408 AFMB, Marseille, France; Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
- Ki-Tae Kim
- Department of Agricultural Biotechnology, Center for Fungal Genetic Resources, Seoul National University, Seoul, Korea
- Yong-Hwan Lee
- Department of Agricultural Biotechnology, Center for Fungal Genetic Resources, Seoul National University, Seoul, Korea
- Olivier Lespinet
- Laboratoire de Recherche en Informatique, CNRS, Université Paris-Sud, Orsay, France; Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Sud, Orsay, France
- David C Schwartz
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, USA
- Michael R Thon
- Instituto Hispano-Luso de Investigaciones Agrarias (CIALE), Department of Microbiology and Genetics, University of Salamanca, Salamanca, Spain
- Richard J O'Connell
- UMR BIOGER, INRA, AgroParisTech, Université Paris-Saclay, Thiverval-Grignon, France.
34
Abstract
Deciphering the genetic basis of human disease requires a comprehensive knowledge of genetic variants irrespective of their class or frequency. Although an impressive number of human genetic variants have been catalogued, a large fraction of the genetic difference that distinguishes two human genomes is still not understood at the base-pair level. This is because the emphasis has been on single-nucleotide variation as opposed to less tractable and more complex genetic variants, including indels and structural variants. The latter, we propose, will have a large impact on human phenotypes but require a more systematic assessment of genomes at deeper coverage and alternate sequencing and mapping technologies.
35
Muthuswamy S, Agarwal S. Segmental Duplication QF-PCR: A Simple and Alternative Method of Rapid Aneuploidy Testing for Developing Country Like India. J Clin Lab Anal 2016; 31. [PMID: 27580119 DOI: 10.1002/jcla.22038] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 05/03/2016] [Accepted: 07/09/2016] [Indexed: 11/12/2022]
Abstract
BACKGROUND: Aneuploidy screening is becoming an integral part of routine prenatal screening in developing countries like India, and cheaper and more rapid aneuploidy testing methods are needed to relieve the anxiety and financial burden among high-risk couples. Segmental duplication quantitative fluorescent polymerase chain reaction (SD-QF-PCR) has emerged as an alternative aneuploidy diagnostic method. METHODS: This study was conducted to optimize and assess the utility of SD-QF-PCR in routine prenatal diagnosis to complement existing short tandem repeat (STR)-based QF-PCR. About 50 control samples, 50 Down's syndrome samples, and one each of trisomy 18 and Klinefelter samples were studied to optimize the assay. Later, 100 amniotic fluid samples were also studied. RESULTS AND CONCLUSION: The assay successfully identified normal and aneuploid samples with 100% sensitivity and specificity. The results of amniotic fluid analysis by SD-QF-PCR were in agreement with the results of STR-QF-PCR. The observed results qualify SD-QF-PCR as a preliminary aneuploidy diagnosis method.
Affiliation(s)
- Srinivasan Muthuswamy
- Department of Medical Genetics, Sanjay Gandhi Postgraduate Institute of Medical Sciences, Lucknow, India
- Sarita Agarwal
- Department of Medical Genetics, Sanjay Gandhi Postgraduate Institute of Medical Sciences, Lucknow, India
36
Abstract
Genomes of the plant-pathogenic genus Phytophthora are characterized by small duplicated blocks consisting of two consecutive genes (2HOM blocks) and by an elevated abundance of similarly aged gene duplicates. Both properties, in particular the presence of 2HOM blocks, have been attributed to a whole-genome duplication (WGD) at the last common ancestor of Phytophthora. However, large intraspecies synteny—compelling evidence for a WGD—has not been detected. Here, we revisited the WGD hypothesis by deducing the age of 2HOM blocks. Two independent timing methods reveal that the majority of 2HOM blocks arose after divergence of the Phytophthora lineages. In addition, a large proportion of the 2HOM block copies colocalize on the same scaffold. Therefore, the presence of 2HOM blocks does not support a WGD at the last common ancestor of Phytophthora. Thus, genome evolution of Phytophthora is likely driven by alternative mechanisms, such as bursts of transposon activity.
Affiliation(s)
- Jolien J E van Hooff
- Theoretical Biology and Bioinformatics, Department of Biology, Utrecht University, The Netherlands
|
37
|
Tong H, Jin Y, Xu Y, Zou B, Ye H, Wu H, Kumar S, Pitman JL, Zhou G, Song Q. Prenatal diagnosis of trisomy 21, 18 and 13 by quantitative pyrosequencing of segmental duplications. Clin Genet 2016; 90:451-455. [PMID: 26948280 DOI: 10.1111/cge.12772] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Received: 11/11/2015] [Revised: 03/02/2016] [Accepted: 03/02/2016] [Indexed: 01/22/2023]
Abstract
Chromosomal aberrations mostly occur in chromosomes 21, 18 and 13, with an incidence of approximately 1 in 160 live births in humans, making prenatal diagnosis necessary in clinics. Current methods have drawbacks: they are time-consuming, costly and complicated to operate, and have low sensitivity. In this paper, a novel method for rapid and accurate prenatal diagnosis of aneuploidy is proposed based on pyrosequencing, which quantitatively detects the peak height ratio (PHR) of different bases of a segmental duplication. A direct polymerase chain reaction (PCR) approach was undertaken, in which a small volume of amniotic fluid was used as the starting material without DNA extraction. Single-stranded DNA was prepared from PCR products and subsequently analyzed using pyrosequencing. PHRs between target and reference chromosomes of 2:2 for a euploid fetus and 3:2 for a trisomic fetus were used as reference. The reference intervals and z scores were calculated for discrimination of aneuploidy. A total of 132 samples were collected, including trisomy 21 (n = 11), trisomy 18 (n = 3), trisomy 13 (n = 2), and unaffected controls (n = 116). A set of six segmental duplications was chosen for analysis. The results were consistent with karyotyping analysis, giving a correct diagnosis with 100% sensitivity and 99.9% specificity.
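The z-score step mentioned in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the control values, and the cutoff of 3 are all assumptions chosen for illustration. The only idea taken from the abstract is that a trisomic target chromosome should shift the target-to-reference peak height ratio towards 3:2 (about 1.5), while unaffected samples cluster around equal dosage (about 1.0).

```python
from statistics import mean, stdev


def phr_z_score(sample_phr, control_phrs):
    """Z score of one sample's peak height ratio against euploid controls."""
    mu = mean(control_phrs)
    sigma = stdev(control_phrs)  # sample standard deviation of the controls
    return (sample_phr - mu) / sigma


def classify(sample_phr, control_phrs, z_cutoff=3.0):
    """Flag a sample whose PHR lies far above the control distribution.

    z_cutoff is a hypothetical threshold, not a value from the paper.
    """
    z = phr_z_score(sample_phr, control_phrs)
    return "suspected trisomy" if z > z_cutoff else "euploid range"


# Made-up control ratios centred on 1.0 (equal dosage of target and reference).
controls = [0.98, 1.02, 1.00, 0.97, 1.03, 1.01, 0.99]

print(classify(1.50, controls))  # ratio near 3:2
print(classify(1.01, controls))  # ratio near equal dosage
```

In practice the reference interval would be derived from a much larger control set, and one z score would be computed per segmental duplication marker rather than per sample.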
Affiliation(s)
- H Tong
- Key Laboratory of Drug Quality Control and Pharmacovigilance of Ministry of Education, China Pharmaceutical University, Nanjing, China
- Y Jin
- Key Laboratory of Drug Quality Control and Pharmacovigilance of Ministry of Education, China Pharmaceutical University, Nanjing, China
- Y Xu
- Key Laboratory of Drug Quality Control and Pharmacovigilance of Ministry of Education, China Pharmaceutical University, Nanjing, China
- B Zou
- Department of Pharmacology, Jinling Hospital, Medical School of Nanjing University, Nanjing, China
- H Ye
- Department of Pharmacology, Jinling Hospital, Medical School of Nanjing University, Nanjing, China
- H Wu
- Department of Pharmacology, Jinling Hospital, Medical School of Nanjing University, Nanjing, China
- S Kumar
- School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
- J L Pitman
- School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
- G Zhou
- Key Laboratory of Drug Quality Control and Pharmacovigilance of Ministry of Education, China Pharmaceutical University, Nanjing, China; Department of Pharmacology, Jinling Hospital, Medical School of Nanjing University, Nanjing, China
- Q Song
- Key Laboratory of Drug Quality Control and Pharmacovigilance of Ministry of Education, China Pharmaceutical University, Nanjing, China; Department of Pharmacology, Jinling Hospital, Medical School of Nanjing University, Nanjing, China
|
38
|
Chen J, Huddleston J, Buckley RM, Malig M, Lawhon SD, Skow LC, Lee MO, Eichler EE, Andersson L, Womack JE. Bovine NK-lysin: Copy number variation and functional diversification. Proc Natl Acad Sci U S A 2015; 112:E7223-9. [PMID: 26668394 DOI: 10.1073/pnas.1519374113] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Indexed: 11/18/2022] Open
Abstract
NK-lysin is an antimicrobial peptide and effector protein in the host innate immune system. It is coded by a single gene in humans and most other mammalian species. In this study, we provide evidence for the existence of four NK-lysin genes in a repetitive region on cattle chromosome 11. The NK2A, NK2B, and NK2C genes are tandemly arrayed as three copies in ∼30-35-kb segments, located 41.8 kb upstream of NK1. All four genes are functional, albeit with differential tissue expression. NK1, NK2A, and NK2B exhibited the highest expression in intestine Peyer's patch, whereas NK2C was expressed almost exclusively in lung. The four peptide products were synthesized ex vivo, and their antimicrobial effects against both Gram-positive and Gram-negative bacteria were confirmed with a bacteria-killing assay. Transmission electron microscopy indicated that bovine NK-lysins exhibited their antimicrobial activities by lytic action in the cell membranes. In summary, the single NK-lysin gene in other mammals has expanded to a four-member gene family by tandem duplications in cattle; all four genes are transcribed, and the synthetic peptides corresponding to the core regions are biologically active and likely contribute to innate immunity in ruminants.
|
39
|
Wang L, Wu N, Zhu Y, Song W, Zhao X, Li Y, Hu Y. The divergence and positive selection of the plant-specific BURP-containing protein family. Ecol Evol 2015; 5:5394-5412. [PMID: 30151141 PMCID: PMC6102523 DOI: 10.1002/ece3.1792] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Received: 07/13/2015] [Revised: 09/13/2015] [Accepted: 09/17/2015] [Indexed: 11/21/2022] Open
Abstract
BURP domain-containing proteins belong to a plant-specific protein family and have diverse roles in plant development and stress responses. However, our understanding of the genetic divergence patterns and evolutionary rates of these proteins remains inadequate. In this study, 15 plant genomes were explored to elucidate the genetic origins, divergence, and functions of these proteins. One hundred and twenty-five BURP protein-encoding genes were identified from four main plant lineages, including 13 higher plant species. The absence of BURP family genes in unicellular and multicellular algae suggests that this family (1) appeared when plants shifted from relatively stable aquatic environments to land, where conditions are more variable and stressful, and (2) is critical in the adaptation of plants to adverse environments. Promoter analysis revealed that several responsive elements to plant hormones and external environment stresses are concentrated in the promoter region of BURP protein-encoding genes. This finding confirms that these genes influence plant stress responses. Several segmentally and tandem-duplicated gene pairs were identified from eight plant species. Thus, in general, BURP domain-containing genes have been subject to strong positive selection, even though these genes have conformed to different expansion models in different species. Our study also detected certain critical amino acid sites that may have contributed to functional divergence among groups or subgroups. Unexpectedly, all of the critical amino acid residues of functional divergence and positive selection were exclusively located in the C-terminal region of the BURP domain. In conclusion, our results contribute novel insights into the genetic divergence patterns and evolutionary rates of BURP proteins.
Affiliation(s)
- Lihui Wang
- College of Life Sciences, Capital Normal University, Beijing 100048, China
- Ningning Wu
- College of Life Sciences, Capital Normal University, Beijing 100048, China
- Yan Zhu
- College of Life Sciences, Capital Normal University, Beijing 100048, China
- Wanlu Song
- College of Life Sciences, Capital Normal University, Beijing 100048, China
- Xin Zhao
- College of Life Sciences, Capital Normal University, Beijing 100048, China
- Yaxuan Li
- College of Life Sciences, Capital Normal University, Beijing 100048, China
- Yingkao Hu
- College of Life Sciences, Capital Normal University, Beijing 100048, China
|
40
|
Wang D, Yu C, Zuo T, Zhang J, Weber DF, Peterson T. Alternative Transposition Generates New Chimeric Genes and Segmental Duplications at the Maize p1 Locus. Genetics 2015; 201:925-35. [PMID: 26434719 DOI: 10.1534/genetics.115.178210] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 05/15/2015] [Accepted: 09/06/2015] [Indexed: 02/04/2023] Open
Abstract
The maize Ac/Ds transposon family was the first transposable element system identified and characterized by Barbara McClintock. Ac/Ds transposons belong to the hAT family of class II DNA transposons. We and others have shown that Ac/Ds elements can undergo a process of alternative transposition in which the Ac/Ds transposase acts on the termini of two separate, nearby transposons. Because these termini are present in different elements, alternative transposition can generate a variety of genome alterations such as inversions, duplications, deletions, and translocations. Moreover, Ac/Ds elements transpose preferentially into genic regions, suggesting that structural changes arising from alternative transposition may potentially generate chimeric genes at the rearrangement breakpoints. Here we identified and characterized 11 independent cases of gene fusion induced by Ac alternative transposition. In each case, a functional chimeric gene was created by fusion of two linked, paralogous genes; moreover, each event was associated with duplication of the ∼70-kb segment located between the two paralogs. An extant gene in the maize B73 genome that contains an internal duplication apparently generated by an alternative transposition event was also identified. Our study demonstrates that alternative transposition-induced duplications may be a source for spontaneous creation of diverse genome structures and novel genes in maize.
|
41
|
Daga A, Ansari A, Rawal R, Umrania V. Characterization of chromosomal translocation breakpoint sequences in solid tumours: "an in silico analysis". Open Med Inform J 2015; 9:1-8. [PMID: 25972994 PMCID: PMC4421838 DOI: 10.2174/1874431101509010001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 12/15/2014] [Revised: 02/19/2015] [Accepted: 02/28/2015] [Indexed: 01/07/2023] Open
Abstract
Chromosomal translocations that result in the formation and activation of fusion oncogenes have long been observed in numerous solid malignancies. Expression of fusion kinases in these cancers drives the initiation and progression that ultimately lead to tumour development, and is therefore clinically important for the diagnosis and treatment of cancer. Nonetheless, the molecular mechanisms underlying these translocations remain unexplored, which limits our knowledge of carcinogenesis and marks this as a field where further research is required. The issue of prime focus is the precision with which chromosomes break and reunite within the genome. Characterization of genomic sequences located at the breakpoint region may lead to a thorough understanding of the mechanism behind chromosomal rearrangement. A unique computational multi-parametric analysis was performed to characterize the genomic sequence within and around the breakpoint region. This study is novel in that it reveals the occurrence of segmental duplications (SDs) flanking the breakpoints of all translocations. Breakpoint islands were also investigated for the presence of other intricate genomic architecture and various physico-chemical parameters. Our study particularly highlights the probable role of SDs and specific genomic features in precise chromosomal breakage. Additionally, it pinpoints the potential features that may be significant for double-strand breaks leading to chromosomal rearrangements.
Affiliation(s)
- Aditi Daga
- Department of Microbiology, MVM Science College, Saurashtra University, Rajkot, Gujarat, India
- Afzal Ansari
- BIT Virtual Institute of Bioinformatics (GCRI Node), GSBTM, Gandhinagar, Gujarat, India
- Rakesh Rawal
- Department of Cancer Biology, The Gujarat Cancer & Research Institute, Ahmedabad, Gujarat, India
- Valentina Umrania
- Department of Microbiology, MVM Science College, Saurashtra University, Rajkot, Gujarat, India
|
42
|
Meng X, Wang C, Rahman SU, Wang Y, Wang A, Tao S. Genome-wide identification and evolution of HECT genes in soybean. Int J Mol Sci 2015; 16:8517-35. [PMID: 25894222 PMCID: PMC4425094 DOI: 10.3390/ijms16048517] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Received: 03/09/2015] [Revised: 04/13/2015] [Accepted: 04/13/2015] [Indexed: 01/10/2023] Open
Abstract
Proteins containing domains homologous to the E6-associated protein (E6-AP) carboxyl terminus (HECT) are an important class of E3 ubiquitin ligases involved in the ubiquitin proteasome pathway. HECT-type E3s play crucial roles in plant growth and development. However, current understanding of plant HECT genes and their evolution is very limited. In this study, we performed a genome-wide analysis of the HECT domain-containing genes in soybean. Using high-quality genome sequences, we identified 19 soybean HECT genes. The predicted HECT genes were distributed unevenly across 15 of 20 chromosomes. Nineteen of these genes were inferred to be segmentally duplicated gene pairs, suggesting that in soybean, segmental duplications have made a significant contribution to the expansion of the HECT gene family. Phylogenetic analysis showed that these HECT genes can be divided into seven groups, among which gene structure and domain architecture were relatively well conserved. The Ka/Ks ratios show that after the duplication events, duplicated HECT genes underwent purifying selection. Moreover, expression analysis reveals that 15 of the HECT genes in soybean are differentially expressed in 14 tissues, and are often highly expressed in the flowers and roots. In summary, this work provides useful information on which further functional studies of soybean HECT genes can be based.
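The Ka/Ks reasoning in this abstract follows a standard convention: a ratio of nonsynonymous to synonymous substitution rates below 1 is read as purifying selection, a ratio near 1 as neutral evolution, and a ratio above 1 as positive selection. The short Python sketch below only illustrates that reading. The function name, the tolerance band around 1, and the example values are assumptions; none of them come from the paper, and estimating Ka and Ks themselves requires a codon-aware alignment tool that is outside this sketch.

```python
def selection_regime(ka, ks, tol=0.05):
    """Interpret a Ka/Ks ratio for one duplicated gene pair.

    tol is a hypothetical tolerance band around 1.0 for calling neutrality.
    """
    if ks <= 0:
        raise ValueError("Ks must be positive to form a ratio")
    ratio = ka / ks
    if ratio < 1 - tol:
        return "purifying selection"
    if ratio > 1 + tol:
        return "positive selection"
    return "neutral evolution"


# Made-up values for illustration only.
print(selection_regime(0.02, 0.20))  # ratio 0.1
print(selection_regime(0.30, 0.20))  # ratio 1.5
```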
Affiliation(s)
- Xianwen Meng
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling 712100, China.
- Bioinformatics Center, Northwest A&F University, Yangling 712100, China.
- Chen Wang
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling 712100, China.
- Bioinformatics Center, Northwest A&F University, Yangling 712100, China.
- Siddiq Ur Rahman
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling 712100, China.
- Bioinformatics Center, Northwest A&F University, Yangling 712100, China.
- Yaxu Wang
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling 712100, China.
- Bioinformatics Center, Northwest A&F University, Yangling 712100, China.
- Ailan Wang
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling 712100, China.
- Bioinformatics Center, Northwest A&F University, Yangling 712100, China.
- Shiheng Tao
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling 712100, China.
- Bioinformatics Center, Northwest A&F University, Yangling 712100, China.
|
43
|
Abstract
Genome evolution is shaped by a multitude of mutational processes, including point mutations, insertions, and deletions of DNA sequences, as well as segmental duplications. These mutational processes can leave distinctive qualitative marks in the statistical features of genomic DNA sequences. One such feature is the match length distribution (MLD) of exactly matching sequence segments within an individual genome or between the genomes of related species. These have been observed to exhibit characteristic power law decays in many species. Here, we show that simple dynamical models consisting solely of duplication and mutation processes can already explain the characteristic features of MLDs observed in genomic sequences. Surprisingly, we find that these features are largely insensitive to details of the underlying mutational processes and do not necessarily rely on the action of natural selection. Our results demonstrate how analyzing statistical features of DNA sequences can help us reveal and quantify the different mutational processes that underlie genome evolution.
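To make the notion of a match length distribution concrete, here is a small Python sketch under strong simplifying assumptions. It duplicates one random sequence, applies point substitutions to the copy, and tallies the lengths of exactly matching runs between the two aligned copies. The helper names `match_lengths` and `mutate`, the substitution rate, and the sequence length are hypothetical, and the gapless position-by-position comparison stands in for the self-alignment a real analysis would use. A single copy pair like this is expected to give a roughly geometric decay; the power-law tails described in the abstract would involve many duplicates of different ages, which this sketch does not model.

```python
import random
from collections import Counter


def match_lengths(a, b):
    """Lengths of maximal runs of identical positions in two aligned sequences."""
    runs, current = [], 0
    for x, y in zip(a, b):
        if x == y:
            current += 1
        else:
            if current:
                runs.append(current)
            current = 0
    if current:
        runs.append(current)
    return runs


def mutate(seq, rate, rng):
    """Substitute each base with probability `rate` (point mutations only)."""
    out = []
    for base in seq:
        if rng.random() < rate:
            out.append(rng.choice([b for b in "ACGT" if b != base]))
        else:
            out.append(base)
    return "".join(out)


rng = random.Random(0)  # fixed seed so the run is repeatable
original = "".join(rng.choice("ACGT") for _ in range(10_000))
duplicate = mutate(original, rate=0.05, rng=rng)

# Map each match length to the number of runs of that length.
mld = Counter(match_lengths(original, duplicate))
for length in sorted(mld)[:5]:
    print(length, mld[length])
```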
Affiliation(s)
- Florian Massip
- Department for Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany; UR1077, Unite Mathematiques Informatique et Genome, INRA, domaine de Vilvert, Jouy-en-Josas, France
- Michael Sheinman
- Department for Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
- Sophie Schbath
- UR1077, Unite Mathematiques Informatique et Genome, INRA, domaine de Vilvert, Jouy-en-Josas, France
- Peter F Arndt
- Department for Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
|
44
|
Liu D, Sun W, Yuan Y, Zhang N, Hayward A, Liu Y, Wang Y. Phylogenetic analyses provide the first insights into the evolution of OVATE family proteins in land plants. Ann Bot 2014; 113:1219-33. [PMID: 24812252 PMCID: PMC4030818 DOI: 10.1093/aob/mcu061] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Received: 11/05/2013] [Accepted: 03/07/2014] [Indexed: 05/22/2023]
Abstract
BACKGROUND AND AIMS The OVATE gene encodes a nuclear-localized regulatory protein belonging to a distinct family of plant-specific proteins known as the OVATE family proteins (OFPs). OVATE was first identified as a key regulator of fruit shape in tomato, with nonsense mutants displaying pear-shaped fruits. However, the role of OFPs in plant development has been poorly characterized. METHODS Public databases were searched and a total of 265 putative OVATE protein sequences were identified from 13 sequenced plant genomes that represent the major evolutionary lineages of land plants. A phylogenetic analysis was conducted based on the alignment of the conserved OVATE domain from these 13 selected plant genomes. The expression patterns of tomato SlOFP genes were analysed via quantitative real-time PCR. The pattern of OVATE gene duplication resulting in the expansion of the gene family was determined in arabidopsis, rice and tomato. KEY RESULTS Genes for OFPs were found to be present in all the sampled land plant genomes, including the early-diverged lineages, mosses and lycophytes. Phylogenetic analysis based on the amino acid sequences of the conserved OVATE domain defined 11 sub-groups of OFPs in angiosperms. Different evolutionary mechanisms are proposed for OVATE family evolution, namely conserved evolution and divergent expansion. Characterization of the AtOFP family in arabidopsis, the OsOFP family in rice and the SlOFP family in tomato provided further details regarding the evolutionary framework and revealed a major contribution of tandem and segmental duplications towards expansion of the OVATE gene family. CONCLUSIONS This first genome-wide survey on OFPs provides new insights into the evolution of the OVATE protein family and establishes a solid base for future functional genomics studies on this important but poorly characterized regulatory protein family in plants.
Affiliation(s)
- Di Liu
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; University of Chinese Academy of Sciences, Beijing 100049, China
- Wei Sun
- Institute of Chinese Materia Medica, Chinese Academy of Chinese Medical Science, Beijing 100700, China; Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
- Yaowu Yuan
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA
- Ning Zhang
- Department of Biology, the Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
- Alice Hayward
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
- Yongliang Liu
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; University of Chinese Academy of Sciences, Beijing 100049, China
- Ying Wang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
|
45
|
Fawcett JA, Innan H. The role of gene conversion in preserving rearrangement hotspots in the human genome. Trends Genet 2013; 29:561-8. [PMID: 23953668 DOI: 10.1016/j.tig.2013.07.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Received: 04/18/2013] [Revised: 06/20/2013] [Accepted: 07/08/2013] [Indexed: 11/27/2022]
Abstract
Hotspots of non-allelic homologous recombination (NAHR) have a crucial role in creating genetic diversity and are also associated with dozens of genomic disorders. Recent studies suggest that many human NAHR hotspots have been preserved throughout the evolution of primates. NAHR hotspots are likely to remain active as long as the segmental duplications (SDs) promoting NAHR retain sufficient similarity. Here, we propose an evolutionary model of SDs that incorporates the effect of gene conversion and compare it with a null model that assumes SDs evolve independently without gene conversion. The gene conversion model predicts a much longer lifespan of NAHR hotspots compared with the null model. We show that the literature on copy number variants (CNVs) and genomic disorders, and also the results of additional analysis of CNVs, are all more consistent with the gene conversion model.
Affiliation(s)
- Jeffrey A Fawcett
- Graduate University for Advanced Studies, Hayama, Kanagawa 240-0193, Japan
|
46
|
Xu L, Hou Y, Bickhart DM, Song J, Liu GE. Comparative Analysis of CNV Calling Algorithms: Literature Survey and a Case Study Using Bovine High-Density SNP Data. Microarrays (Basel) 2013; 2:171-85. [PMID: 27605188 DOI: 10.3390/microarrays2030171] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Received: 05/02/2013] [Revised: 06/04/2013] [Accepted: 06/05/2013] [Indexed: 11/23/2022]
Abstract
Copy number variations (CNVs) are gains and losses of genomic sequence between two individuals of a species when compared to a reference genome. The data from single nucleotide polymorphism (SNP) microarrays are now routinely used for genotyping, but they also can be utilized for copy number detection. Substantial progress has been made in array design and CNV calling algorithms and at least 10 comparison studies in humans have been published to assess them. In this review, we first survey the literature on existing microarray platforms and CNV calling algorithms. We then examine a number of CNV calling tools to evaluate their impacts using bovine high-density SNP data. Large incongruities in the results from different CNV calling tools highlight the need for standardizing array data collection, quality assessment and experimental validation. Only after careful experimental design and rigorous data filtering can the impacts of CNVs on both normal phenotypic variability and disease susceptibility be fully revealed.
|
47
|
He C, Cui K, Duan A, Zeng Y, Zhang J. Genome-wide and molecular evolution analysis of the Poplar KT/HAK/KUP potassium transporter gene family. Ecol Evol 2012; 2:1996-2004. [PMID: 22957200 PMCID: PMC3434002 DOI: 10.1002/ece3.299] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Received: 03/02/2012] [Revised: 05/12/2012] [Accepted: 05/15/2012] [Indexed: 01/03/2023] Open
Abstract
As the largest K(+) transport gene family, the KT/HAK/KUP family plays an important role in plant growth, development, and stress adaptation. However, there is limited information about this family in woody plant species. In this study, a genome-wide in-depth investigation identified 31 Poplar KT/HAK/KUP transporter genes, including six pairs of tandemly duplicated and eight pairs of segmentally duplicated paralogs, suggesting that segmental and tandem duplication events contributed to the expansion of this family in Poplar. The combination of phylogenetic, exon structure and splice site, and paralogon analysis revealed 11 pairs of Poplar KT/HAK/KUP duplicates. All 11 pairs are subject to purifying selection, and asymmetric evolutionary rates were found in three pairs. This study might provide more insights into the underlying evolutionary mechanisms of trees acclimating to their natural habitat.
Affiliation(s)
- Caiyun He
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Silviculture of the State Forestry Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing 100091, People's Republic of China
|
48
|
Vu TH, Coccaro EF, Eichler EE, Girirajan S. Genomic architecture of aggression: rare copy number variants in intermittent explosive disorder. Am J Med Genet B Neuropsychiatr Genet 2011; 156B:808-16. [PMID: 21812102 PMCID: PMC3168586 DOI: 10.1002/ajmg.b.31225] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Received: 04/26/2011] [Accepted: 07/11/2011] [Indexed: 12/29/2022]
Abstract
Copy number variants (CNVs) are known to be associated with complex neuropsychiatric disorders (e.g., schizophrenia and autism) but have not been explored in the isolated features of aggressive behaviors such as intermittent explosive disorder (IED). IED is characterized by recurrent episodes of aggression in which individuals act impulsively and grossly out of proportion to the involved stressors. Previous studies have identified genetic variants in the serotonergic pathway that play a role in susceptibility to this behavior, but additional contributors have not been identified. Therefore, to further delineate possible genetic influences, we investigated CNVs in individuals diagnosed with IED and/or personality disorder (PD). We carried out array comparative genomic hybridization on 113 samples of individuals with isolated features of IED (n = 90) or PD (n = 23). We detected a recurrent 1.35-Mbp deletion on chromosome 1q21.1 in one IED subject and a novel ∼350-kbp deletion on chromosome 16q22.3q23.1 in another IED subject. While five recent reports have suggested the involvement of an ∼1.6-Mbp 15q13.3 deletion in individuals with behavioral problems, particularly aggression, we report an absence of such events in our study of individuals specifically selected for aggression. We did, however, detect a smaller ∼430-kbp 15q13.3 duplication containing CHRNA7 in one individual with PD. While these results suggest a possible role for rare CNVs in identifying genes underlying IED or PD, further studies on a large number of well-characterized individuals are necessary.
Affiliation(s)
- Tiffany H Vu
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, USA.
- Emil F Coccaro
- Department of Psychiatry and Behavioral Neuroscience, University of Chicago, Chicago, Illinois
- Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington; Howard Hughes Medical Institute, University of Washington School of Medicine, Seattle, Washington
- Santhosh Girirajan
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington. Correspondence to: Santhosh Girirajan, MBBS, Ph.D., Department of Genome Sciences, University of Washington, Foege S-413A, Box 355065, 3720 15th Ave NE, Seattle, WA 98195. E-mail:
|
49
|
Fouvry L, Ogereau D, Berger A, Gavory F, Montchamp-Moreau C. Sequence Analysis of the Segmental Duplication Responsible for Paris Sex-Ratio Drive in Drosophila simulans. G3 (Bethesda) 2011; 1:401-10. [PMID: 22384350 DOI: 10.1534/g3.111.000315] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Received: 05/09/2011] [Accepted: 08/25/2011] [Indexed: 12/25/2022]
Abstract
Sex-ratio distorters are X-linked selfish genetic elements that facilitate their own transmission by subverting Mendelian segregation at the expense of the Y chromosome. Naturally occurring cases of sex-linked distorters have been reported in a variety of organisms, including several species of Drosophila; they trigger genetic conflict over the sex ratio, which is an important evolutionary force. However, with a few exceptions, the causal loci are unknown. Here, we molecularly characterize the segmental duplication involved in the Paris sex-ratio system that is still evolving in natural populations of Drosophila simulans. This 37.5 kb tandem duplication spans six genes, from the second intron of the Trf2 gene (TATA box binding protein-related factor 2) to the first intron of the org-1 gene (optomotor-blind-related-gene-1). Sequence analysis showed that the duplication arose through the production of an exact copy on the template chromosome itself. We estimated this event to be less than 500 years old. We also detected specific signatures of the duplication mechanism; these support the Duplication-Dependent Strand Annealing model. The region at the junction between the two duplicated segments contains several copies of an active transposable element, Hosim1, alternating with 687 bp repeats that are noncoding but transcribed. The almost-complete sequence identity between copies made it impossible to complete the sequencing and assembly of this region. These results form the basis for the functional dissection of Paris sex-ratio drive and will be valuable for future studies designed to better understand the dynamics and the evolutionary significance of sex chromosome drive.
|
50
|
Han Y, Zheng D, Vimolmangkang S, Khan MA, Beever JE, Korban SS. Integration of physical and genetic maps in apple confirms whole-genome and segmental duplications in the apple genome. J Exp Bot 2011; 62:5117-30. [PMID: 21743103 PMCID: PMC3193016 DOI: 10.1093/jxb/err215] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Indexed: 05/03/2023]
Abstract
A total of 355 simple sequence repeat (SSR) markers were developed, based on expressed sequence tag (EST) and bacterial artificial chromosome (BAC)-end sequence databases, and successfully used to construct an SSR-based genetic linkage map of the apple. The consensus linkage map spanned 1143 cM, with an average density of 2.5 cM per marker. Newly developed SSR markers along with 279 SSR markers previously published by the HiDRAS project were further used to integrate physical and genetic maps of the apple using a PCR-based BAC library screening approach. A total of 470 contigs were unambiguously anchored onto all 17 linkage groups of the apple genome, and 158 contigs contained two or more molecular markers. The genetically mapped contigs spanned ∼421 Mb in cumulative physical length, representing 60.0% of the genome. The sizes of anchored contigs ranged from 97 kb to 4.0 Mb, with an average of 995 kb. The average physical length of anchored contigs on each linkage group was ∼24.8 Mb, ranging from 17.0 Mb to 37.73 Mb. Using BAC DNA as templates, PCR screening of the BAC library amplified fragments of highly homologous sequences from homoeologous chromosomes. Upon integrating physical and genetic maps of the apple, the presence of not only homoeologous chromosome pairs, but also of multiple locus markers mapped to adjacent sites on the same chromosome was detected. These findings demonstrated the presence of both genome-wide and segmental duplications in the apple genome and provided further insights into the complex polyploid ancestral origin of the apple.
Affiliation(s)
- Yuepeng Han
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Moshan, Wuhan, 430074, PR China
- Danman Zheng
- Department of Natural Resources and Environmental Sciences, University of Illinois, 1201 W. Gregory, Urbana, IL 61801, USA
- Sornkanok Vimolmangkang
- Department of Natural Resources and Environmental Sciences, University of Illinois, 1201 W. Gregory, Urbana, IL 61801, USA
- Muhammad A. Khan
- Department of Natural Resources and Environmental Sciences, University of Illinois, 1201 W. Gregory, Urbana, IL 61801, USA
- Jonathan E. Beever
- Department of Animal Sciences, University of Illinois, 1201 W. Gregory, Urbana, IL 61801, USA
- Schuyler S. Korban
- Department of Natural Resources and Environmental Sciences, University of Illinois, 1201 W. Gregory, Urbana, IL 61801, USA
- To whom correspondence should be addressed. E-mail:
|