1
|
Othoum G, Prigent S, Derouiche A, Shi L, Bokhari A, Alamoudi S, Bougouffa S, Gao X, Hoehndorf R, Arold ST, Gojobori T, Hirt H, Lafi FF, Nielsen J, Bajic VB, Mijakovic I, Essack M. Comparative genomics study reveals Red Sea Bacillus with characteristics associated with potential microbial cell factories (MCFs). Sci Rep 2019; 9:19254. [PMID: 31848398 PMCID: PMC6917714 DOI: 10.1038/s41598-019-55726-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Accepted: 11/27/2019] [Indexed: 02/07/2023] Open
Abstract
Recent advancements in the use of microbial cells for scalable production of industrial enzymes encourage exploring new environments for efficient microbial cell factories (MCFs). Here, through a comparison study, ten newly sequenced Bacillus species, isolated from the Rabigh Harbor Lagoon on the Red Sea shoreline, were evaluated for their potential use as MCFs. Phylogenetic analysis of 40 representative genomes with phylogenetic relevance, including the ten Red Sea species, showed that the Red Sea species come from several colonization events and are not the result of a single colonization followed by speciation. Moreover, clustering reactions in reconstruct metabolic networks of these Bacillus species revealed that three metabolic clades do not fit the phylogenetic tree, a sign of convergent evolution of the metabolism of these species in response to special environmental adaptation. We further showed Red Sea strains Bacillus paralicheniformis (Bac48) and B. halosaccharovorans (Bac94) had twice as much secreted proteins than the model strain B. subtilis 168. Also, Bac94 was enriched with genes associated with the Tat and Sec protein secretion system and Bac48 has a hybrid PKS/NRPS cluster that is part of a horizontally transferred genomic region. These properties collectively hint towards the potential use of Red Sea Bacillus as efficient protein secreting microbial hosts, and that this characteristic of these strains may be a consequence of the unique ecological features of the isolation environment.
Collapse
Affiliation(s)
- G Othoum
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - S Prigent
- Department of Biology and Biological Engineering, Division of Systems & Synthetic Biology, Chalmers University of Technology, Kemivägen 10, 41296, Gothenburg, Sweden
| | - A Derouiche
- Department of Biology and Biological Engineering, Division of Systems & Synthetic Biology, Chalmers University of Technology, Kemivägen 10, 41296, Gothenburg, Sweden
| | - L Shi
- Department of Biology and Biological Engineering, Division of Systems & Synthetic Biology, Chalmers University of Technology, Kemivägen 10, 41296, Gothenburg, Sweden
| | - A Bokhari
- Biological and Environmental Sciences and Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - S Alamoudi
- Department of Biology, Science and Arts College, King Abdulaziz University, Rabigh, 21589, Kingdom of Saudi Arabia
| | - S Bougouffa
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - X Gao
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - R Hoehndorf
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - S T Arold
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - T Gojobori
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia.,Biological and Environmental Sciences and Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - H Hirt
- Biological and Environmental Sciences and Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - F F Lafi
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia.,College of Natural and Health Sciences, Zayed University, 144534, Abu-Dhabi, United Arab Emirates
| | - J Nielsen
- Department of Biology and Biological Engineering, Division of Systems & Synthetic Biology, Chalmers University of Technology, Kemivägen 10, 41296, Gothenburg, Sweden.,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark.,Science for Life Laboratory, Royal Institute of Technology, Solna, Sweden
| | - V B Bajic
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - I Mijakovic
- Department of Biology and Biological Engineering, Division of Systems & Synthetic Biology, Chalmers University of Technology, Kemivägen 10, 41296, Gothenburg, Sweden. .,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark.
| | - M Essack
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia.
| |
Collapse
|
2
|
|
3
|
Ezawa K, Ikeo K, Gojobori T, Saitou N. Evolutionary Pattern of Gene Homogenization between Primate-Specific Paralogs after Human and Macaque Speciation Using the 4-2-4 Method. Mol Biol Evol 2010; 27:2152-71. [DOI: 10.1093/molbev/msq109] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
4
|
Yoshida S, Ogura A, Ishikawa K, Yoshida A, Kohno R, Yamaji Y, Ikeo K, Gojobori T, Kono T, Ishibashi T. Gene expression profile of fibrovascular membranes from patients with proliferative diabetic retinopathy. Br J Ophthalmol 2009; 94:795-801. [DOI: 10.1136/bjo.2009.167072] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
5
|
Abstract
DDBJ (http://www.ddbj.nig.ac.jp) collected and released 1 880 115 entries or 1 134 086 245 bases in the period from July 2006 to June 2007. The released data contains the high-throughput cDNAs of cricket and high-quality draft genome of medaka among others. Our computer system has been upgraded since March 2007. Another new aspect is an efficient data retrieval tool that has recently been equipped and served at DDBJ. It is called All-round Retrieval for Sequence and Annotation, which enables the user to search for keywords also in the Feature/Qualifier of the International Nucleotide Sequence Database Collaboration (http://www.insdc.org/). We will also replace our home page with a more efficient one by the end of 2007.
Collapse
Affiliation(s)
- H Sugawara
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Yata, Mishima 411-8540, Japan
| | | | | | | | | |
Collapse
|
6
|
Abstract
Protein-protein interactions (PPIs) are one of the most important components of biological networks. It is important to understand the evolutionary process of PPIs in order to elucidate how the evolution of biological networks has contributed to diversification of the existent organisms. We focused on the evolutionary rates of proteins involved with PPIs, because it had been shown that for a given protein-coding gene the number of its PPIs in a biological network was one of the important factors in determining the evolutionary rate of the gene. We studied the evolutionary rates of duplicated gene products that were involved with PPIs, reviewing the current situation of this subject. In addition, we focused on how the evolutionary rates of proteins were influenced by the characteristic features of PPIs. We, then, concluded that the evolutionary rates of the proteins in the PPI networks were strongly influenced by their PPI partners. Finally, we emphasized that evolutionary considerations of the PPI proteins were very important for understanding the building up of the current PPI networks.
Collapse
Affiliation(s)
- T Makino
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Yata, Mishima, Japan
| | | |
Collapse
|
7
|
Choy KW, Wang CC, Ogura A, Lau TK, Rogers MS, Ikeo K, Gojobori T, Tang LY, Lam DSC, Chung TKH, Pang CP. Molecular characterization of the developmental gene in eyes: through data-mining on integrated transcriptome databases. Clin Biochem 2006; 39:224-30. [PMID: 16427038 DOI: 10.1016/j.clinbiochem.2005.12.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2005] [Revised: 11/03/2005] [Accepted: 12/13/2005] [Indexed: 11/17/2022]
Abstract
OBJECTIVES Our aim was to utilize publicly available and proprietary sources to discover candidate genes important for ocular development. DESIGN AND METHODS The collated information on our 5092 non-redundant clusters was grouped and functional annotation was conducted using gene ontology (FatiGO) for categorizing them with respect to molecular function. The web-based viewer technological platform (H-InvDB) was employed for transcription analyses of in-house high quality fetal eye Expressed Sequence Tags (ESTs). Eye-specific ESTs were also analyzed across species by using EMBEST. RESULTS According to adult eye cDNA libraries, nucleic acid binding and cell structure/cytoskeletal protein genes were the most abundant among the ESTs of fetal eyes. Using cDNA assembly in H-InvDB, 20 (80%) of the 25 most commonly expressed genes in the human eye are also expressed in extraocular tissues. The crystalline gamma S gene is highly expressed in the eye, but not in other tissues. We used EMBEST to compare human fetal eye and octopus eye ESTs and the expression similarity was low (1.6%). This indicated that our fetal eye library contains genes necessary for the developmental process and biological function of the eye, which may not be expressed in the fully developed octopus eyes. The human fetal eye cDNA library also contained highly abundant eye tissue genes, including alphaA-crystallin, eukaryotic translation elongation factor 1 alpha 1 (EEF1A1), bestrophin (VMD2), cystatin C, and transforming growth factor, beta-induced (BIGH3). CONCLUSIONS Our annotated EST set provides a valuable resource for gene discovery and functional genomic analysis. This display will help to appreciate the strengths and weaknesses of the different technological platforms, so that in future studies the maximum amount of beneficial information can be derived from the appropriate use of each method.
Collapse
Affiliation(s)
- K W Choy
- Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Prince of Wales Hospital, Shatin, New Territories, Hong Kong.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Choy KW, Wang CC, Ogura A, Lau TK, Rogers MS, Ikeo K, Gojobori T, Lam DSC, Pang CP. Genomic annotation of 15,809 ESTs identified from pooled early gestation human eyes. Physiol Genomics 2005; 25:9-15. [PMID: 16368877 DOI: 10.1152/physiolgenomics.00121.2005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
To complement cDNA libraries from the human eye at early gestation and to discover candidate genes associated with early ocular development, we used freshly dissected human eyeballs from week 9-14 of gestation to construct the early human fetal eye cDNA library. A total of 15,809 clones were isolated and sequenced from the unamplified and unnormalized library. We screened 11,246 good-quality ESTs, leading to the identification of 5,534 nonredundant clusters. Among them, 4,010 (72%) genes matched in the human protein database (Ensembl). The remaining 28% (1,524) corresponded to potentially novel or previously unidentified ESTs. We used BLASTX to compare our EST data with eight organisms and found common expression of a high portion of genes: Caenorhabditis briggsae (26%), Caenorhabditis elegans (27%), Anopheles gambiae (37%), Drosophila melanogaster (32%), Danio rerio (42%), Fugu rubripes (49%), Rattus norvegicusvalitus (52%), and Mus musculus (59%). Nevertheless, 48% (2,680 of 5,534) of the genes expressed in the early developing eye were not shared with current NEIBank human eye cDNA data. In addition, eight known retinal disease genes existed in our ESTs. Among them, six (COL11A1, BBS5, PDE6B, OAT, VMD2, and PGK1) were conserved among the genomes of other organisms, indicating that our annotated EST set provides not only a valuable resource for gene discovery and functional genomic analysis but also for phylogenetic analysis. Our foremost early gestation human eye cDNA library could provide detailed comparisons across species to identify physiological functions of genes and to elucidate evolutionary mechanisms.
Collapse
Affiliation(s)
- K W Choy
- Department of Ophthalmology, Chinese University of Hong Kong, Hong Kong
| | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C, Kodzius R, Shimokawa K, Bajic VB, Brenner SE, Batalov S, Forrest ARR, Zavolan M, Davis MJ, Wilming LG, Aidinis V, Allen JE, Ambesi-Impiombato A, Apweiler R, Aturaliya RN, Bailey TL, Bansal M, Baxter L, Beisel KW, Bersano T, Bono H, Chalk AM, Chiu KP, Choudhary V, Christoffels A, Clutterbuck DR, Crowe ML, Dalla E, Dalrymple BP, de Bono B, Della Gatta G, di Bernardo D, Down T, Engstrom P, Fagiolini M, Faulkner G, Fletcher CF, Fukushima T, Furuno M, Futaki S, Gariboldi M, Georgii-Hemming P, Gingeras TR, Gojobori T, Green RE, Gustincich S, Harbers M, Hayashi Y, Hensch TK, Hirokawa N, Hill D, Huminiecki L, Iacono M, Ikeo K, Iwama A, Ishikawa T, Jakt M, Kanapin A, Katoh M, Kawasawa Y, Kelso J, Kitamura H, Kitano H, Kollias G, Krishnan SPT, Kruger A, Kummerfeld SK, Kurochkin IV, Lareau LF, Lazarevic D, Lipovich L, Liu J, Liuni S, McWilliam S, Madan Babu M, Madera M, Marchionni L, Matsuda H, Matsuzawa S, Miki H, Mignone F, Miyake S, Morris K, Mottagui-Tabar S, Mulder N, Nakano N, Nakauchi H, Ng P, Nilsson R, Nishiguchi S, Nishikawa S, Nori F, Ohara O, Okazaki Y, Orlando V, Pang KC, Pavan WJ, Pavesi G, Pesole G, Petrovsky N, Piazza S, Reed J, Reid JF, Ring BZ, Ringwald M, Rost B, Ruan Y, Salzberg SL, Sandelin A, Schneider C, Schönbach C, Sekiguchi K, Semple CAM, Seno S, Sessa L, Sheng Y, Shibata Y, Shimada H, Shimada K, Silva D, Sinclair B, Sperling S, Stupka E, Sugiura K, Sultana R, Takenaka Y, Taki K, Tammoja K, Tan SL, Tang S, Taylor MS, Tegner J, Teichmann SA, Ueda HR, van Nimwegen E, Verardo R, Wei CL, Yagi K, Yamanishi H, Zabarovsky E, Zhu S, Zimmer A, Hide W, Bult C, Grimmond SM, Teasdale RD, Liu ET, Brusic V, Quackenbush J, Wahlestedt C, Mattick JS, Hume DA, Kai C, Sasaki D, Tomaru Y, Fukuda S, Kanamori-Katayama M, Suzuki M, Aoki J, Arakawa T, Iida J, Imamura K, Itoh M, Kato T, Kawaji H, Kawagashira N, Kawashima T, Kojima M, Kondo S, Konno H, Nakano K, Ninomiya N, Nishio T, Okada M, Plessy C, Shibata K, Shiraki T, Suzuki S, Tagami M, Waki K, Watahiki A, Okamura-Oho Y, Suzuki H, Kawai J, Hayashizaki Y. The transcriptional landscape of the mammalian genome. Science 2005; 309:1559-63. [PMID: 16141072 DOI: 10.1126/science.1112014] [Citation(s) in RCA: 2607] [Impact Index Per Article: 137.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
Collapse
|
10
|
Abstract
In the past year, we at DDBJ (DNA Data Bank of Japan; http://www.ddbj.nig.ac.jp) collected and released 1 066 084 entries or 718 072 425 bases including the whole chromosome 22 of chimpanzee, the whole-genome shotgun sequences of silkworm and various others. On the other hand, we hosted workshops for human full-length cDNA annotation and participated in jamborees of mouse full-length cDNA annotation. The annotated data are made public at DDBJ. We are also in collaboration with a RIKEN team to accept and release the CAGE (Cap Analysis Gene Expression) data under a new category, MGA (Mass Sequences for Genome Annotation). The data will be useful for studying gene expression control in many aspects.
Collapse
Affiliation(s)
- Y Tateno
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Yata, Mishima, 411-8540, Japan.
| | | | | | | | | |
Collapse
|
11
|
Abstract
The maximum likelihood estimation (MLE) is one of the most popular ways to estimate haplotype frequencies of a population with genotype data whose linkage phases are unknown. The MLE is commonly implemented in the use of the Expectation-Maximization (EM) algorithm. It is known that the EM algorithm carries the risk that an estimator may converge erroneously to one of the local maxima or saddle points of the likelihood surface, resulting in serious errors in the MLE of haplotype frequencies. In this note, by theoretical treatments we present the necessary and sufficient conditions that the local maxima or saddle points on the likelihood surface appear. As a rule of thumb, that the difference between the coupling and repulsive haplotype frequencies in phase known individuals is 3/2 times larger than the frequency of phase ambiguous individuals is the sufficient condition that the likelihood surface is unimodal. Moreover, we present the analytic solution to the biallelic two-locus problem, and construct a general algorithm to obtain the global maximum.
Collapse
Affiliation(s)
- S Mano
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan.
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Bellgard M, Ye J, Gojobori T, Appels R. The bioinformatics challenges in comparative analysis of cereal genomes-an overview. Funct Integr Genomics 2004; 4:1-11. [PMID: 14770300 DOI: 10.1007/s10142-004-0102-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2003] [Revised: 12/16/2003] [Accepted: 12/16/2003] [Indexed: 11/24/2022]
Abstract
Comparative genomic analysis is the cornerstone of in silico-based approaches to understanding biological systems and processes across cereal species, such as rice, wheat and barley, in order to identify genes of agronomic interest. The size of the genomic repositories is nearly doubling every year, and this has significant implications on the way bioinformatics analyses are carried out. In this overview the concepts and technology underpinning bioinformatics as applied to comparative genomic analysis are considered in the context of other manuscripts appearing in this issue of Functional and Integrative Genomics.
Collapse
Affiliation(s)
- M Bellgard
- Molecular Plant Breeding CRC, Murdoch University, South Street, WA 6152 Murdoch, Australia
| | | | | | | |
Collapse
|
13
|
Miyazaki S, Sugawara H, Ikeo K, Gojobori T, Tateno Y. DDBJ in the stream of various biological data. Nucleic Acids Res 2004; 32:D31-4. [PMID: 14681352 PMCID: PMC308861 DOI: 10.1093/nar/gkh127] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2003] [Revised: 10/03/2003] [Accepted: 10/23/2003] [Indexed: 11/13/2022] Open
Abstract
In the past year we at DDBJ (http://www.ddbj.nig. ac.jp) have made a steady increase in the number of data submissions with a 50.6% increment in the number of bases or 46.5% increment in the number of entries. Among them the genome data of man, ascidian and rice hold the top three. Our activity has extended to providing a tool that enables sequence retrieval using regular expressions, and to launching our SOAP server and web services to facilitate the acquisition of proper data and tools from a huge number of biological data resources on websites worldwide. We have also opened our public gene expression database, CIBEX.
Collapse
Affiliation(s)
- S Miyazaki
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Yata, Mishima 411-8540, Japan
| | | | | | | | | |
Collapse
|
14
|
Kadota M, Nishigaki R, Wang CC, Toda T, Shirayoshi Y, Inoue T, Gojobori T, Ikeo K, Rogers MS, Oshimura M. Proteomic signatures and aberrations of mouse embryonic stem cells containing a single human chromosome 21 in neuronal differentiation: An in vitro model of down syndrome. Neuroscience 2004; 129:325-35. [PMID: 15501590 DOI: 10.1016/j.neuroscience.2004.06.081] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/23/2004] [Indexed: 11/16/2022]
Abstract
Neurodegeneration in fetal development of Down syndrome (DS) patients is proposed to result in apparent neuropathological abnormalities and to contribute to the phenotypic characteristics of mental retardation and premature development of Alzheimer disease. In order to identify the aberrant and specific genes involved in the early differentiation of DS neurons, we have utilized an in vitro neuronal differentiation system of mouse ES cells containing a single human chromosome 21 (TT2F/hChr21) with TT2F parental ES cells as a control. The paired protein extracts from TT2F and TT2F/hChr21 cells at several stages of neuronal differentiation were subjected to two-dimensional polyacrylamide gel electrophoresis protein separation followed by matrix-assisted laser desorption/ionization-time of flight mass spectrometry to identify the proteins differentially expressed between TT2F and TT2F/hChr21 cells. We provide here a novel set of specific gene products altered in early differentiating DS neuronal cells, which differs from that identified in adult or fetal brain with DS. The aberrant protein expression in early differentiating neurons, due to the hChr21 gene dosage effects or chromosomal imbalance, may affect neuronal outgrowth, proliferation and differentiation, producing developmental abnormalities in neural patterning, which eventually leads to formation of a suboptimal functioning neuronal network in DS.
Collapse
Affiliation(s)
- M Kadota
- Department of Human Genome Science (Kirin Brewery), Graduate School of Medical Science, Tottori University, 86 Nishimachi, Yonago, Tottori 683-8503, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Nakamura K, Suzuki Y, Ikeo K, Ikeda Y, Sato E, Nguyen NTP, Gojobori T, Mikami T, Miyazawa T. Phylogenetic analysis of Vietnamese isolates of feline immunodeficiency virus: genetic diversity of subtype C. Arch Virol 2003; 148:783-91. [PMID: 12664300 DOI: 10.1007/s00705-002-0954-8] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Phylogenetic relationships of novel Vietnamese strains of feline immunodeficiency virus (FIV) were analysed. One Vietnamese strain was found to cluster with subtype D, which was previously known only in Japan, while the other seven strains were placed with members of subtype C. Calculation of the relative numbers of mutations resulting in amino acid and silent changes in FIV env subtypes suggested that subtype C isolates may be less structurally constrained (potentially more pathogenic) than subtype B.
Collapse
Affiliation(s)
- K Nakamura
- Department of Veterinary Microbiology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Pignatelli S, Dal Monte P, Rossini G, Chou S, Gojobori T, Hanada K, Guo JJ, Rawlinson W, Britt W, Mach M, Landini MP. Human cytomegalovirus glycoprotein N (gpUL73-gN) genomic variants: identification of a novel subgroup, geographical distribution and evidence of positive selective pressure. J Gen Virol 2003; 84:647-655. [PMID: 12604817 DOI: 10.1099/vir.0.18704-0] [Citation(s) in RCA: 82] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Human cytomegalvirus (HCMV) ORF UL73 is a polymorphic locus, encoding the viral glycoprotein gpUL73-gN, a component of the gC-II envelope complex. The previously identified gN genomic variants, denoted gN-1, gN-2, gN-3 and gN-4, were further investigated in this work by analysing a large panel of HCMV clinical isolates collected from all over the world (223 samples). Sequencing and phylogenetic analysis confirmed the existence of the four gN genotypes, but also allowed the identification of a novel subgroup belonging to the gN-3 genotype, which was designated gN-3b. The number of non-synonymous (d(N)) and synonymous (d(S)) nucleotide substitutions and their ratio (d(N)/d(S)) were estimated among the gN genotypes to evaluate the possibility of positive selection. Results showed that the four variants evolved by neutral (random) selection, but that the gN-3 and gN-4 genotypes are maintained by positive selective pressure. The 223 HCMV clinical isolates were subdivided according to their geographical origin, and four main regions of gN prevalence were identified: Europe, China, Australia and Northern America. The gN variants were found to be widespread and represented within the regions analysed without any significant difference, and no new genotype was detected. Finally, for clinical and epidemiological purposes, a rapid and low-cost method for genetic grouping of the HCMV clinical isolates was developed based on the RFLP revealed by SacI, ScaI and SalI digestion of the PCR-amplified UL73 sequence. This technique enabled us to distinguish all four gN genomic variants and also their subtypes.
Collapse
Affiliation(s)
- S Pignatelli
- Department of Clinical and Experimental Medicine, Division Microbiology - St Orsola General Hospital, University of Bologna, Via Massarenti 9, 40138 Bologna, Italy
| | - P Dal Monte
- Department of Clinical and Experimental Medicine, Division Microbiology - St Orsola General Hospital, University of Bologna, Via Massarenti 9, 40138 Bologna, Italy
| | - G Rossini
- Department of Clinical and Experimental Medicine, Division Microbiology - St Orsola General Hospital, University of Bologna, Via Massarenti 9, 40138 Bologna, Italy
| | - S Chou
- Medical and Research Services, VA Medical Center and Division of Infectious Diseases, Oregon Health Sciences University, Portland, OR, USA
| | - T Gojobori
- Center of Information Biology, National Institute of Genetics, Mishima, Japan
| | - K Hanada
- Center of Information Biology, National Institute of Genetics, Mishima, Japan
| | - J J Guo
- Department of Clinical and Experimental Medicine, Division Microbiology - St Orsola General Hospital, University of Bologna, Via Massarenti 9, 40138 Bologna, Italy
| | - W Rawlinson
- Department of Microbiology, SEALS, Prince of Wales Hospital, Randwick, NSW, Australia
| | - W Britt
- Department of Pediatrics and Microbiology, University of Alabama, Birmingham, AL, USA
| | - M Mach
- Institute of Clinical and Molecular Virology, University of Erlangen-Nurnberg, Germany
| | - M P Landini
- Department of Clinical and Experimental Medicine, Division Microbiology - St Orsola General Hospital, University of Bologna, Via Massarenti 9, 40138 Bologna, Italy
| |
Collapse
|
17
|
Abstract
The DNA Data Bank of Japan (DDBJ, http://www.ddbj.nig.ac.jp) has collected and released more entries and bases than last year. This is mainly due to large-scale submissions from Japanese sequencing teams on mouse, rice, chimpanzee, nematoda and other organisms. The contributions of DDBJ over the past year are 17.3% (entries) and 10.3% (bases) of the combined outputs of the International Nucleotide Sequence Databases (INSD). Our complete genome sequence database, Genome Information Broker (GIB), has been improved by incorporating XML. It is now possible to perform a more sophisticated database search against the new GIB than the ordinary BLAST or FASTA search.
Collapse
Affiliation(s)
- S Miyazaki
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Yata, Mishima 411-8540, Japan
| | | | | | | |
Collapse
|
18
|
Okazaki Y, Furuno M, Kasukawa T, Adachi J, Bono H, Kondo S, Nikaido I, Osato N, Saito R, Suzuki H, Yamanaka I, Kiyosawa H, Yagi K, Tomaru Y, Hasegawa Y, Nogami A, Schönbach C, Gojobori T, Baldarelli R, Hill DP, Bult C, Hume DA, Quackenbush J, Schriml LM, Kanapin A, Matsuda H, Batalov S, Beisel KW, Blake JA, Bradt D, Brusic V, Chothia C, Corbani LE, Cousins S, Dalla E, Dragani TA, Fletcher CF, Forrest A, Frazer KS, Gaasterland T, Gariboldi M, Gissi C, Godzik A, Gough J, Grimmond S, Gustincich S, Hirokawa N, Jackson IJ, Jarvis ED, Kanai A, Kawaji H, Kawasawa Y, Kedzierski RM, King BL, Konagaya A, Kurochkin IV, Lee Y, Lenhard B, Lyons PA, Maglott DR, Maltais L, Marchionni L, McKenzie L, Miki H, Nagashima T, Numata K, Okido T, Pavan WJ, Pertea G, Pesole G, Petrovsky N, Pillai R, Pontius JU, Qi D, Ramachandran S, Ravasi T, Reed JC, Reed DJ, Reid J, Ring BZ, Ringwald M, Sandelin A, Schneider C, Semple CAM, Setou M, Shimada K, Sultana R, Takenaka Y, Taylor MS, Teasdale RD, Tomita M, Verardo R, Wagner L, Wahlestedt C, Wang Y, Watanabe Y, Wells C, Wilming LG, Wynshaw-Boris A, Yanagisawa M, Yang I, Yang L, Yuan Z, Zavolan M, Zhu Y, Zimmer A, Carninci P, Hayatsu N, Hirozane-Kishikawa T, Konno H, Nakamura M, Sakazume N, Sato K, Shiraki T, Waki K, Kawai J, Aizawa K, Arakawa T, Fukuda S, Hara A, Hashizume W, Imotani K, Ishii Y, Itoh M, Kagawa I, Miyazaki A, Sakai K, Sasaki D, Shibata K, Shinagawa A, Yasunishi A, Yoshino M, Waterston R, Lander ES, Rogers J, Birney E, Hayashizaki Y. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature 2002; 420:563-73. [PMID: 12466851 DOI: 10.1038/nature01266] [Citation(s) in RCA: 1226] [Impact Index Per Article: 55.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2002] [Accepted: 10/28/2002] [Indexed: 01/10/2023]
Abstract
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
Collapse
MESH Headings
- Alternative Splicing/genetics
- Amino Acid Motifs
- Animals
- Chromosomes, Mammalian/genetics
- Cloning, Molecular
- DNA, Complementary/genetics
- Databases, Genetic
- Expressed Sequence Tags
- Genes/genetics
- Genomics/methods
- Humans
- Membrane Proteins/genetics
- Mice/genetics
- Physical Chromosome Mapping
- Protein Structure, Tertiary
- Proteome/chemistry
- Proteome/genetics
- RNA, Antisense/genetics
- RNA, Messenger/analysis
- RNA, Messenger/genetics
- RNA, Untranslated/analysis
- RNA, Untranslated/genetics
- Transcription Initiation Site
- Transcription, Genetic/genetics
Collapse
Affiliation(s)
- Y Okazaki
- [1] Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Tateno Y, Imanishi T, Miyazaki S, Fukami-Kobayashi K, Saitou N, Sugawara H, Gojobori T. DNA Data Bank of Japan (DDBJ) for genome scale research in life science. Nucleic Acids Res 2002; 30:27-30. [PMID: 11752245 PMCID: PMC99140 DOI: 10.1093/nar/30.1.27] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The DNA Data Bank of Japan (DDBJ, http://www.ddbj.nig.ac.jp) has made an effort to collect as much data as possible mainly from Japanese researchers. The increase rates of the data we collected, annotated and released to the public in the past year are 43% for the number of entries and 52% for the number of bases. The increase rates are accelerated even after the human genome was sequenced, because sequencing technology has been remarkably advanced and simplified, and research in life science has been shifted from the gene scale to the genome scale. In addition, we have developed the Genome Information Broker (GIB, http://gib.genes.nig.ac.jp) that now includes more than 50 complete microbial genome and Arabidopsis genome data. We have also developed a database of the human genome, the Human Genomics Studio (HGS, http://studio.nig.ac.jp). HGS provides one with a set of sequences being as continuous as possible in any one of the 24 chromosomes. Both GIB and HGS have been updated incorporating newly available data and retrieval tools.
Collapse
Affiliation(s)
- Y Tateno
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Yata, Mishima 411-8540, Japan
| | | | | | | | | | | | | |
Collapse
|
20
|
Ogihara Y, Isono K, Kojima T, Endo A, Hanaoka M, Shiina T, Terachi T, Utsugi S, Murata M, Mori N, Takumi S, Ikeo K, Gojobori T, Murai R, Murai K, Matsuoka Y, Ohnishi Y, Tajiri H, Tsunewaki K. Structural features of a wheat plastome as revealed by complete sequencing of chloroplast DNA. Mol Genet Genomics 2002; 266:740-6. [PMID: 11810247 DOI: 10.1007/s00438-001-0606-9] [Citation(s) in RCA: 97] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2001] [Accepted: 10/15/2001] [Indexed: 10/27/2022]
Abstract
Structural features of the wheat plastome were clarified by comparison of the complete sequence of wheat chloroplast DNA with those of rice and maize chloroplast genomes. The wheat plastome consists of a 134,545-bp circular molecule with 20,703-bp inverted repeats and the same gene content as the rice and maize plastomes. However, some structural divergence was found even in the coding regions of genes. These alterations are due to illegitimate recombination between two short direct repeats and/or replication slippage. Overall comparison of chloroplast DNAs among the three cereals indicated the presence of some hot-spot regions for length mutations. Whereas the region with clustered tRNA genes and that downstream of rbcL showed divergence in a species-specific manner, the deletion patterns of ORFs in the inverted-repeat regions and the borders between the inverted repeats and the small single-copy region support the notion that wheat and rice are related more closely to each other than to maize.
Collapse
Affiliation(s)
- Y Ogihara
- Kihara Institute for Biological Research and Graduate School of Integrated Science, Yokohama City University, Yokohama 244-0813, Japan.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Sato S, Tanaka M, Miura H, Ikeo K, Gojobori T, Takeuchi T, Yamamoto H. Functional conservation of the promoter regions of vertebrate tyrosinase genes. J Investig Dermatol Symp Proc 2001; 6:10-8. [PMID: 11764277 DOI: 10.1046/j.0022-202x.2001.00008.x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Tyrosinase is the key enzyme for synthesizing melanin pigments, which primarily determine mammalian skin coloration. Considering the important roles of pigments in the evolution and the adaptation of vertebrates, phylogenetic changes in the coding and flanking regulatory sequences of the tyrosinase gene are particularly intriguing. We have now cloned cDNA encoding tyrosinase from Japanese quail and snapping turtle. These nonmammalian cDNA are highly homologous to those of the mouse and human tyrosinases, whereas the 5' flanking sequences are far less conserved except for a few short sequence motifs. Nevertheless, we demonstrate that the 5' flanking sequences from the quail or turtle tyrosinase genes are capable of directing the expression of a fused mouse tyrosinase cDNA when introduced into cultured mouse albino melanocytes. This experimental method, which reveals the functional conservation of regulatory sequences in one cell type (the melanocyte), may be utilized to evaluate phylogenetic differences in mechanisms controlling specific gene expression in many other types of cells. We also provide evidence that the 5' flanking sequences from these nonmammalian genes are functional in vivo by producing transgenic mice. Phylogenetic changes of vertebrate tyrosinase promoters and the possible involvement of conserved sequence motifs in melanocyte-specific expression of tyrosinase are discussed.
Collapse
Affiliation(s)
- S Sato
- Biological Institute, Graduate School of Science, Tohoku University, Sendai, Miyagi, Japan
| | | | | | | | | | | | | |
Collapse
|
22
|
Abstract
To predict the amino acid sites important for the clearance of hepatitis C virus (HCV) subtype 1b in vivo, positively selected amino acid sites were detected by analyzing the sequence data collected from the international DNA databank. The rate of nonsynonymous substitutions per nonsynonymous site was compared with that of synonymous substitutions per synonymous site for each codon site in the entire coding region. As a result, 13 out of 3010 amino acid sites were found to be positively selected. Among the 13 positively selected amino acid sites, eight were located in the structural proteins and five were in the nonstructural proteins. Moreover, eight were located in B-cell epitopes and two were in T-cell epitopes. These observations suggest that both the antibody and the cytotoxic T lymphocyte are involved in the clearance of HCV subtype 1b in vivo. These positively selected amino acid sites represent candidate vaccination targets for HCV subtype 1b.
Collapse
Affiliation(s)
- Y Suzuki
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, 1111 Yata, Mishima-shi, 411-8540, Shizuoka-ken, Japan
| | | |
Collapse
|
23
|
Tsunoyama K, Bellgard MI, Gojobori T. Intragenic variation of synonymous substitution rates is caused by nonrandom mutations at methylated CpG. J Mol Evol 2001; 53:456-64. [PMID: 11675605 DOI: 10.1007/s002390010235] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2001] [Accepted: 02/28/2001] [Indexed: 11/24/2022]
Abstract
It has been observed that synonymous substitution rates vary among genes in various organisms, although the cause of the variation is unresolved. At the intragenic level, however, the variation of synonymous substitutions is somewhat controversial. By developing a rigorous statistical test and applying the test to 418 homologous gene pairs between mouse and rat, we found that more than 90% of gene pairs showed a statistical significance in intragenic variation of synonymous substitution rates. Moreover, by examining all conceivable possibilities for the cause of the variation, we successfully found that intragenic variation of synonymous substitutions in mammalian genes is caused mainly by a nonrandom mutation due to the methylation of CpG dinucleotides rather than by functional constraints.
Collapse
Affiliation(s)
- K Tsunoyama
- Department of Genetics, Graduate University for Advanced Studies, Mishima, Japan
| | | | | |
Collapse
|
24
|
Bellgard M, Schibeci D, Trifonov E, Gojobori T. Early detection of G + C differences in bacterial species inferred from the comparative analysis of the two completely sequenced Helicobacter pylori strains. J Mol Evol 2001; 53:465-8. [PMID: 11675606 DOI: 10.1007/s002390010236] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2001] [Accepted: 05/16/2001] [Indexed: 10/28/2022]
Abstract
Identifying the G + C difference between closely related bacterial species or between different strains of the same species is one of the first steps in understanding the evolutionary mechanisms accounting for the differences observed among bacterial species. The G + C content can be one of the most important factors in the evolution of genomic structures. In this paper, we describe a new method for detecting an initial stage of differentiation of the G + C content at the third codon base position between two strains of the same bacterial species. We apply this method to the two strains of Helicobacter pylori. A group of genes is detected with large variations of G + C in the third positions-apparently genes of early response to pressures of changing G + C. We discuss our findings from the viewpoint of genomic evolution.
Collapse
Affiliation(s)
- M Bellgard
- Centre for Bioinformatics and Biological Computing, School of Information Technology, Murdoch University, Murdoch, WA 6150, Australia.
| | | | | | | |
Collapse
|
25
|
Iwama H, Gojobori T. [Evolution of molecules involved with neural transmission: in the case of catecholamine receptors and G protein alpha subunits]. Tanpakushitsu Kakusan Koso 2001; 46:1586-93. [PMID: 11579554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
|
26
|
Abstract
UNLABELLED ADAPTSITE is a program package for detecting natural selection at single amino acid sites, using a multiple alignment of protein-coding sequences for a given phylogenetic tree. The program infers ancestral codons at all interior nodes, and computes the total numbers of synonymous (c(S)) and nonsynonymous (c(N)) substitutions as well as the average numbers of synonymous (s(S)) and nonsynonymous (s(N)) sites for each codon site. The probabilities of occurrence of synonymous and nonsynonymous substitutions are approximated by s(S) / (s(S) + s(N)) and s(N) / (s(S) + s(N)), respectively. The null hypothesis of selective neutrality is tested for each codon site, assuming a binomial distribution for the probability of obtaining c(S) and c(N). AVAILABILITY ADAPTSITE is available free of charge at the World-Wide Web sites http://mep.bio.psu.edu/adaptivevol.html and http://www.cib.nig.ac.jp/dda/yossuzuk/welcome.html. The package includes the source code written in C, binary files for UNIX operating systems, manual, and example files.
Collapse
Affiliation(s)
- Y Suzuki
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, 328 Mueller Laboratory, University Park, PA 16802, USA.
| | | | | |
Collapse
|
27
|
Sato T, Terabe M, Watanabe H, Gojobori T, Hori-Takemoto C. Codon and base biases after the initiation codon of the open reading frames in the Escherichia coli genome and their influence on the translation efficiency. J Biochem 2001; 129:851-60. [PMID: 11388898 DOI: 10.1093/oxfordjournals.jbchem.a002929] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Nucleotide sequences around the boundaries of all open reading frames in the Escherichia coli whole genome were analyzed. Characteristic base biases were observed after the initiation codon and before the termination codon. We examined the effect of the base sequence after the initiation codon on the translation efficiency, by introducing mutations after the initiation codon of the E. coli dihydrofolate reductase (DHFR) gene, considering codon and base biases, and using in vitro and in vivo translation systems. In both assay systems, the two most frequent second codons, AAA and AAU, enhanced the translation efficiency compared with the wild type, whereas the effects of lower frequency codons were not significant. Experiments using 16S rRNA variants with mutations in the putative complementary sequence to the region downstream of the initiation codon showed that the translation efficiency of none of the DHFR mutants was affected. These results demonstrate that the statistically most frequent sequences for the second codon enhance translation efficiency, and this effect seems to be independent of base pairing between mRNA and 16S rRNA.
Collapse
Affiliation(s)
- T Sato
- Institute for Biomolecular Science, Gakushuin University, Mejiro, Toshima-ku, Tokyo 171-8588, Japan
| | | | | | | | | |
Collapse
|
28
|
Kawai J, Shinagawa A, Shibata K, Yoshino M, Itoh M, Ishii Y, Arakawa T, Hara A, Fukunishi Y, Konno H, Adachi J, Fukuda S, Aizawa K, Izawa M, Nishi K, Kiyosawa H, Kondo S, Yamanaka I, Saito T, Okazaki Y, Gojobori T, Bono H, Kasukawa T, Saito R, Kadota K, Matsuda H, Ashburner M, Batalov S, Casavant T, Fleischmann W, Gaasterland T, Gissi C, King B, Kochiwa H, Kuehl P, Lewis S, Matsuo Y, Nikaido I, Pesole G, Quackenbush J, Schriml LM, Staubli F, Suzuki R, Tomita M, Wagner L, Washio T, Sakai K, Okido T, Furuno M, Aono H, Baldarelli R, Barsh G, Blake J, Boffelli D, Bojunga N, Carninci P, de Bonaldo MF, Brownstein MJ, Bult C, Fletcher C, Fujita M, Gariboldi M, Gustincich S, Hill D, Hofmann M, Hume DA, Kamiya M, Lee NH, Lyons P, Marchionni L, Mashima J, Mazzarelli J, Mombaerts P, Nordone P, Ring B, Ringwald M, Rodriguez I, Sakamoto N, Sasaki H, Sato K, Schönbach C, Seya T, Shibata Y, Storch KF, Suzuki H, Toyo-oka K, Wang KH, Weitz C, Whittaker C, Wilming L, Wynshaw-Boris A, Yoshida K, Hasegawa Y, Kawaji H, Kohtsuki S, Hayashizaki Y. Functional annotation of a full-length mouse cDNA collection. Nature 2001; 409:685-90. [PMID: 11217851 DOI: 10.1038/35055500] [Citation(s) in RCA: 487] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The RIKEN Mouse Gene Encyclopaedia Project, a systematic approach to determining the full coding potential of the mouse genome, involves collection and sequencing of full-length complementary DNAs and physical mapping of the corresponding genes to the mouse genome. We organized an international functional annotation meeting (FANTOM) to annotate the first 21,076 cDNAs to be analysed in this project. Here we describe the first RIKEN clone collection, which is one of the largest described for any organism. Analysis of these cDNAs extends known gene families and identifies new ones.
Collapse
Affiliation(s)
- J Kawai
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, Yokohama Institute, Kanagawa, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Abstract
Several chemokine receptors (CKRs) act as coreceptors of human immunodeficiency virus type 1 (HIV-1), type 2 (HIV-2) and simian immunodeficiency virus (SIV). These CKRs interact with the V3 domain of the envelope (env) protein of HIV/SIV. In this study, we found that the amino acid sequences of two chemokines (SDF-1beta and RANTES), whose receptors (CXCR4 and CCR5) act as major coreceptors for HIV-1, HIV-2 or SIV, showed statistically significant similarity to those of the region containing the third variable (V3) and the third conserved (C3) domains (the V3--C3 domain) of the env protein of HIV-1 and HIV-2. We made a multiple alignment of amino acid sequences for 24 chemokines and the region encompassing the second conserved (C2), V3 and C3 domains (the C2--V3--C3 region) of 10 strains of HIV/SIV. Surprisingly, the hydropathic profile and several important amino acids for protein conformation, such as cysteine and tryptophan, are remarkably conserved between chemokines and the V3--C3 region of HIV/SIV. Moreover, hydrophobic amino acids, such as leucine, isoleucine and valine, are found to be clustered both in the amino-terminal region of chemokines and the C2 domain of HIV/SIV. Thus, chemokines have significantly similar profiles of amino acid properties to those of the C2--V3--C3 region of the env protein of HIV/SIV. These findings raise a hypothesis that chemokines and the C2--V3--C3 region have a common origin. Namely, the HIV/SIV ancestor incorporated a chemokine gene into its env gene. The captured chemokine gene has rapidly diverged by frequent mutations specific to the retroviral genome, and thereby obtained the ability to interact with various CKRs in a short period of time. This paper proposes that the capture of a ligand gene of the host cells into the viral genome may be one of the important mechanisms of viral evolution to expand its host range and generate new viral species.
Collapse
Affiliation(s)
- N Shimizu
- Department of Hygiene and Virology, Gunma University School of Medicine, 3-39-22 Showa-machi, Maebashi, Gunma 371-8511, Japan
| | | |
Collapse
|
30
|
Abstract
The fish order Aulopiformes contains both synchronously hermaphroditic and gonochoristic species. From the cytogenetic viewpoint, few reports show that gonochoristic Aulopiformes have heteromorphic sex chromosomes. Because fish in this order give us a unique opportunity to elucidate the evolution of sex chromosomes, it is important to examine a phylogenetic relationship in Aulopiformes by both molecular evolutionary and cytogenetic methods. Thus, we conducted molecular phylogenetic and cytogenetic studies of six Aulopiform species. Our results suggested that hermaphroditic species were evolutionarily derived from gonochoristic species. It follows that the hermaphroditic species might have lost the heteromorphic sex chromosomes during evolution. Here, we suggest a possibility that heteromorphic sex chromosomes can disappear from the genome, even if they have appeared once in evolution. Taking into account Ohno's hypothesis that heteromorphic sex chromosomes might have emerged from autosomes, we propose the hypothesis that heteromorphic sex chromosomes may have undergone repeated events of appearance and disappearance during the course of fish evolution.
Collapse
Affiliation(s)
- K Ota
- Center for Information Biology, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | | | | | | |
Collapse
|
31
|
Toyoda R, Sato S, Ikeo K, Gojobori T, Numakunai T, Goding CR, Yamamoto H. Pigment cell-specific expression of the tyrosinase gene in ascidians has a different regulatory mechanism from vertebrates. Gene 2000; 259:159-70. [PMID: 11163973 DOI: 10.1016/s0378-1119(00)00433-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Tyrosinase is the key enzyme required for the synthesis of melanin pigments. Sequence comparison and functional analysis of the 5' upstream regions of vertebrate tyrosinase genes have revealed the importance of conserved E-box motifs in regulating their specific expression in pigment cells, optic cup-derived retinal pigment epithelium (RPE) and neural crest-derived melanocytes. In ascidians (more basal protochordates), two pigment cells that resemble vertebrate RPE cells are formed and specifically express the orthologous tyrosinase gene (HrTyr) in the cerebral vesicle located at the anterior end of the neural tube. To define regulatory sequences required for pigment cell-lineage-specific expression of HrTyr during embryogenesis, a series of mutations of the 5' upstream region of HrTyr were fused to the lacZ reporter gene and were microinjected into fertilized eggs. We found that the -152bp upstream of the translational start site is essential for expression in pigment cell precursors of tailbud-stage embryos. Further, additional positive and unique restriction elements were identified in the region up to -1.8kb. Surprisingly, in the -152bp minimal promoter or in other regions with regulatory activities, there are no E-box motifs or sequences correlating with other conserved elements regulating vertebrate tyrosinase promoters. The possibility that Pax proteins regulate HrTyr expression is also discussed.
Collapse
Affiliation(s)
- R Toyoda
- Biological Institute, Graduate School of Science, Tohoku University, Sendai 980-8578, Japan
| | | | | | | | | | | | | |
Collapse
|
32
|
Iwata A, Kobayashi T, Ikeo K, Imanishi T, Ono H, Umehara Y, Hamamatsu C, Sugiyama K, Ikeda Y, Sakamoto K, Fumihito A, Ohno S, Gojobori T. Evolutionary aspects of gobioid fishes based upon a phylogenetic analysis of mitochondrial cytochrome B genes. Gene 2000; 259:5-15. [PMID: 11163956 DOI: 10.1016/s0378-1119(00)00488-1] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The Gobioidei is a large suborder in the order Perciformes and consists of more than 2000 species belonging to about 270 genera. The vast number of species and their morphological specialization adapted to diverse habits and habitats makes the classification of the gobioid fishes very difficult.A comprehensive estimation of the evolutionary scenario of all gobioid fishes using only morphological information is difficult for two major reasons: first, in addition to wide ecological diversification, there is a trend towards specialization and degeneration of morphological characters among these species; second, an appropriate outgroup of gobioid fishes has not been recognized. Based upon nucleotide sequence comparisons of gobioid mitochondrial cytochrome b genes, we established the phylogenetic relationships of their differentiation into many groups of morphological and ecological diversity. The phylogenetic trees obtained show that most species examined have diverged from each other almost simultaneously or during an extremely short period of time.
Collapse
|
33
|
Gaudieri S, Dawkins RL, Habara K, Kulski JK, Gojobori T. SNP profile within the human major histocompatibility complex reveals an extreme and interrupted level of nucleotide diversity. Genome Res 2000; 10:1579-86. [PMID: 11042155 PMCID: PMC310975 DOI: 10.1101/gr.127200] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The human major histocompatibility complex (MHC) is characterized by polymorphic multicopy gene families, such as HLA and MIC (PERB11); duplications; insertions and deletions (indels); and uneven rates of recombination. Polymorphisms at the antigen recognition sites of the HLA class I and II genes and at associated neutral sites have been attributed to balancing selection and a hitchhiking effect, respectively. We, and others, have previously shown that nucleotide diversity between MHC haplotypes at non-HLA sites is unusually high (>10%) and up to several times greater than elsewhere in the genome (0.08%-0.2%). We report here the most extensive analysis of nucleotide diversity within a continuous sequence in the genome. We constructed a single nucleotide polymorphism (SNP) profile that reveals a pattern of extreme but interrupted levels of nucleotide diversity by comparing a continuous sequence within haplotypes in three genomic subregions of the MHC. A comparison of several haplotypes within one of the genomic subregions containing the HLA-B and -C loci suggests that positive selection is operating over the whole subgenomic region, including HLA and non-HLA genes. [The sequence data for the multiple haplotype comparisons within the class I region have been submitted to DDBJ/EMBL/GenBank under accession nos. AF029061, AF029062, and AB031005-AB031010. Additional sequence data have been submitted to the DDBJ data library under accession nos. AB031005-AB03101 and AF029061-AF029062.]
Collapse
Affiliation(s)
- S Gaudieri
- Center for Information Biology, National Institute of Genetics, Mishima, Shizuoka-ken 411-8540, Japan
| | | | | | | | | |
Collapse
|
34
|
Nakane S, Shirabe S, Moriuchi R, Mizokami A, Furuya T, Nishiura Y, Okazaki S, Yoshizuka N, Suzuki Y, Nakamura T, Katamine S, Gojobori T, Eguchi K. Comparative molecular analysis of HTLV-I proviral DNA in HTLV-I infected members of a family with a discordant HTLV-I-associated myelopathy in monozygotic twins. J Neurovirol 2000; 6:275-83. [PMID: 10951551 DOI: 10.3109/13550280009030753] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In order to elucidate the underlying mechanisms of a discordant case with HTLV-I-associated myelopathy/tropical spastic paraparesis (HAM/TSP) in monozygotic twins, we investigated HTLV-I tax sequences of 10 - 18 polymerase chain reaction-based clones each derived from peripheral blood mononuclear cells of the twins as well as their infected mother and an elder brother who also suffered from HAM/TSP. Sequence comparison revealed that three of the infected individuals including a twin with HAM/TSP shared the consensus tax sequence identical to the reference, ATK-1, but that of another healthy twin was different at five nucleotide positions including three nonsynonymous changes from ATK-1. This finding strongly suggested that different HTLV-I strains infected the monozygotic twins and the difference in infected proviral sequences determined the discordant clinical outcomes. Transfection and subsequent reporter assays failed to show a significant difference in transactivation activity on HTLV-I LTR and NF-kappaB elements between the products of the two sequences. Two HAM/TSP patients (a twin and elder brother) among three members infected with the ATK-1 type virus shared a paternal HLA allele which was absent in the healthy individual (mother). Genetic analysis of sequence variation in the tax sequences of the discordant twins showed that the Dn/Ds ratio was high in the healthy twin but low in the twin with HAM/TSP, implying the presence of more intense selection forces in the carrier. Our findings strongly suggested that a particular combination of HTLV-I strains with an HLA genotype would be a risk for HAM/TSP.
Collapse
Affiliation(s)
- S Nakane
- The First Department of Internal Medicine, Nagasaki University School of Medicine, 1-7-1 Sakamoto, Nagasaki 852-8501, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Seki M, Higashiyama Y, Mizokami A, Kadota J, Moriuchi R, Kohno S, Suzuki Y, Takahashi K, Gojobori T, Katamine S. Up-regulation of human T lymphotropic virus type 1 (HTLV-1) tax/rex mRNA in infected lung tissues. Clin Exp Immunol 2000; 120:488-98. [PMID: 10844528 PMCID: PMC1905561 DOI: 10.1046/j.1365-2249.2000.01237.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
HTLV-1 has been implicated in certain pulmonary diseases. We previously demonstrated that expression of HTLV-1 tax/rex mRNA, encoding the transcriptional transactivator Tax, was closely associated with infiltration of activated T lymphocytes into lung tissue. To explore mechanisms of tax/rex expression in the lung, tax/rex mRNA expression and proviral DNA load were compared between peripheral blood mononuclear cells (PBMC) and bronchoalveolar lavage cells (BALC) from four patients with HTLV-1-associated myelopathy (HAM/TSP) and 13 carriers with various pulmonary symptoms. Semiquantitative detection of tax/rex mRNA strongly suggested that the lung was a preferential site for its expression. Proviral DNA loads in non-HAM/TSP carriers were variable but correlated well between PBMC and BALC in each individual, and revealed no relationship with tax/rex mRNA expression. In contrast, both cell groups from four HAM/TSP patients expressed detectable tax/rex mRNA accompanied by high proviral DNA load. The ratio of tax/rex mRNA expression to proviral DNA load was higher in BALC than in PBMC in three of four carriers and in three of four HAM/TSP patients, suggesting up-regulation of tax/rex mRNA in infected lung tissue. To analyse differences in distribution of HTLV-1 quasispecies between the two tissues, phylogenetic analysis was performed for sequence sets of the proviral tax open reading frame (ORF: 1059 bp) derived from PBMC and BALC of two infected individuals. Sequences derived from the two tissues distributed similarly to branches of phylogenetic trees, and there was no evidence of selective distribution of certain quasispecies in the lung. Our results suggest the presence of tissue-specific conditions that activate viral expression in infected cells in the lung. Constitutive exposure of this tissue to foreign antigens leading to up-regulation of basal viral promoter activity is likely to be one such mechanism.
Collapse
Affiliation(s)
- M Seki
- Second Department of Internal Medicine, Nagasaki University School of Medicine, Nagasaki, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Yamaguchi-Kabata Y, Gojobori T. Reevaluation of amino acid variability of the human immunodeficiency virus type 1 gp120 envelope glycoprotein and prediction of new discontinuous epitopes. J Virol 2000; 74:4335-50. [PMID: 10756049 PMCID: PMC111951 DOI: 10.1128/jvi.74.9.4335-4350.2000] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
To elucidate the evolutionary mechanisms of the human immunodeficiency virus type 1 gp120 envelope glycoprotein at the single-site level, the degree of amino acid variation and the numbers of synonymous and nonsynonymous substitutions were examined in 186 nucleotide sequences for gp120 (subtype B). Analyses of amino acid variabilities showed that the level of variability was very different from site to site in both conserved (C1 to C5) and variable (V1 to V5) regions previously assigned. To examine the relative importance of positive and negative selection for each amino acid position, the numbers of synonymous and nonsynonymous substitutions that occurred at each codon position were estimated by taking phylogenetic relationships into account. Among the 414 codon positions examined, we identified 33 positions where nonsynonymous substitutions were significantly predominant. These positions where positive selection may be operating, which we call putative positive selection (PS) sites, were found not only in the variable loops but also in the conserved regions (C1 to C4). In particular, we found seven PS sites at the surface positions of the alpha-helix (positions 335 to 347 in the C3 region) in the opposite face for CD4 binding. Furthermore, two PS sites in the C2 region and four PS sites in the C4 region were detected in the same face of the protein. The PS sites found in the C2, C3, and C4 regions were separated in the amino acid sequence but close together in the three-dimensional structure. This observation suggests the existence of discontinuous epitopes in the protein's surface including this alpha-helix, although the antigenicity of this area has not been reported yet.
Collapse
Affiliation(s)
- Y Yamaguchi-Kabata
- Center for Information Biology, National Institute of Genetics, Mishima 411-8540, Japan
| | | |
Collapse
|
37
|
Abstract
The frequencies of each of the 257 468 complete protein coding sequences (CDSs) have been compiled from the taxonomical divisions of the GenBank DNA sequence database. The sum of the codons used by 8792 organisms has also been calculated. The data files can be obtained from the anonymous ftp sites of DDBJ, Kazusa and EBI. A list of the codon usage of genes and the sum of the codons used by each organism can be obtained through the web site http://www.kazusa.or.jp/codon/. The present study also reports recent developments on the WWW site. The new web interface provides data in the CodonFrequency-compatible format as well as in the traditional table format. The use of the database is facilitated by keyword based search analysis and the availability of codon usage tables for selected genes from each species. These new tools will provide users with the ability to further analyze for variations in codon usage among different genomes.
Collapse
Affiliation(s)
- Y Nakamura
- Laboratory of Gene Structure 2, Kazusa DNA Research Institute, 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan.
| | | | | |
Collapse
|
38
|
Abstract
We at DDBJ (http://www.ddbj.nig.ac.jp) process and publicise the massive amounts of data submitted mainly by Japanese genome projects and sequencing teams. It is emphasised that the collaboration between data producing teams and the data bank is crucial in carrying out these processes smoothly. The amount of data submitted in 1999 is so large that it alone exceeds the total amount submitted in the preceding 10 years. To cope with this situation, we have developed tools not only for processing such massive amounts of data but also for efficiently retrieving data on demand.
Collapse
Affiliation(s)
- Y Tateno
- Center for Information Biology, National Institute of Genetics, Yata, Mishima 411-8540, Japan.
| | | | | | | | | |
Collapse
|
39
|
Abstract
A method was developed for detecting the selective force at single amino acid sites given a multiple alignment of protein-coding sequences. The phylogenetic tree was reconstructed using the number of synonymous substitutions. Then, the neutrality was tested for each codon site using the numbers of synonymous and nonsynonymous changes throughout the phylogenetic tree. Computer simulation showed that this method accurately estimated the numbers of synonymous and nonsynonymous substitutions per site, as long as the substitution number on each branch was relatively small. The false-positive rate for detecting the selective force was generally low. On the other hand, the true-positive rate for detecting the selective force depended on the parameter values. Within the range of parameter values used in the simulation, the true-positive rate increased as the strength of the selective force and the total branch length (namely the total number of synonymous substitutions per site) in the phylogenetic tree increased. In particular, with the relative rate of nonsynonymous substitutions to synonymous substitutions being 5.0, most of the positively selected codon sites were correctly detected when the total branch length in the phylogenetic tree was > or = 2.5. When this method was applied to the human leukocyte antigen (HLA) gene, which included antigen recognition sites (ARSs), positive selection was detected mainly on ARSs. This finding confirmed the effectiveness of the present method with actual data. Moreover, two amino acid sites were newly identified as positively selected in non-ARSs. The three-dimensional structure of the HLA molecule indicated that these sites might be involved in antigen recognition. Positively selected amino acid sites were also identified in the envelope protein of human immunodeficiency virus and the influenza virus hemagglutinin protein. This method may be helpful for predicting functions of amino acid sites in proteins, especially in the present situation, in which sequence data are accumulating at an enormous speed.
Collapse
Affiliation(s)
- Y Suzuki
- Center for Information Biology, National Institute of Genetics, Mishima, Japan
| | | |
Collapse
|
40
|
Gaudieri S, Kulski JK, Dawkins RL, Gojobori T. Extensive nucleotide variability within a 370 kb sequence from the central region of the major histocompatibility complex. Gene 1999; 238:157-61. [PMID: 10570993 DOI: 10.1016/s0378-1119(99)00255-3] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
The recent availability of the genomic sequence spanning the central and telomeric end of the major histocompatibility complex (MHC) has allowed a detailed study of its organisation, gene content and level of nucleotide variability. Previous analyses of nucleotide variability in the MHC have focused on the coding regions of the human leukocyte antigen (HLA) Class I and II genes. Non-coding nucleotide variability has been considered a by-product of exonic diversity. However, with the advent of genomic sequencing, the extent of non-coding nucleotide variability within the MHC has just begun to be appreciated. In this study, we compared different human haplotypes in 370 kb of sequence in the central region of the MHC to show the following: 1. unusually high levels of non-coding nucleotide variability, up to 80 times greater than elsewhere in the genome; 2. non-coding nucleotide variability greater than 1% at nucleotide sites distant to the Class I genes; 3. nucleotide variability greater than 1% maintained over regions containing highly linked loci; and 4. distinct troughs and peaks in the level of nucleotide variability. We will discuss these observations in relation to a possible role of nucleotide variability in the organisation of the MHC.
Collapse
Affiliation(s)
- S Gaudieri
- Centre for Information Biology, National Institute of Genetics, Mishima, Shizuoka, Japan.
| | | | | | | |
Collapse
|
41
|
Abstract
The relationship between the overall G+C content of the genome (GC) and the GC content at the third codon positions (GC3) of genes, which we refer to as a GC3-plot, was examined using 15 currently available complete genome sequences. A remarkably linear relationship was found between these two quantities, confirming previous observations of a strong positive correlation in the GC3-plot. In order to conduct a more detailed analysis of the GC3-plot, we examined the GC3 content by separating orthologous codons into three categories: synonymously different codons (namely identical amino acids, IA), different amino acids (DA), and identical codons (IC), for a pairwise comparison of two closely related species. When we took pairwise species comparisons between Mycoplasma genitalium (Mg) and Mycoplasma pneumoniae (Mp) and between Mycobacterium tuberculosis (Mt) and Mycobacterium leprae (Ml) as examples, we found that for Mp and Ml, the GC3 for IA deviated the most from the linear expectation in the GC3-plot, whereas for Mg and Mt the deviation was minimal. These findings suggest that the major changes of GC content took place in Mp and Ml, but not in Mg and Mt. This analysis also enables us to predict the future direction of the evolutionary changes of the genomic GC content.
Collapse
Affiliation(s)
- M I Bellgard
- Center for Information Biology, National Institute of Genetics, Mishima, Japan.
| | | |
Collapse
|
42
|
Bellgard M, Gojobori T. Inferring the direction of evolutionary changes of genomic base composition: Erratum. Trends Genet 1999. [DOI: 10.1016/s0168-9525(99)01829-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
43
|
Tazaki A, Gaudieri S, Ikeo K, Gojobori T, Watanabe K, Agata K. Neural network in planarian revealed by an antibody against planarian synaptotagmin homologue. Biochem Biophys Res Commun 1999; 260:426-32. [PMID: 10403785 DOI: 10.1006/bbrc.1999.0933] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
In order to investigate the neural connection of planarian, it is imperative to produce an antibody that specifically stains axons. To identify axon-specific genes, we constructed a cDNA library from a single eye by using a single cell PCR method, in which visual neurons are major components, and sequenced one thousand independent clones. We succeeded in the identification of a planarian homologue of synaptotagmin, Djsyt, whose specific expression in neurons was confirmed by in situ hybridization. The antibody against DjSYT specifically stained axons although its mRNA is distributed in the cell bodies. By using anti-DjSYT, we succeeded in the visualization of neural connections in planarians by whole mount staining. The anti-DjSYT antibody will become a powerful tool to analyze the molecular mechanisms underlying neural network formation in planarian.
Collapse
Affiliation(s)
- A Tazaki
- Department of Life Science, Himeji Institute of Technology, Hyogo, 678-1297, Japan
| | | | | | | | | | | |
Collapse
|
44
|
Affiliation(s)
- M I Bellgard
- School of Information Technology, Murdoch University, Perth, Australia.
| | | |
Collapse
|
45
|
Sato S, Toyoda R, Katsuyama Y, Saiga H, Numakunai T, Ikeo K, Gojobori T, Yajima I, Yamamoto H. Structure and developmental expression of the ascidian TRP gene: insights into the evolution of pigment cell-specific gene expression. Dev Dyn 1999; 215:225-37. [PMID: 10398533 DOI: 10.1002/(sici)1097-0177(199907)215:3<225::aid-aja5>3.0.co;2-s] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
The tyrosinase family in vertebrates consists of three related melanogenic enzymes: tyrosinase, tyrosinase-related protein-1 (TRP-1), and TRP-2. These proteins control melanin production in pigment cells and play a crucial role in determining vertebrate coloration. We have isolated a gene from the ascidian Halocynthia roretzi which encodes a tyrosinase-related protein (HrTRP) with 45-49% identity with vertebrate TRP-1 and TRP-2. The expression of the HrTRP gene in pigment lineage a8.25 cells starts at the early-mid gastrula stage, which coincides with the stage when these cells are determined as pigment precursor cells; therefore, it provides the earliest pigment lineage-specific marker, which enables us to trace the complete cell lineage leading to two pigment cells in the larval brain. In addition, the expression pattern of the HrTRP gene appears to share similar characteristics with the mouse TRP-2 gene although structurally the HrTRP gene is more closely related to mammalian TRP-1 genes. Based on these observations and on results from molecular phylogenetic and hybridization analyses, we suggest that triplication of the tyrosinase family occurred during the early radiation of chordates. Initially, duplication of an ancestral tyrosinase gene produced a single TRP gene before the urochordate and cephalochordate-vertebrate divergence, and a subsequent duplication of the ancestral TRP gene in the vertebrate lineage gave rise to two TRP genes before the emergence of teleost fishes. Evolution of the melanin synthetic pathway and possible phylogenetic relationships among chordate pigment cells that accommodate the metabolic process are discussed. Dev Dyn 1999;215:225-237.
Collapse
Affiliation(s)
- S Sato
- Biological Institute, Graduate School of Science, Tohoku University, Miyagi, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Ren F, Hiroshi T, Okayama T, Gojobori T. The application of model-based complexity inference method to molecular evolution analysis. Stud Health Technol Inform 1999; 52 Pt 1:367-71. [PMID: 10384480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
Abstract
In this study, a new method based on the concept of complexity in inductive inference is proposed for reconstructing molecular phylogenetic tree. This method describes the complexity of molecular phylogenetic tree by three terms, which are related to tree topology, the branch lengths and fitness between the model and data measured by likelihood function. The computer simulation is used to investigate the efficiency of this method. The results suggest that this method is superior to the traditional methods because it avoids excess-complexity in the tree model estimation available from DNA sequence.
Collapse
Affiliation(s)
- F Ren
- Medical Research Institute, Tokyo Medical and Dental University, Japan
| | | | | | | |
Collapse
|
47
|
Tanaka H, Ren F, Okayama T, Gojobori T. Topology selection in unrooted molecular phylogenetic tree by minimum model-based complexity method. Pac Symp Biocomput 1999:326-37. [PMID: 10380208 DOI: 10.1142/9789814447300_0032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In reconstruction of phylogenetic trees from molecular data, it has been pointed out that multifurcate phylogenetic trees are difficult to be correctly reconstructed by the conventional methods like maximum likelihood method(ML). In order to resolve this problem, we have been engaged in developing a new phylogenetic tree reconstruction method, based on the minimum complexity principle widely used in the inductive inference. Our method, which we call "minimum model-based complexity (MBC) method", has been proved so far to be efficient in estimating multifurcate branching when the tree is described in the form of rooted one. In this study, we make further investigations about the efficiency of MBC method in estimating the multifurcation in unrooted phylogenetic trees. To do so, we conduct computer simulation in which the estimations by MBC method are compared with those by ML, AIC and statistical test approach. The results show that MBC method also provides good estimations even in the case of multifurcate unrooted trees and suggest that it could be generally used for reconstruction of phylogenetic tree having arbitrary multifurcations.
Collapse
Affiliation(s)
- H Tanaka
- Tokyo Medical and Dental University, Japan
| | | | | | | |
Collapse
|
48
|
Gaudieri S, Kulski JK, Dawkins RL, Gojobori T. Different evolutionary histories in two subgenomic regions of the major histocompatibility complex. Genome Res 1999; 9:541-9. [PMID: 10400921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
Abstract
Two subgenomic regions within the major histocompatibility complex, the alpha and beta blocks, contain members of the multicopy gene families HLA class I, human endogenous retroviral sequence (HERV-16; previously known as P5 and PERB3), hemochromatosis candidate genes (HCG) (II, IV, VIII, IX), 3.8-1, and MIC (PERB11). In this study we show that the two blocks consist of imperfect duplicated segments, which contain linked members of the different gene families. The duplication and truncation sites of the segments are associated with retroelements. The retroelement sites appear to generate the imperfect duplications, insertions/deletions, and rearrangements, most likely via homologous recombination. Although the two blocks share several characteristics, they differ in the number and orientation of the duplicated segments. On the 62.1 haplotype, the alpha block consists of at least 10 duplicated segments that predominantly contain pseudogenes and gene fragments of the HLA class I and MIC (PERB11) gene families. In contrast, the beta block has two major duplications containing the genes HLA-B and HLA-C, and MICA (PERB11.1) and MICB (PERB11.2). Given the common origin between the blocks, we reconstructed the duplication history of the segments to understand the processes involved in producing the different organization in the two blocks. We then found that the beta block contains four distinct duplications from two separate events, whereas the alpha block is characterized by multisegment duplications. We will discuss these results in relation to the genetic content of the two blocks.
Collapse
Affiliation(s)
- S Gaudieri
- Centre for Information Biology, National Institute of Genetics, Mishima, Shizuoka, 411-8540 Japan.
| | | | | | | |
Collapse
|
49
|
Hatta Y, Ohashi J, Imanishi T, Kamiyama H, Iha M, Simabukuro T, Ogawa A, Tanaka H, Akaza T, Gojobori T, Juji T, Tokunaga K. HLA genes and haplotypes in Ryukyuans suggest recent gene flow to the Okinawa Islands. Hum Biol 1999; 71:353-65. [PMID: 10380372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
Abstract
Polymorphism of HLA genes was investigated in a population sample of Ryukyuans living on the main island of Okinawa (n = 197), in the southwestern islands of Japan. Serological typing was applied to class I loci (HLA-A, -B, and -C) and to HLA-DRB1; nucleotide sequence-level typing was performed using PCR microtiter plate hybridization and PCR single-strand conformation polymorphism methods. Ryukyuans showed a higher frequency of DRB1*0405 and lower frequencies of DRB1*1502 and DRB1*1302 compared with Hondo Japanese living on main islands. Principal components and phylogenetic analyses of 12 East Asian populations, including Ryukyuans, were performed based on the allele frequencies of HLA-A, -B, and -DRB1. In the principal components analysis 3 Japanese populations (Ryukyuans, Hondo Japanese, and Ainu) formed a cluster and showed the highest affinity to 2 Korean populations. In the phylogenetic tree Ryukyuans and Ainu were neighbors, but the genetic distance between them was larger than the distances between Ryukyuans and Hondo Japanese and between Ryukyuans and Korean populations. The geographic cline of the predominant haplotype in Ryukyuans, A*24-B*54-DRB1*0405, suggests that an ancestral population possessing A*24-B*54-DRB1*0405 moved into the Okinawa Islands after the divergence of Ryukyuans from the Ainu. Such a recent gene flow, probably from South China to the Okinawa Islands, is considered the major cause of difference in genetic characteristics between Ryukyuans and the Ainu.
Collapse
Affiliation(s)
- Y Hatta
- Department of Human Genetics, Graduate School of Medicine, University of Tokyo, Japan
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Abstract
A new era in the elucidation of genome evolution has been heralded with the availability of numerous genome sequences. With these data, it has been possible to study evolutionary processes at a greater level of detail in order to characterize features such as gene shuffling, genome rearrangements, base bias composition, and horizontal gene transfer. In this paper, we discuss the evolutionary implications of significant rearrangements within genomes as well as characteristic genomic regions that have been conserved across genomes. This is based on our analysis of orthologous and paralogous genes. We argue that genome plasticity has most likely contributed substantially to the dynamic evolution of genomes. We also describe the characteristic mosaic features of an archaea genome that is comprised of both bacterial and eukaryal elements. Here we investigate base compositional differences as well as the similarity of this species' genes to either bacteria or eukarya. We conclude that these features can be largely explained by the mechanism of horizontal gene transfer. Finally, we introduce the concept of genome space which is defined as the entire set of genomes of all living organisms. We explain its usefulness to describe as well as to gain deeper insight into the general features of the dynamic genomic evolutionary process.
Collapse
Affiliation(s)
- M I Bellgard
- Center for Information Biology, National Institute of Genetics, Mishima, Japan
| | | | | | | | | |
Collapse
|